Gene Dole_1981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDole_1981 
Symbol 
ID5694821 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfococcus oleovorans Hxd3 
KingdomBacteria 
Replicon accessionNC_009943 
Strand
Start bp2396865 
End bp2398679 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content56% 
IMG OID641264579 
Productribosomal protein S1 
Protein accessionYP_001529862 
Protein GI158521992 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0539] Ribosomal protein S1 
TIGRFAM ID[TIGR00717] ribosomal protein S1 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000204813 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTAGACA ACAACACAAC GCAGGATGCA GACAACATCG AAAACGAAGA AAGCGGCATG 
GAAAGCATGG AAGAACTCAT GGCCATGTAT GACCACACCT TCAAGCAGTT CACCGAGGGT
GAGGTGGTCG TGGGCAAGGT CATTGCCGTG GACAAGGACT ATGTGCTGGT GGACATCGGC
TACAAGTCCG AAGGACAGAT CAACATCAAC GAGTTCAAGA ACGAAAAGGG CGAGGTCAAC
GTCAAGATCT ATGATGATGT TGAAGTGATG ATTGAGTTCT GGGACGAGGA ACATGAAAAG
CTCGTTCTGT CCAAGGACAA GGCCGAAAAG GACAAGATCT GGGAGACCAT CAAGGACACC
TTTGACAATG ACGGCGTGGT CAGCGGCGTG ATCATCAATC GCGTCAAAGG TGGTTTTTCC
GTGGATGTGG GGCTTCAGGC ATTTTTGCCG GGTTCCCAGG CCGACCTTCG GCCCATCCGC
AACTTTGACG AGATGGTGGG CCAGACCTAT GATTTCAAGG TCCTCAAGTA CAATCGCCGG
CGCAACAACA TCGTGCTGTC CCGCCGGGTA CTGCTGGAAG AGACCCTGGC CACCAAGCGG
GCTGAACTGA TGGAGACCCT GGCCGATGGC CAGGTGGTCG AAGGCATTGT GAAAAACATC
ACCGAGTACG GTGTGTTCGT GGACCTGGGC GGCATCGACG GCCTGCTGCA CATCACCGAT
ATTTCCTGGG GCCGGGTGCG CCACCCCTCC GAGCTGTTTA CTGTGGGCGA CAACGTGACC
CTGAAGGTGC TCTCTTTTGA TCTGGACAAG AAAAAGATAT CCCTGGGCAT GAAGCAGCTG
ACCCCGGATC CCTGGGAGTC GGCGGCCGAG AAGTACCCGG TGGGTGTCAA GACAACCGGC
AAGGTTGTGA GCATCACCAA CTACGGCATT TTCGTGGAAC TGGAAGAGGG CGTCGAGGGC
CTGATTCACG TGTCTGAGAT ATCCTGGACC CGCAAGATCC GCCACCCCTC CAAGGTGGTC
AACATCGGGG AAGAGGTGGA TGCCGTGGTG CTGGACATTC AGCCCGGCAA CCGGCGTATC
TCCCTGGGCA TGAAGCAGGT GGAGCCCAAT CCATGGGAGG TGATCAGCGA GAAGTACCCC
GTGGGCACGG TGATCGAAGG CCGCATCAAG AACATCACCG ATTTTGGTCT GTTCATCGGC
ATTGATGATG ATATTGACGG CTTGGTCCAT ATTTCCGACA TCTCCTGGAC CAAGCGGATC
AAGCATCCTT CTGAGATATA CAAGAAGGGG GACCTGGTGC GGGCCGTTGT GCTGGAGATC
GACAAGGCCA ACGAGCGTTT TTCCATGGGC ATCAAGCAGC TTGAGTCTGA TCCCTGGGAG
AGTGTCCGGG ACCGCTACCC CATCGGCACC AAGGTGTCGG GCGTTGTGAC CAATGTTACT
GACTTCGGCA TTTTTGTGGA GCTGGAAGAG GGTATTGAGG GGCTGGTGCA CGTATCTGAG
GTCAGCACCG AGCGGATCAA AACACCGGTG GGTCAGTACG AAGTGGGCGC CGAGCTGACC
GCCCGGGTGA TGAACGTCAA TGCCGACGAA CGCAAGATCG GCCTTTCCAT CAAGCGGCTC
GACGTGGAAG ATGACAAGAC CCTGCTCAAG GACTACGTGG ACAACTCCAG GGGCACCACC
TCCGCCTTTG GTGAGCTGCT GCGGGAAAAC CTTCAGAACG GCTTTACCCT TCAGGGGAAT
CAGGATGCTG CCGAAAAAGA GGCCGCAACA TCCGACAAAG ACGACGAGGA CGAACCGTCG
GACACCGACG CGTAA
 
Protein sequence
MVDNNTTQDA DNIENEESGM ESMEELMAMY DHTFKQFTEG EVVVGKVIAV DKDYVLVDIG 
YKSEGQININ EFKNEKGEVN VKIYDDVEVM IEFWDEEHEK LVLSKDKAEK DKIWETIKDT
FDNDGVVSGV IINRVKGGFS VDVGLQAFLP GSQADLRPIR NFDEMVGQTY DFKVLKYNRR
RNNIVLSRRV LLEETLATKR AELMETLADG QVVEGIVKNI TEYGVFVDLG GIDGLLHITD
ISWGRVRHPS ELFTVGDNVT LKVLSFDLDK KKISLGMKQL TPDPWESAAE KYPVGVKTTG
KVVSITNYGI FVELEEGVEG LIHVSEISWT RKIRHPSKVV NIGEEVDAVV LDIQPGNRRI
SLGMKQVEPN PWEVISEKYP VGTVIEGRIK NITDFGLFIG IDDDIDGLVH ISDISWTKRI
KHPSEIYKKG DLVRAVVLEI DKANERFSMG IKQLESDPWE SVRDRYPIGT KVSGVVTNVT
DFGIFVELEE GIEGLVHVSE VSTERIKTPV GQYEVGAELT ARVMNVNADE RKIGLSIKRL
DVEDDKTLLK DYVDNSRGTT SAFGELLREN LQNGFTLQGN QDAAEKEAAT SDKDDEDEPS
DTDA