Gene TM1040_0746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0746 
Symbol 
ID4076155 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp803081 
End bp804277 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content61% 
IMG OID638006043 
ProductNADH dehydrogenase subunit E 
Protein accessionYP_612741 
Protein GI99080587 
COG category[C] Energy production and conversion
[S] Function unknown 
COG ID[COG1905] NADH:ubiquinone oxidoreductase 24 kD subunit
[COG3743] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01958] NADH-quinone oxidoreductase, E subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.331264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCGTC GTCTTCATCA CGAACAGCCT GACAGCTTTG CGTTCACCCC GGCCAATCAG 
GCCTGGGCCG AAGCTCAGAT GACCAAATAC CCCGAAGGCC GTCAGGCCTC GGCGGTCATT
CCGATTCTGT GGCGCGCTCA GGAGCAAGAG GGCTGGATTT CCAAACCCGC GATCGAATAT
GTGGCCGATA TGCTCGGCAT GGCCTACATC CGCGTGCTGG AAGTGGCCTC TTTCTATTTC
ATGTTCCAGC TGCAGCCCAC GGGTTCGGTT GCGCATATCC AGATCTGTGG CACCACGTCC
TGCATGATCT GTGGTGCCGA GGATCTGGTC GCGATCTGCA AGGACAAGAT CTCTGCCAAG
CCGCATACGC TGTCCGAGGA CGGCAAGTTC TCTTGGGAAG AGGTAGAATG CCTTGGCTCC
TGCGCCAATG CGCCGATGGC GCAGATCGGC AAGGATTATT ACGAGGATCT GACCGCGGCA
TCTTTTACCA AGCTGCTCGA TGATCTGGCG GCGGGTAAAC CCGTCGTACC CGGCCCGCAA
AACGGTCGCT ACGCTGCAGA GCCAAAGGCG GGCCTGACCT CGCTCACCGA ATATGAAGCA
GGCAAGCCGC AGTATAATGC CTCGGCGGAG CTTGCGACCG AAATCGGTGA CGGTGTGAAG
CGTATTCAGG GCGATGAAGT TCCGCTCCTG ACCCCATGGG TCGGCAAGGA TGGCGTGGTT
GCAGGGCGTG CCGCTGCAGA TCCGACGCCG CCCGCGCCAG AGCGTCCGCA ACCCGCGGCC
AAGCAGGCTG AGACCGCCAA GAAGAAGGCT CCGGCCAAGC CTGCGGTCAA GAAATCGGAT
GCGGCAACGC CTGCACAGCC CGAAGCCGCC GCCGCAAAGG TGACGGAACC CAAGGCGGAC
TTGGAAGAAC AAGCACCCGA GACGCTGACA GCGGCGCGCG AGGGCGGGGC GGACGATCTC
AAGCTCCTCA AAGGTGTGGG GCCAAAGCTC GAACAGACGC TCAATGAGCT GGGCTTTTTC
CACTTTGACC AGATTGCCAA ATGGACCGAG GCCGAGGTGG CCTGGGTGGA TGCGCGCCTG
AAGTTCAAAG GCCGCATCGA GCGCGACGGC TGGATCGAGC AAGCCAAGCA ACTGGCAGCC
GGTGAAGAAA CCGAGTTTGC CAAATCGGCC AAGAAAGACG GCCGCTACAA AGACTAA
 
Protein sequence
MLRRLHHEQP DSFAFTPANQ AWAEAQMTKY PEGRQASAVI PILWRAQEQE GWISKPAIEY 
VADMLGMAYI RVLEVASFYF MFQLQPTGSV AHIQICGTTS CMICGAEDLV AICKDKISAK
PHTLSEDGKF SWEEVECLGS CANAPMAQIG KDYYEDLTAA SFTKLLDDLA AGKPVVPGPQ
NGRYAAEPKA GLTSLTEYEA GKPQYNASAE LATEIGDGVK RIQGDEVPLL TPWVGKDGVV
AGRAAADPTP PAPERPQPAA KQAETAKKKA PAKPAVKKSD AATPAQPEAA AAKVTEPKAD
LEEQAPETLT AAREGGADDL KLLKGVGPKL EQTLNELGFF HFDQIAKWTE AEVAWVDARL
KFKGRIERDG WIEQAKQLAA GEETEFAKSA KKDGRYKD