Gene TM1040_0748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0748 
Symbol 
ID4076157 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp804573 
End bp805871 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content61% 
IMG OID638006045 
ProductNADH dehydrogenase I subunit F 
Protein accessionYP_612743 
Protein GI99080589 
COG category[C] Energy production and conversion 
COG ID[COG1894] NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit 
TIGRFAM ID[TIGR01959] NADH-quinone oxidoreductase, F subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.147388 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAAGG ATCAGGACCG GATCTTTACC AACCTTTACG GGATGCACGA ACGCACGTTG 
GCGGGCGCAC AAAAGCGCGG CCACTGGGAC GGTACGGCGG GCCTCATTGA AAAAGGGCGC
GACTGGATCA TCCAGACCAT GAAGGATTCC GGCCTGCGCG GGCGTGGCGG TGCGGGCTTC
CCCACCGGCC TCAAATGGTC CTTCATGCCC AAGGAAAGCG ACGGGCGTCC CGCCTATCTG
GTCATCAATG CTGATGAGTC CGAGCCCGGC ACCTGCAAAG ACCGCGAAAT CATGCGTCAC
GATCCGCATA CGCTGATCGA GGGCGCGCTG ATCGCTTCCT TCGCGATGAA CGCGCACACC
TGCTACATCT ATCTGCGCGG CGAATATATC CGCGAGCGCG AGGCGCTGCA GGCCGCCATC
GACGAATGCT ACGACAAGGG TCTTCTGGGC AAGAACGCTG CAGGCTCGGG CTGGGATTTC
GATCTTTTCC TGCACCACGG GGCAGGGGCT TATATCTGCG GCGAGGAAAC CGCCCTGATC
GAGAGCCTTG AGGGCAAAAA AGGCATGCCG CGCATGAAGC CGCCATTCCC GGCAGGCGCG
GGGCTTTATG GCTGCCCGAC CACGGTGAAC AATGTCGAAT CCATCGCCGT GGTGCCCACC
ATCCTGCGGC GCGGTGCGGA GTGGTTTGCC GGCTTTGGCC GTCAGAACAA CGCGGGCACC
AAGCTTTTTG CGATCTCCGG TCACGTCAAC AACCCCTGCG TTGTTGAAGA GGCCATGTCG
ATCAGCTTTG AAGAGCTGAT TGAAAAACAC TGCGGTGGCA TTCGCGGCGG CTGGGACAAT
CTTCTGGCGG TGATCCCAGG CGGCTCTTCA GTGCCCTGTG TGCGCGGCGA GAAGATGCGC
GATGCGATCA TGGATTTTGA TTACCTGCGC GGCGAGTTGG GCTCTGGCCT TGGCACTGCG
GCGGTGATCG TGATGGACAA GCAGACCGAT ATCGTCAAAG CGATCTGGCG CCTCTCGAAG
TTCTACAAGC ACGAAAGCTG CGGCCAGTGC ACGCCCTGCC GTGAAGGCAC CGGCTGGATG
ATGCGCGTGA TGGATCGTCT GGTGCGCGGT GAGGCGGAGC TTGAAGAGAT CGACATGCTC
TGGGATGTCA CCAAGCAGGT CGAAGGCCAC ACCATCTGTG CACTGGGCGA TGCGGCCGCA
TGGCCCATTC AGGGTCTCAT TCGCAACTTC CGTGAAGAGA TCGAAGATCG CATCAAGGCG
CAGAAATCTG GCCGTATGGG CGCGATGGCA GCGGAATAA
 
Protein sequence
MLKDQDRIFT NLYGMHERTL AGAQKRGHWD GTAGLIEKGR DWIIQTMKDS GLRGRGGAGF 
PTGLKWSFMP KESDGRPAYL VINADESEPG TCKDREIMRH DPHTLIEGAL IASFAMNAHT
CYIYLRGEYI REREALQAAI DECYDKGLLG KNAAGSGWDF DLFLHHGAGA YICGEETALI
ESLEGKKGMP RMKPPFPAGA GLYGCPTTVN NVESIAVVPT ILRRGAEWFA GFGRQNNAGT
KLFAISGHVN NPCVVEEAMS ISFEELIEKH CGGIRGGWDN LLAVIPGGSS VPCVRGEKMR
DAIMDFDYLR GELGSGLGTA AVIVMDKQTD IVKAIWRLSK FYKHESCGQC TPCREGTGWM
MRVMDRLVRG EAELEEIDML WDVTKQVEGH TICALGDAAA WPIQGLIRNF REEIEDRIKA
QKSGRMGAMA AE