Gene Mfla_2058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_2058 
Symbol 
ID3999854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp2199239 
End bp2200492 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content57% 
IMG OID637938977 
ProductNADH dehydrogenase subunit D 
Protein accessionYP_546166 
Protein GI91776410 
COG category[C] Energy production and conversion 
COG ID[COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 
TIGRFAM ID[TIGR01962] NADH dehydrogenase I, D subunit 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGGAAA TCAGGAATTA CACGATGAAT TTTGGCCCTC AGCACCCCGC AGCGCACGGT 
GTGCTGCGTC TGGTGCTGGA GCTGGATGGC GAAGTGATTC AGCGTGCCGA TCCCCATATC
GGGCTATTGC ACCGCGGGAC GGAAAAGCTC GCCGAGAACC GCACCTATCT ACAGTCTGTG
CCCTATATGG ATCGTCTTGA TTATGTGTCC ATGATGGTCA ATGAGCACGC GTACGTCATG
GCCATCGAGA AATTACTCGG GCTGGAAGTG CCGTTGCGTG CACAATACAT CCGCGTGATG
TTCGACGAGA TTACCCGCAT CCTCAACCAC TTGCTGTGGC TGGGGGCGCA TGCGCTGGAT
GTGGGCGCCA TGACGGTGTT CCTCTATGCG TTCCGGGAGC GTGAGGACCT GTTCGATTGT
TATGAAGCCG TATCCGGCGC TCGCATGCAC GCGGCTTACT ATCGGCCCGG CGGCGTCTAT
CGTGACTTGC CGGACCGTAT GCCGCAGTAT GAAGAGTCAA CGGTGCGCAG CAAGGAGGAT
GTCAAGCAGC TCAATGAGAA CCGTCAGGGG TCGCTGCTGG ACTTCATCGA GGATTTCACC
AATCGGTTTC CAGCCTATGT CGACGAGTAC GAAACCCTGC TTACCGACAA CCGTATCTGG
AAGCAGCGTA CCGTAGGTAT TGGCGTGGTG TCTCCAGAGC GCGCAATGGC CCTGGGGATG
ACCGGACCGA TGCTGCGCGG CAGTGGCGTT GCCTGGGATT TGCGCAAGAA GCAGCCCTAC
GAGGTGTATG ACCGGCTCGA TTTCGATATT CCGATCGGCG TGAATGGCGA CTGTTATGAC
CGCTACCTGG TGCGCATCGA GGAGTTCCGT CAGTCCAATC GCATCATCAA GCAGTGCATC
GACTGGCTAC GCAAGAATCC CGGACCGGTG ATCAGCGACA ATACCAAGGT TGCACCGCCA
CCACGTGAAG AGATGAAGCA TGACATGGAG GCCTTGATCC ATCACTTCAA GCTGTTTACG
GAAGGATTCC ATGTTCCGGC AGGCGAAGCC TATGCCGCAG TCGAGCACCC CAAGGGGGAG
TTCGGCATTT ACCTGATATC CGATGGGGCC AACAAGCCTT ACCGCCTCAA GATACGGGCG
CCGGGTTTTG CCCACTTGGC GGCATTGGAT GAAATGACGC GTGGCCACAT GATTGCCGAC
CTGGTGGCGA TTATCGGTAC GCAGGATATT GTATTCGGAG AAATTGACCG ATGA
 
Protein sequence
MAEIRNYTMN FGPQHPAAHG VLRLVLELDG EVIQRADPHI GLLHRGTEKL AENRTYLQSV 
PYMDRLDYVS MMVNEHAYVM AIEKLLGLEV PLRAQYIRVM FDEITRILNH LLWLGAHALD
VGAMTVFLYA FREREDLFDC YEAVSGARMH AAYYRPGGVY RDLPDRMPQY EESTVRSKED
VKQLNENRQG SLLDFIEDFT NRFPAYVDEY ETLLTDNRIW KQRTVGIGVV SPERAMALGM
TGPMLRGSGV AWDLRKKQPY EVYDRLDFDI PIGVNGDCYD RYLVRIEEFR QSNRIIKQCI
DWLRKNPGPV ISDNTKVAPP PREEMKHDME ALIHHFKLFT EGFHVPAGEA YAAVEHPKGE
FGIYLISDGA NKPYRLKIRA PGFAHLAALD EMTRGHMIAD LVAIIGTQDI VFGEIDR