Gene Mfla_0639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_0639 
Symbol 
ID4000706 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp667783 
End bp668991 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content56% 
IMG OID637937539 
Productglobin 
Protein accessionYP_544750 
Protein GI91774994 
COG category[C] Energy production and conversion 
COG ID[COG1017] Hemoglobin-like flavoprotein
[COG1018] Flavodoxin reductases (ferredoxin-NADPH reductases) family 1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTTCTG CCACAGCCCG TCCCTATATT GAAGCTAGTG TTCCCGTACT GAGAGAGCAC 
GGCCTGAAAA TCACACAAAC CTTCTATCAC AATATGCTCA GCGCCCATCC TGAGCTCAAG
AACCTGTTCA ATATGGGCAA CCAGGCCAAC GGCATCCAGC AGCAGTCCTT GGCTGCAGCC
GTATTCGCGT ATGCCGCCAA CATCGACAAT CCCGCAGCGC TGGCGCCCGT CGTCAAACGC
ATCGTCCACA AGCATGCTGC CGTCGGCATC AAACCAGAAC ACTACCCCAT CGTAGGCCAA
TACCTGATCG GCGCCATCAA GCAAGTATTG GGAGAGGCCG CCACCGATGA GCTGCTGGCT
GCATGGGGCG AGGCCTATTG GTCATTGGCC AATCTCCTGA TCGAGGAGGA AAAGTCGTTA
TATGCCAGCA CAGCCAGCAC GGCTGGCGCC CTGATGACGC TCAAGGTCGC GCGCAAGGTA
TTTGAGTCGG AACACATAGT CTCGTTTTAC CTGCAGCACC CAGACGGGCG TAAACCCGGA
GATTTTCTGC CAGGCCAATA CATCAGCGTA GAGATGCAAT TCGAGCAAAA CAAAAGGCGC
CAGCTCAGGC AATACAGCCT CTCGGACTCA AGCACGGCTC CGTGGTGGCG TATCTCGGTG
AAGCGCGAGC AGGAAGCAGA AAAACCGGAG GGCCTGCTTT CCAACTGGCT GCATGACGAG
GTCAAGGTCG GCGACTTAAT TCACGCCACA CCGGCATTTG GCGACTTTGT TCTGGATCCC
GTCATCAGCG ACCAGCCATT GCTGCTTATC TCAGCTGGAA TCGGCATCAC CCCCATGCTG
TCCATGTTGA ATTCAGTGCG CGACCTCGCC CCGCAGCGCC CAGTCTTCTT CGCGCATGCG
GCTCGCAGCC CTGCATGGCA AAGCCATAGC CTGGATCTGC AAGAGGCCAG GTATCGTATG
ACCAATCTGC AAACGGCTTT ATTCTATGAA AGCCTGGAAG GTGCACATGG CAGCATAGGG
GGAGTGTATC AAGGACGCAT GGATATCACT GACTTATGGC CGGCGGTCCT GCATCATGCC
GATATCTACC TGTGTGGCCC ACTAGGCTTT ATGCAAGCAC AACGCGAGCA GTTATTGCAA
GCAGGGGTGG CGGCAAGCCA TATCAAGCGG GAAGTATTCG GGCCAGATCT TCTGGACCAC
CTGCTGTAG
 
Protein sequence
MLSATARPYI EASVPVLREH GLKITQTFYH NMLSAHPELK NLFNMGNQAN GIQQQSLAAA 
VFAYAANIDN PAALAPVVKR IVHKHAAVGI KPEHYPIVGQ YLIGAIKQVL GEAATDELLA
AWGEAYWSLA NLLIEEEKSL YASTASTAGA LMTLKVARKV FESEHIVSFY LQHPDGRKPG
DFLPGQYISV EMQFEQNKRR QLRQYSLSDS STAPWWRISV KREQEAEKPE GLLSNWLHDE
VKVGDLIHAT PAFGDFVLDP VISDQPLLLI SAGIGITPML SMLNSVRDLA PQRPVFFAHA
ARSPAWQSHS LDLQEARYRM TNLQTALFYE SLEGAHGSIG GVYQGRMDIT DLWPAVLHHA
DIYLCGPLGF MQAQREQLLQ AGVAASHIKR EVFGPDLLDH LL