Gene Mfla_1798 details


Gene Information

Locus tag: Mfla_1798
Symbol:
ID: 4000545
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 1938793
End bp: 1939719
Gene Length: 927 bp
Protein Length: 308 aa
Translation table: 11
GC content: 62%
IMG OID: 637938711
Product: twin-arginine translocation pathway signal
Protein accession: YP_545906
Protein GI: 91776150
COG category: [C] Energy production and conversion
COG ID: [COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases)
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
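
The listed gene and protein lengths follow from the start and end coordinates. A minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein (both assumptions are consistent with the values in this record, but the record does not state them):

```python
# Values copied from the record above.
start_bp = 1938793
end_bp = 1939719

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "927 308"
```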


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.116127
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTATCCC GACGCAGTTT CCTCTCAGGT CTTGCAGCGG CTAGCGCAGG CATTGTCCTC 
GCCCGCCATG TATTTGCCAA TCCGTTATCC CCAACCCAAT CGGGGATCAT GCAAACTCGC
GTTGTTCCGT CCAGCGGCGA GCGTTTGCCC GTGATCGGCA TGGGTAGTTC CGGCAGCTTC
GAAGTTGGCA ATAGCGCCGC CGAACTCGAC CCCTTGCGTG AAGTACTGCG GCGGTTCTTC
GCGGGCGGCG CCACTGTGAT CGATACCGCC CCCTCCTATG GCACAGCAGA GAAAGTCATC
GGGCAATTGC TTGAAGAGCT GGGACTGCGC TCCAGCGCCT TCCTCGCCAC CAAGATCGGC
ACTTCTGGCC GTGAGGCCGG GCTGGCGCAG TTCCAGGATT CGCTCAAGCG GTTGCGCACG
GACAAGGTGG AGCTGCTTCA GGTGCACAAC CTGCGGGACT GGCGTACCCA GTTCGAAGTG
ATCAAGGAAC TCAAGGCCCA GGGCAAGACC CGCTACACCG GGCTCACCCA TTATCTGGAC
AGCAGTCATG ACGAGCTTGC CGAGGTAGTG CGCAAGGTGA AGCCGGACTT CCTGCAGGTG
AATTACTCCG TCGTCTCGCG CAACGCAGAG CAAACAGTCT TCCCAGTGGC GCGGGAGCTA
GGCGTGGCGG TACTGGTCAA CCGCGCTTTT GAGGACGGAC GCCTGTTTTC CAGGGTGCAG
GGCAAAGCGC TACCGCCATG GGCCGCCGAA GTCGGGATTA CCTCATGGGC GCAAGCTTTC
CTCAGGTTTG CCCTGAGCCA CCCTGCCGTC ACCACCGTAA TCCCTGCCAC CGGCAAGCCG
GAGCGCCAGA GCGACAACCT CAAGGCTGGC AGCGGGCCCA TCCTGACCGA AGCGCAGCGG
CAATCCCTGA TCGACACGGT CGGCTGA
 
Protein sequence
MLSRRSFLSG LAAASAGIVL ARHVFANPLS PTQSGIMQTR VVPSSGERLP VIGMGSSGSF 
EVGNSAAELD PLREVLRRFF AGGATVIDTA PSYGTAEKVI GQLLEELGLR SSAFLATKIG
TSGREAGLAQ FQDSLKRLRT DKVELLQVHN LRDWRTQFEV IKELKAQGKT RYTGLTHYLD
SSHDELAEVV RKVKPDFLQV NYSVVSRNAE QTVFPVAREL GVAVLVNRAF EDGRLFSRVQ
GKALPPWAAE VGITSWAQAF LRFALSHPAV TTVIPATGKP ERQSDNLKAG SGPILTEAQR
QSLIDTVG
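
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following Python sketch is illustrative only: the helper names (clean, gc_content, translate) are hypothetical and not part of this record. It uses the codon assignments of the standard genetic code, which translation table 11 shares (table 11 differs only in the permitted start codons), and it assumes the sequence is given in coding orientation starting at the first base of the start codon.

```python
# Codon assignments of the standard genetic code. NCBI translation table 11
# shares these assignments and differs only in the permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(sequence):
    """Remove the spaces and line breaks used to group residues on this page."""
    return "".join(sequence.split()).upper()


def gc_content(dna):
    """Return the percentage of G and C bases in a DNA sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence as displayed above.
first_line = "ATGCTATCCC GACGCAGTTT CCTCTCAGGT CTTGCAGCGG CTAGCGCAGG CATTGTCCTC"
last_line = "CAATCCCTGA TCGACACGGT CGGCTGA"

print(translate(first_line))  # prints "MLSRRSFLSGLAAASAGIVL"
print(translate(last_line))   # prints "QSLIDTVG"
```

Both results match the corresponding start and end of the protein sequence shown above. Passing the complete gene sequence to gc_content gives a percentage that can be compared with the GC content field of the record.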