Gene Mfla_1850 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_1850 
Symbol 
ID4000977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp1987332 
End bp1988831 
Gene Length1500 bp 
Protein Length499 aa 
Translation table11 
GC content59% 
IMG OID637938766 
Productextracellular solute-binding protein 
Protein accessionYP_545958 
Protein GI91776202 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTGGC AGGTATGGCG TCTCGGCGGC CTGGCCATCA CCATGGTGGC ATCGCTGCTA 
TTGCTGGATG CCTGCAGCCG TGATAAAGCA CCGGTGCAAA TCCGCAGCGC GGTCAGCCAA
CTGCCCCAAA CCCTGGATCC GCGCTATGCC ACCGATGCCG CTTCTGCGCG GGTCAATCGG
TTGATTTATC GCTCGCTTGT GGATTTTGAC GTCCATGCAC GCCCGGCTCC CTCGTTGGCT
AGCTGGCGTG TGCTGTCTCC GCGCCTGTTT CGTTTCACCT TGGGACGGCA TGGACGAGAT
TTCCATGATG GGCGCTACCT GACTGCAGAA GATGTTGCCG CTACCTACCA GTCTGTCATC
GAGCTGGCAG ATTCGCCGCA CGCGGCCGAG TTCGCCAACA TCAGCAAGAT CGAGGTGCGC
GACGAGAATA CCCTGGATTT CTTTCTCCAC GATGCAGACA GCAACTTTCC CGCGCGCCTG
ATACTGGGAA TACTTCCCGC TGAGCTGATT GCGCGGCAGC ATGATTTTTC GAGGGAGCCG
GTAGGCAGCG GACCGTTCAA GTTCGTTTCC TGGCAGCGTT CGCTCAAGCT TGAGCGCCGT
TCCGACAAGA AGCGCTTCGT TTTCGAGGAG GTTCGGGATC CGACGGTGCG TGTGCTCAAG
CTGGTGCGCG GTGAAGCGGA TATCCTGCAG GGCGACTTGC CGCCGGAGCT GGTGAACTAC
CTGAAGCAGC AGCCCGGCAT TGATGTACAG GAGTCCGAAG GGTCCAATTT TGCCTATATC
GGCCTGAACC TGCAGGATGA TGTCCTGCGG AACCCCCTGG TGCGTCAGGC CTTGGCCCAT
GCGATCGACC GGGAATCCAT CATCAAGCAT GCGCTGGTTG GGCACAGCCG ACTTGCGGAA
AGCATTCTCC CGCCAGAGCA TTGGGCTGGC AACGGAAACC TGAAACCGTA CGAATATAAT
CCGGTGCTGG CACGTGAGCT GTTGTTGCGT GCCGGGGTCA GCCTGCCCTT GAAGCTGGTC
TACAAGACCT CTACCGATGC CCAGCGGGTC AGGCTGGCTA CCATGATGCA GGCGCAGATG
CGCGATGCCG GCATCCAGCT CGAGATCCGC AGCCTGGACT GGGGCACATT CTTCGACGAC
ATCAAGCGCG GCAATTTCCA GCTATACGGA CTGATGTGGG TGGGGATCAA GACACCTGAG
ATCTACCATC ATGCGTTCCA CTCCTCCTCG GTACCGCCCA AGGGTGCCAA TCGTGGACAT
TATGCCGATC CGAAAACGGA CGAGTTGATT GCCGTGAGCG ATTGGGCCAG CGTCGCCGAA
CGTGTACATG CACAATTGCC GTATATCCCG CTCTGGTATG AGGGGCAGTT TGCTGCCACC
CGGGATCATA TCAAAGGCTA CTCCTTGCAG CCGGACGGGA GCTGGGATGG CCTGGTGGAT
GTCAAGCGCA GCAAGCTGAA GTCCGTCTTT CCGGGCAGGA ACCAGTCGGC AGGAATGTAG
 
Protein sequence
MTWQVWRLGG LAITMVASLL LLDACSRDKA PVQIRSAVSQ LPQTLDPRYA TDAASARVNR 
LIYRSLVDFD VHARPAPSLA SWRVLSPRLF RFTLGRHGRD FHDGRYLTAE DVAATYQSVI
ELADSPHAAE FANISKIEVR DENTLDFFLH DADSNFPARL ILGILPAELI ARQHDFSREP
VGSGPFKFVS WQRSLKLERR SDKKRFVFEE VRDPTVRVLK LVRGEADILQ GDLPPELVNY
LKQQPGIDVQ ESEGSNFAYI GLNLQDDVLR NPLVRQALAH AIDRESIIKH ALVGHSRLAE
SILPPEHWAG NGNLKPYEYN PVLARELLLR AGVSLPLKLV YKTSTDAQRV RLATMMQAQM
RDAGIQLEIR SLDWGTFFDD IKRGNFQLYG LMWVGIKTPE IYHHAFHSSS VPPKGANRGH
YADPKTDELI AVSDWASVAE RVHAQLPYIP LWYEGQFAAT RDHIKGYSLQ PDGSWDGLVD
VKRSKLKSVF PGRNQSAGM