Gene Mfla_2348 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMfla_2348 
Symbol 
ID4001444 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacillus flagellatus KT 
KingdomBacteria 
Replicon accessionNC_007947 
Strand
Start bp2501114 
End bp2502205 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content60% 
IMG OID637939275 
Productextracellular solute-binding protein 
Protein accessionYP_546456 
Protein GI91776700 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.585172 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCC CTCGCGTACT CCGACTCATT GCCCTCACCC TACTGGTATC CGTTGCCGTC 
CCGGCGCAGG CAGAGACAGT GCTGCGCGTC TTCATCGGCG GCGCGGCGCA ACGACCTGAC
CTGTTCCGAG CGCTTGCTGA TCGCTATGAA GCCAGCCATC CGGGTGTCCG CATCGAAATC
GGCAGCGGCG CTGCCACTTC AGAGCTGCAA CGTAAATACC TTTCCGTCCT GCTCAATGCG
CACGACCCCA GTTTTGACGC CTTGATGCTG GATATCGTCC ATCCCTACCA ATTCGCCACC
GCTGGCTGGA TCGCCCCGCT AGACCCGTAT TTCGGCGAGG AGAGGCAGAC GCTGCTCGCC
GACGGTTTGC CCATCTACCG CCAGACCAAC CTGATCAAGG GCAAGCTCTA CACCTTGCCC
GCCGTGACCG ATGCCATGTT CATGTACTAT CGCAAGGACC TGCTGGCGCA ACACGGCATT
GCACCACCGC AAACCTGGGA CGAGCTGGCA AATGCCGCGC AGACCATCCT GAAGCAGGAG
AACAATCCTG CGCTGCAAGG GCTTTCCGTC CAGGGAGCGC CGATCGAAGG TACGGTGTGC
AGCTTCCTGT TGCCTTACTG GAGCCAGGGC AAGGACATCC TCGACAGCAA CGGCAAATTG
GCCTTGGACA AACCTGCGGC GTTGCGCAGC CTGCAACTAT GGAAAGGGCT GATCGACAAG
AATGTGATCC GCCGCCACGT AGCCGAGGTC AAGACCGGCG ACACCGTCAA CACGTTTAAG
GCTGGCAATG CGATCTTCGC CATCAACTGG GGCTTTGCCT GGGGCGCGTT CCAGAACGAT
ACGGATTCCC GCGTCAAGGG TAAGGTCGGT GTCATCCGTA TACCGGCGGT GCAAGGAGGC
GAGCATGCGA CCTGCCTGGG AGGATGGCAG TGGGCGCTCT CCAACTATTC GCGCAACAAG
GCGCAAACGG CGGATTTCTT GCGCTTCCTA GCCTCGCCTG AAAGCGTGCG CTTCATTACT
TTGCAAGGCG CATTGTTGCC GCCTTACCTG CCGCTTTATG ACGATGCGGA AGTCCAAGCC
GTGATCCCCT GA
 
Protein sequence
MSFPRVLRLI ALTLLVSVAV PAQAETVLRV FIGGAAQRPD LFRALADRYE ASHPGVRIEI 
GSGAATSELQ RKYLSVLLNA HDPSFDALML DIVHPYQFAT AGWIAPLDPY FGEERQTLLA
DGLPIYRQTN LIKGKLYTLP AVTDAMFMYY RKDLLAQHGI APPQTWDELA NAAQTILKQE
NNPALQGLSV QGAPIEGTVC SFLLPYWSQG KDILDSNGKL ALDKPAALRS LQLWKGLIDK
NVIRRHVAEV KTGDTVNTFK AGNAIFAINW GFAWGAFQND TDSRVKGKVG VIRIPAVQGG
EHATCLGGWQ WALSNYSRNK AQTADFLRFL ASPESVRFIT LQGALLPPYL PLYDDAEVQA
VIP