Gene EcSMS35_4413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4413 
SymbolbtuB 
ID6146182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4509463 
End bp4511307 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content50% 
IMG OID641619234 
Productvitamin B12/cobalamin outer membrane transporter 
Protein accessionYP_001746358 
Protein GI170682641 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID[TIGR01779] TonB-dependent vitamin B12 receptor 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000903793 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.00898611 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTAAAA AAGCTTCGCT GCTGACGGCG TGTTCTGTCA CAGCCTTTTC TGCTTGGGCA 
CAGGATACCA GCCCGGATAC TCTCGTCGTT ACTGCTAACC GTTTTGAACA GCCGCGCAGC
ACTGTGCTTG CACCAACCAC CGTTGTGACC CGTCAGGATA TCGACCGCTG GCAGTCGACC
TCGGTCAATG ATGTGCTGCG CCGTCTTCCG GGCGTCGATA TCACCCAAAA CGGCGGTTCA
GGTCAGCTCT CATCTATTTT TATTCGCGGT ACAAATGCCA GTCATGTGCT GGTGTTAATT
GATGGCGTCC GCCTGAATCT GGCGGGAGTG AGTGGTTCTG CCGACCTTAG CCAGTTCCCT
ATTGCGCTTG TCCAGCGTAT TGAATATATC CGTGGACCGC GCTCCGCTGT TTATGGTTCC
GATGCAATAG GCGGGGTGGT GAATATCATC ACGACGCGCG ATGAACCCGG AACGGAAATT
TCAGCAGGGT GGGGAAGCAA TAGTTATCAG AACTATGATG TCTCCACGCA GCAACAACTG
GGGGATAAGA CACGGGTAAC GCTGTTGGGC GATTATGCCC ATACTCATGG TTATGATGTT
GTTGCCTATG GTAATACCGG AACGCAAGCG CAGACAGATA ACGATGGTTT TTTAAGTAAA
ACGCTTTATG GCGCGCTGGA GCATAACTTT ACTGATGCCT GGAGCGGCTT TGTGCGCGGC
TATGGCTATG ATAACCGTAC CAATTATGAC GCGTATTATT CTCCCGGCTC ACCGTTGCTC
GATACCCGTA AACTCTATAG CCAAAGTTGG GACGCCGGGC TGCGCTATAA CGGTGAACTG
ATTAAATCAC AACTCATTAC CAGCTATAGC CATAGCAAAG ATTACAACTA CGATCCCCAT
TATGGTCGTT ATGATTCGTC CGCGACGCTG GATGAGATGA AGCAGTACAC CGTCCAGTGG
GCAAATAACG TTACCGTTGG CCACGGTAAT ATTGGTGCGG GCGTTGACTG GCAGAAACAG
ACTACGACGC CGGGTACTGG TTATGTTGAG GATGGGTATG ATCAACGTAA CACTGGTATC
TATCTGACCG GATTGCAACA AGTCGGCGAT TTTACATTTG AAGGCGCAGC CCGCAGCGAC
GATAACTCAC AGTTTGGTCG TCATGGAACC TGGCAAACGA GTGCCGGTTG GGAATTCATC
GAAGGCTATC GTTTTATCGC TTCTTATGGC ACGTCGTACA AGGCACCTAA TCTTGGGCAA
CTGTATGGTT TCTACGGAAA TCCGAATCTG AATCCTGAGA AAAGCAAACA GTGGGAAGGC
GCGTTTGAAG GCTTAACCGC TGGGGTGAAC TGGCGTATTT CCGGATATCG TAACGATGTC
AGTGATTTGA TCGATTATGA TGATCACACC CTGAAATATT ACAACGAAGG GAAAGCGCGG
ATTAAGGGGG TCGAGGCGAC CGCCAATTTT GATACCGGAC CACTGACGCA TACTGTGAGT
TATGATTATG TCGATGCGCG TAATGCGATT ACCGACGTGC CGTTGTTACG CCGTGCTAAA
CAGCAGGTGA AATACCAGCT CGACTGGCAA TTGTATGACT TCGACTGGGG TATTACTTAT
CAGTATTTAG GCACTCGCTA TGATAAGGAT TACTCATCTT ATCCTTATCA AACCGTTAAA
ATGGGCGGTG TGAGCTTGTG GGATCTTGCG GTTGCGTATC CGGTCACCTC TCATCTGACA
GTTCGTGGTA AAATAGCCAA CCTGTTCGAC AAAGATTATG AGACAGTCTA TGGCTACCAA
ACTGCAGGAC GGGAATACAC CTTGTCTGGC AGCTACACCT TCTGA
 
Protein sequence
MIKKASLLTA CSVTAFSAWA QDTSPDTLVV TANRFEQPRS TVLAPTTVVT RQDIDRWQST 
SVNDVLRRLP GVDITQNGGS GQLSSIFIRG TNASHVLVLI DGVRLNLAGV SGSADLSQFP
IALVQRIEYI RGPRSAVYGS DAIGGVVNII TTRDEPGTEI SAGWGSNSYQ NYDVSTQQQL
GDKTRVTLLG DYAHTHGYDV VAYGNTGTQA QTDNDGFLSK TLYGALEHNF TDAWSGFVRG
YGYDNRTNYD AYYSPGSPLL DTRKLYSQSW DAGLRYNGEL IKSQLITSYS HSKDYNYDPH
YGRYDSSATL DEMKQYTVQW ANNVTVGHGN IGAGVDWQKQ TTTPGTGYVE DGYDQRNTGI
YLTGLQQVGD FTFEGAARSD DNSQFGRHGT WQTSAGWEFI EGYRFIASYG TSYKAPNLGQ
LYGFYGNPNL NPEKSKQWEG AFEGLTAGVN WRISGYRNDV SDLIDYDDHT LKYYNEGKAR
IKGVEATANF DTGPLTHTVS YDYVDARNAI TDVPLLRRAK QQVKYQLDWQ LYDFDWGITY
QYLGTRYDKD YSSYPYQTVK MGGVSLWDLA VAYPVTSHLT VRGKIANLFD KDYETVYGYQ
TAGREYTLSG SYTF