Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4413 |
Symbol | btuB |
ID | 6146182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4509463 |
End bp | 4511307 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641619234 |
Product | vitamin B12/cobalamin outer membrane transporter |
Protein accession | YP_001746358 |
Protein GI | 170682641 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | [TIGR01779] TonB-dependent vitamin B12 receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000903793 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 0.00898611 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTAAAA AAGCTTCGCT GCTGACGGCG TGTTCTGTCA CAGCCTTTTC TGCTTGGGCA CAGGATACCA GCCCGGATAC TCTCGTCGTT ACTGCTAACC GTTTTGAACA GCCGCGCAGC ACTGTGCTTG CACCAACCAC CGTTGTGACC CGTCAGGATA TCGACCGCTG GCAGTCGACC TCGGTCAATG ATGTGCTGCG CCGTCTTCCG GGCGTCGATA TCACCCAAAA CGGCGGTTCA GGTCAGCTCT CATCTATTTT TATTCGCGGT ACAAATGCCA GTCATGTGCT GGTGTTAATT GATGGCGTCC GCCTGAATCT GGCGGGAGTG AGTGGTTCTG CCGACCTTAG CCAGTTCCCT ATTGCGCTTG TCCAGCGTAT TGAATATATC CGTGGACCGC GCTCCGCTGT TTATGGTTCC GATGCAATAG GCGGGGTGGT GAATATCATC ACGACGCGCG ATGAACCCGG AACGGAAATT TCAGCAGGGT GGGGAAGCAA TAGTTATCAG AACTATGATG TCTCCACGCA GCAACAACTG GGGGATAAGA CACGGGTAAC GCTGTTGGGC GATTATGCCC ATACTCATGG TTATGATGTT GTTGCCTATG GTAATACCGG AACGCAAGCG CAGACAGATA ACGATGGTTT TTTAAGTAAA ACGCTTTATG GCGCGCTGGA GCATAACTTT ACTGATGCCT GGAGCGGCTT TGTGCGCGGC TATGGCTATG ATAACCGTAC CAATTATGAC GCGTATTATT CTCCCGGCTC ACCGTTGCTC GATACCCGTA AACTCTATAG CCAAAGTTGG GACGCCGGGC TGCGCTATAA CGGTGAACTG ATTAAATCAC AACTCATTAC CAGCTATAGC CATAGCAAAG ATTACAACTA CGATCCCCAT TATGGTCGTT ATGATTCGTC CGCGACGCTG GATGAGATGA AGCAGTACAC CGTCCAGTGG GCAAATAACG TTACCGTTGG CCACGGTAAT ATTGGTGCGG GCGTTGACTG GCAGAAACAG ACTACGACGC CGGGTACTGG TTATGTTGAG GATGGGTATG ATCAACGTAA CACTGGTATC TATCTGACCG GATTGCAACA AGTCGGCGAT TTTACATTTG AAGGCGCAGC CCGCAGCGAC GATAACTCAC AGTTTGGTCG TCATGGAACC TGGCAAACGA GTGCCGGTTG GGAATTCATC GAAGGCTATC GTTTTATCGC TTCTTATGGC ACGTCGTACA AGGCACCTAA TCTTGGGCAA CTGTATGGTT TCTACGGAAA TCCGAATCTG AATCCTGAGA AAAGCAAACA GTGGGAAGGC GCGTTTGAAG GCTTAACCGC TGGGGTGAAC TGGCGTATTT CCGGATATCG TAACGATGTC AGTGATTTGA TCGATTATGA TGATCACACC CTGAAATATT ACAACGAAGG GAAAGCGCGG ATTAAGGGGG TCGAGGCGAC CGCCAATTTT GATACCGGAC CACTGACGCA TACTGTGAGT TATGATTATG TCGATGCGCG TAATGCGATT ACCGACGTGC CGTTGTTACG CCGTGCTAAA CAGCAGGTGA AATACCAGCT CGACTGGCAA TTGTATGACT TCGACTGGGG TATTACTTAT CAGTATTTAG GCACTCGCTA TGATAAGGAT TACTCATCTT ATCCTTATCA AACCGTTAAA ATGGGCGGTG TGAGCTTGTG GGATCTTGCG GTTGCGTATC CGGTCACCTC TCATCTGACA GTTCGTGGTA AAATAGCCAA CCTGTTCGAC AAAGATTATG AGACAGTCTA TGGCTACCAA ACTGCAGGAC GGGAATACAC CTTGTCTGGC AGCTACACCT TCTGA
|
Protein sequence | MIKKASLLTA CSVTAFSAWA QDTSPDTLVV TANRFEQPRS TVLAPTTVVT RQDIDRWQST SVNDVLRRLP GVDITQNGGS GQLSSIFIRG TNASHVLVLI DGVRLNLAGV SGSADLSQFP IALVQRIEYI RGPRSAVYGS DAIGGVVNII TTRDEPGTEI SAGWGSNSYQ NYDVSTQQQL GDKTRVTLLG DYAHTHGYDV VAYGNTGTQA QTDNDGFLSK TLYGALEHNF TDAWSGFVRG YGYDNRTNYD AYYSPGSPLL DTRKLYSQSW DAGLRYNGEL IKSQLITSYS HSKDYNYDPH YGRYDSSATL DEMKQYTVQW ANNVTVGHGN IGAGVDWQKQ TTTPGTGYVE DGYDQRNTGI YLTGLQQVGD FTFEGAARSD DNSQFGRHGT WQTSAGWEFI EGYRFIASYG TSYKAPNLGQ LYGFYGNPNL NPEKSKQWEG AFEGLTAGVN WRISGYRNDV SDLIDYDDHT LKYYNEGKAR IKGVEATANF DTGPLTHTVS YDYVDARNAI TDVPLLRRAK QQVKYQLDWQ LYDFDWGITY QYLGTRYDKD YSSYPYQTVK MGGVSLWDLA VAYPVTSHLT VRGKIANLFD KDYETVYGYQ TAGREYTLSG SYTF
|
| |