Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4200 |
Symbol | btuB |
ID | 5593602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4193305 |
End bp | 4195149 |
Gene Length | 1845 bp |
Protein Length | 614 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640923303 |
Product | vitamin B12/cobalamin outer membrane transporter |
Protein accession | YP_001460761 |
Protein GI | 157163443 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG4206] Outer membrane cobalamin receptor protein |
TIGRFAM ID | [TIGR01779] TonB-dependent vitamin B12 receptor |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0000101627 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTAAAA AAGCTTCGCT GCTGACGGCG TGTTCTGTCA CAGCCTTTTC CGCTTGGGCA CAGGATACCA GCCCGGATAC TCTCGTCGTT ACTGCTAACC GTTTTGAACA GCCGCGCAGC ACTGTGCTTG CACCAACCAC CGTTGTGACC CGTCAGGATA TCGACCGCTG GCAGTCGACC TCGGTCAATG ATGTGCTGCG CCGTCTTCCG GGCGTCGATA TCACCCAAAA CGGCGGTTCA GGTCAGCTCT CATCTATTTT TATTCGCGGT ACAAATGCCA GTCATGTGTT GGTGTTAATT GATGGCGTAC GCCTGAATCT GGCGGGGGTG AGTGGTTCTG CCGACCTTAG CCAGTTCCCT ATTGCGCTTG TCCAGCGTGT TGAATATATC CGTGGGCCGC GCTCCGCTGT TTATGGTTCC GATGCAATAG GCGGGGTGGT GAATATCATC ACGACGCGCG ATGAACCCGG AACGGAAATT TCAGCAGGGT GGGGAAGCAA TAGTTATCAG AACTATGATG TCTCTACGCA GCAACAACTG GGGGATAAGA CACGGGTAAC GCTGTTGGGC GATTATGCCC ATACTCATGG TTATGATGTT GTTGCCTATG GTAATACCGG AACGCAAGCG CAGACAGATA ACGATGGTTT TTTAAGTAAA ACGCTTTATG GCGCGCTGGA GCATAACTTT ACTGATGCCT GGAGCGGCTT TGTGCGCGGC TATGGCTATG ATAACCGTAC CAATTATGAC GCGTATTATT CTCCCGGTTC ACCGTTGCTC GATACCCGTA AACTCTATAG CCAAAGTTGG GACGCCGGGC TGCGCTATAA CGGCGAACTG ATTAAATCAC AACTCATTAC CAGCTATAGC CATAGCAAAG ATTACAACTA CGATCCCCAT TTTGGTCGTT ATGATTCGTC GGCGACGCTC GATGAGATGA AGCAATACAC CGTCCAGTGG GCAAACAATG TCATCGTTGG TCACGGTAGT ATTGGTGCGG GTGTCGACTG GCAGAAACAG ACTACGACGC CGGGTACAGG TTATGTTGAG AATGGATATG ATCAACGTAA TACCGGCATC TATCTGACCG GGCTGCAACA AGTCGGCGAT TTTACCTTTG AAGGCGCAGC ACGCAGTGAC GATAACTCAC AGTTTGGTCG TCATGGAACC TGGCAAACCA GCGCCGGTTG GGAATTCATC GAAGGTTATC GCTTCATTGC TTCCTACGGG ACATCTTATA AGGCACCAAA TCTGGGGCAA CTGTATGGCT TCTACGGAAA TCCGAATCTG GACCCGGAGA AAAGCAAACA GTGGGAAGGC GCGTTTGAAG GCTTAACCGC TGGGGTGAAC TGGCGTATTT CCGGATATCG TAACGATGTC AGTGACTTGA TCGATTATGA TGATCACACC CTGAAATATT ACAACGAAGG GAAAGCGCGG ATTAAGGGCG TCGAGGCGAC CGCCAATTTT GATACCGGAC CACTGACGCA TACTGTGAGT TATGATTATG TCGATGCGCG CAATGCGATT ACCGACACGC CGTTGTTACG CCGTGCTAAA CAGCAGGTGA AATACCAGCT CGACTGGCAG TTGTATGACT TCGACTGGGG TATTACTTAT CAGTATTTAG GCACTCGCTA TGATAAGGAT TACTCATCTT ATCCTTATCA AACCGTTAAA ATGGGCGGTG TGAGCTTGTG GGATCTTGCG GTTGCGTATC CGGTCACCTC TCACCTGACA GTTCGTGGTA AAATAGCCAA CCTGTTCGAC AAAGATTATG AGACAGTCTA TGGCTACCAA ACTGCAGGAC GGGAATACAC CTTGTCTGGC AGCTACACCT TCTGA
|
Protein sequence | MIKKASLLTA CSVTAFSAWA QDTSPDTLVV TANRFEQPRS TVLAPTTVVT RQDIDRWQST SVNDVLRRLP GVDITQNGGS GQLSSIFIRG TNASHVLVLI DGVRLNLAGV SGSADLSQFP IALVQRVEYI RGPRSAVYGS DAIGGVVNII TTRDEPGTEI SAGWGSNSYQ NYDVSTQQQL GDKTRVTLLG DYAHTHGYDV VAYGNTGTQA QTDNDGFLSK TLYGALEHNF TDAWSGFVRG YGYDNRTNYD AYYSPGSPLL DTRKLYSQSW DAGLRYNGEL IKSQLITSYS HSKDYNYDPH FGRYDSSATL DEMKQYTVQW ANNVIVGHGS IGAGVDWQKQ TTTPGTGYVE NGYDQRNTGI YLTGLQQVGD FTFEGAARSD DNSQFGRHGT WQTSAGWEFI EGYRFIASYG TSYKAPNLGQ LYGFYGNPNL DPEKSKQWEG AFEGLTAGVN WRISGYRNDV SDLIDYDDHT LKYYNEGKAR IKGVEATANF DTGPLTHTVS YDYVDARNAI TDTPLLRRAK QQVKYQLDWQ LYDFDWGITY QYLGTRYDKD YSSYPYQTVK MGGVSLWDLA VAYPVTSHLT VRGKIANLFD KDYETVYGYQ TAGREYTLSG SYTF
|
| |