Gene EcHS_A4200 details

Gene Information

Locus tag: EcHS_A4200
Symbol: btuB
ID: 5593602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4193305
End bp: 4195149
Gene Length: 1845 bp
Protein Length: 614 aa
Translation table: 11
GC content: 50%
IMG OID: 640923303
Product: vitamin B12/cobalamin outer membrane transporter
Protein accession: YP_001460761
Protein GI: 157163443
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4206] Outer membrane cobalamin receptor protein
TIGRFAM ID: [TIGR01779] TonB-dependent vitamin B12 receptor
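
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4193305
end_bp = 4195149
gene_length_bp = 1845
protein_length_aa = 614


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(start_bp, end_bp))            # compare with Gene Length
    print(expected_protein_length(gene_length_bp))  # compare with Protein Length
```

Under these assumptions the two computed values agree with the Gene Length and Protein Length fields.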


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.0000101627
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTAAAA AAGCTTCGCT GCTGACGGCG TGTTCTGTCA CAGCCTTTTC CGCTTGGGCA 
CAGGATACCA GCCCGGATAC TCTCGTCGTT ACTGCTAACC GTTTTGAACA GCCGCGCAGC
ACTGTGCTTG CACCAACCAC CGTTGTGACC CGTCAGGATA TCGACCGCTG GCAGTCGACC
TCGGTCAATG ATGTGCTGCG CCGTCTTCCG GGCGTCGATA TCACCCAAAA CGGCGGTTCA
GGTCAGCTCT CATCTATTTT TATTCGCGGT ACAAATGCCA GTCATGTGTT GGTGTTAATT
GATGGCGTAC GCCTGAATCT GGCGGGGGTG AGTGGTTCTG CCGACCTTAG CCAGTTCCCT
ATTGCGCTTG TCCAGCGTGT TGAATATATC CGTGGGCCGC GCTCCGCTGT TTATGGTTCC
GATGCAATAG GCGGGGTGGT GAATATCATC ACGACGCGCG ATGAACCCGG AACGGAAATT
TCAGCAGGGT GGGGAAGCAA TAGTTATCAG AACTATGATG TCTCTACGCA GCAACAACTG
GGGGATAAGA CACGGGTAAC GCTGTTGGGC GATTATGCCC ATACTCATGG TTATGATGTT
GTTGCCTATG GTAATACCGG AACGCAAGCG CAGACAGATA ACGATGGTTT TTTAAGTAAA
ACGCTTTATG GCGCGCTGGA GCATAACTTT ACTGATGCCT GGAGCGGCTT TGTGCGCGGC
TATGGCTATG ATAACCGTAC CAATTATGAC GCGTATTATT CTCCCGGTTC ACCGTTGCTC
GATACCCGTA AACTCTATAG CCAAAGTTGG GACGCCGGGC TGCGCTATAA CGGCGAACTG
ATTAAATCAC AACTCATTAC CAGCTATAGC CATAGCAAAG ATTACAACTA CGATCCCCAT
TTTGGTCGTT ATGATTCGTC GGCGACGCTC GATGAGATGA AGCAATACAC CGTCCAGTGG
GCAAACAATG TCATCGTTGG TCACGGTAGT ATTGGTGCGG GTGTCGACTG GCAGAAACAG
ACTACGACGC CGGGTACAGG TTATGTTGAG AATGGATATG ATCAACGTAA TACCGGCATC
TATCTGACCG GGCTGCAACA AGTCGGCGAT TTTACCTTTG AAGGCGCAGC ACGCAGTGAC
GATAACTCAC AGTTTGGTCG TCATGGAACC TGGCAAACCA GCGCCGGTTG GGAATTCATC
GAAGGTTATC GCTTCATTGC TTCCTACGGG ACATCTTATA AGGCACCAAA TCTGGGGCAA
CTGTATGGCT TCTACGGAAA TCCGAATCTG GACCCGGAGA AAAGCAAACA GTGGGAAGGC
GCGTTTGAAG GCTTAACCGC TGGGGTGAAC TGGCGTATTT CCGGATATCG TAACGATGTC
AGTGACTTGA TCGATTATGA TGATCACACC CTGAAATATT ACAACGAAGG GAAAGCGCGG
ATTAAGGGCG TCGAGGCGAC CGCCAATTTT GATACCGGAC CACTGACGCA TACTGTGAGT
TATGATTATG TCGATGCGCG CAATGCGATT ACCGACACGC CGTTGTTACG CCGTGCTAAA
CAGCAGGTGA AATACCAGCT CGACTGGCAG TTGTATGACT TCGACTGGGG TATTACTTAT
CAGTATTTAG GCACTCGCTA TGATAAGGAT TACTCATCTT ATCCTTATCA AACCGTTAAA
ATGGGCGGTG TGAGCTTGTG GGATCTTGCG GTTGCGTATC CGGTCACCTC TCACCTGACA
GTTCGTGGTA AAATAGCCAA CCTGTTCGAC AAAGATTATG AGACAGTCTA TGGCTACCAA
ACTGCAGGAC GGGAATACAC CTTGTCTGGC AGCTACACCT TCTGA
 
Protein sequence
MIKKASLLTA CSVTAFSAWA QDTSPDTLVV TANRFEQPRS TVLAPTTVVT RQDIDRWQST 
SVNDVLRRLP GVDITQNGGS GQLSSIFIRG TNASHVLVLI DGVRLNLAGV SGSADLSQFP
IALVQRVEYI RGPRSAVYGS DAIGGVVNII TTRDEPGTEI SAGWGSNSYQ NYDVSTQQQL
GDKTRVTLLG DYAHTHGYDV VAYGNTGTQA QTDNDGFLSK TLYGALEHNF TDAWSGFVRG
YGYDNRTNYD AYYSPGSPLL DTRKLYSQSW DAGLRYNGEL IKSQLITSYS HSKDYNYDPH
FGRYDSSATL DEMKQYTVQW ANNVIVGHGS IGAGVDWQKQ TTTPGTGYVE NGYDQRNTGI
YLTGLQQVGD FTFEGAARSD DNSQFGRHGT WQTSAGWEFI EGYRFIASYG TSYKAPNLGQ
LYGFYGNPNL DPEKSKQWEG AFEGLTAGVN WRISGYRNDV SDLIDYDDHT LKYYNEGKAR
IKGVEATANF DTGPLTHTVS YDYVDARNAI TDTPLLRRAK QQVKYQLDWQ LYDFDWGITY
QYLGTRYDKD YSSYPYQTVK MGGVSLWDLA VAYPVTSHLT VRGKIANLFD KDYETVYGYQ
TAGREYTLSG SYTF
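
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The minimal sketch below uses the standard amino acid assignments, which translation table 11 shares with the standard code. It does not model table 11's alternative start codons, and it assumes an unambiguous A/C/G/T sequence read in frame from the first base. The names `translate`, `first_block`, and `last_block` are illustrative, and only the first 30 and last 15 bases of the gene sequence are used, to keep the example short.

```python
# Codon lookup in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Step through complete codons only; trailing 1-2 bases are dropped.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        first, second, third = seq[i:i + 3]
        # str.index raises ValueError for characters other than T, C, A, G.
        idx = 16 * BASES.index(first) + 4 * BASES.index(second) + BASES.index(third)
        protein.append(AMINO_ACIDS[idx])
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence in this record.
first_block = "ATGATTAAAA AAGCTTCGCT GCTGACGGCG"
last_block = "AGCTACACCT TCTGA"

if __name__ == "__main__":
    print(translate(first_block))  # MIKKASLLTA
    print(translate(last_block))   # SYTF*
```

The first result matches the opening residues of the protein sequence, and the second ends with the stop codon that follows the final SYTF residues.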