Gene ECH74115_2429 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2429 
SymbolbtuC 
ID6971073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2300180 
End bp2301160 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content55% 
IMG OID643386299 
Productvtamin B12-transporter permease 
Protein accessionYP_002270781 
Protein GI209400661 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4139] ABC-type cobalamin transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.175839 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones87 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACAC TTGCCCGCCA ACAACAGCGA CAAAATATTC GCTGGTTATT ATGCCTGTCA 
GTTTTGATGC TGCTGGCGCT TCTCTTAAGC CTTTGCGCCG GTGAACAATG GATCTCGCCA
GGTGACTGGT TTTCTCCTCG TGGCGAACTG TTCGTCTGGC AAATTCGCCT GCCACGTACG
CTGGCTGTAT TGCTGGTTGG TGCGGCGCTG GCTATATCCG GCGCTGTAAT GCAGGCGTTG
TTTGAAAATC CTCTGGCAGA ACCTGGACTA CTTGGCGTCT CTAACGGCGC AGGCGTGGGG
CTTATCGCCG CGGTATTGCT TGGGCAAGGG CAACTCCCCA ACTGGGCGCT AGGGCTGTGT
GCGATTGCTG GGGCGCTTAT CATCACTTTA ATACTCTTAC GTTTCGCCCG TCGTCATCTT
TCGACCAGTC GGTTATTGCT GGCTGGCGTT GCATTAGGGA TTATCTGTAG CGCACTAATG
ACGTGGGCTA TCTACTTTTC CACCTCTGTT GATTTACGTC AGCTGATGTA CTGGATGATG
GGCGGTTTTG GCGGCGTAGA CTGGCGGCAA AGCTGGCTGA TGCTGGCATT GATCCCCATG
TTGTTGTGGA TCTGTTGTCA GTCCAGGCCG ATGAATATGT TAGCACTTGG CGAGATCTCG
GCGCGGCAAC TGGGTTTACC CCTGTGGTTC TGGCGCAATG TGCTGGTGGC AGCGACCGGC
TGGATGGTTG GCGTCAGTGT AGCGCTGGCG GGTGCTATCG GCTTTATTGG TCTGGTGATC
CCACATATTC TTCGGTTGTG TGGTTTAACC GATCATCGCG CATTACTTCC CGGCTGCGCG
CTGGCAGGGG CGAGCGCATT GCTGCTGGCC GATATTGTAG CGCGCCTGGC ATTAGCTGCC
GCAGAGCTGC CTATTGGCGT GGTCACCGCA ACGTTGGGTG CGCCGGTGTT TATCTGGTTA
TTGTTAAAAG CAGGACGTTA G
 
Protein sequence
MLTLARQQQR QNIRWLLCLS VLMLLALLLS LCAGEQWISP GDWFSPRGEL FVWQIRLPRT 
LAVLLVGAAL AISGAVMQAL FENPLAEPGL LGVSNGAGVG LIAAVLLGQG QLPNWALGLC
AIAGALIITL ILLRFARRHL STSRLLLAGV ALGIICSALM TWAIYFSTSV DLRQLMYWMM
GGFGGVDWRQ SWLMLALIPM LLWICCQSRP MNMLALGEIS ARQLGLPLWF WRNVLVAATG
WMVGVSVALA GAIGFIGLVI PHILRLCGLT DHRALLPGCA LAGASALLLA DIVARLALAA
AELPIGVVTA TLGAPVFIWL LLKAGR