Gene EcE24377A_1930 details


Gene Information

Locus tag: EcE24377A_1930
Symbol: btuC
ID: 5588734
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli E24377A
Kingdom: Bacteria
Replicon accession: NC_009801
Strand:
Start bp: 1919993
End bp: 1920973
Gene Length: 981 bp
Protein Length: 326 aa
Translation table: 11
GC content: 55%
IMG OID: 640925605
Product: vitamin B12-transporter permease
Protein accession: YP_001463008
Protein GI: 157158270
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG4139] ABC-type cobalamin transport system, permease component
TIGRFAM ID:
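
The start, end and length fields above are consistent with each other if the coordinates are read as 1-based and inclusive, which is an assumption here (the record does not state its convention). A minimal sketch of that arithmetic, with variable names chosen only for illustration:

```python
# Coordinates copied from the record above; assumed 1-based and inclusive.
start_bp = 1919993
end_bp = 1920973

# With inclusive coordinates, the length is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 981 0 326
```

The result matches the 981 bp gene length and 326 aa protein length listed in the record.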


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00226641
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGACAC TTGCCCGCCA ACAACAGCGA CAAAATATTC GCTGGTTATT ATGCCTGTCA 
GTTTTGATGC TGCTGGCGCT TCTCTTAAGC CTTTGCGCCG GTGAACAATG GATCTCGCCA
GGTGACTGGT TTTCTCCTCG TGGCGAACTG TTCGTCTGGC AGATTCGCCT GCCACGTACG
CTGGCTGTAT TGCTGGTTGG TGCGGCGCTG GCTATATCCG GCGCAGTAAT GCAGGCGTTG
TTTGAAAATC CTCTGGCAGA ACCTGGACTA CTTGGCGTCT CTAACGGCGC AGGCGTAGGG
CTTATCGCCG CGGTATTGCT TGGGCAAGGG CAACTCCCAA ACTGGGCACT AGGGCTGTGT
GCGATTGCTG GCGCGCTTAT CATCACTTTA ATACTCTTAC GTTTCGCCCG TCGTCATCTT
TCGACCAGTC GGTTATTGCT GGCTGGCGTT GCATTAGGGA TTATCTGTAG CGCACTAATG
ACGTGGGCTA TCTACTTTTC CACCTCTGTT GATTTACGTC AGCTGATGTA CTGGATGATG
GGCGGTTTTG GCGGCGTAGA CTGGCGGCAA AGCTGGCTGA TGCTGGCATT GATCCCCGTG
TTGTTGTGGA TCTGTTGTCA GTCCAGGCCG ATGAATATGT TAGCACTTGG CGAGATCTCG
GCGCGGCAAC TGGGTTTACC CCTGTGGTTC TGGCGCAATG TGCTGGTGGC AGCGACCGGC
TGGATGGTTG GCGTCAGTGT GGCGCTGGCG GGTGCTATCG GCTTTATTGG TCTGGTGATC
CCACATATTC TTCGGTTGTG TGGTTTAACC GATCATCGCG TATTACTTCC CGGCTGCGCG
CTGGCAGGGG CGAGCGCATT GCTGCTGGCC GATGTTGTAG CGCGCTTGGC ATTAGCTGCC
GCAGAGCTGC CTATTGGCGT GGTCACCGCA ACGTTGGGTG CGCCGGTGTT TATCTGGTTA
TTGTTAAAAG CAGGACGTTA G
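
The GC content field in the record can in principle be recomputed from the gene sequence above. The sketch below assumes GC content means the percentage of G and C bases among all bases; the record does not say how the database rounds the figure. The function name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)

# The grouping spaces used in the listing above are stripped before counting,
# so lines can be pasted in as shown.
print(gc_percent("ATGC ATGC"))
```

Pasting the full 981-base sequence into `gc_percent` gives the value to compare against the GC content field.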
 
Protein sequence
MLTLARQQQR QNIRWLLCLS VLMLLALLLS LCAGEQWISP GDWFSPRGEL FVWQIRLPRT 
LAVLLVGAAL AISGAVMQAL FENPLAEPGL LGVSNGAGVG LIAAVLLGQG QLPNWALGLC
AIAGALIITL ILLRFARRHL STSRLLLAGV ALGIICSALM TWAIYFSTSV DLRQLMYWMM
GGFGGVDWRQ SWLMLALIPV LLWICCQSRP MNMLALGEIS ARQLGLPLWF WRNVLVAATG
WMVGVSVALA GAIGFIGLVI PHILRLCGLT DHRVLLPGCA LAGASALLLA DVVARLALAA
AELPIGVVTA TLGAPVFIWL LLKAGR
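
The protein sequence can be checked against the gene sequence by translating it codon by codon. A minimal sketch follows, assuming the gene sequence is given in coding orientation (it begins with ATG and ends with TAG). NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may serve as start codons; this sketch does not handle alternative start codons, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence above, grouping spaces left in.
first_line = "ATGCTGACAC TTGCCCGCCA ACAACAGCGA CAAAATATTC GCTGGTTATT ATGCCTGTCA"
print(translate(first_line))  # MLTLARQQQRQNIRWLLCLS
```

The output matches the first 20 residues of the protein sequence; the same function can be applied to the full gene sequence to compare against all 326 residues.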