Gene B21_02392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_02392 
SymbolhcaT 
ID8116135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp2531722 
End bp2532861 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content57% 
IMG OID644848594 
Producthypothetical protein 
Protein accessionYP_003000167 
Protein GI251785863 
COG category 
COG ID 
TIGRFAM ID[TIGR00882] oligosaccharide:H+ symporter
[TIGR00902] phenyl proprionate permease family protein 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTTGC AATCCACGCG CTGGTTGGCG CTCGGCTATT TCACATACTT TTTTAGTTAC 
GGCATTTTTC TACCTTTCTG GAGCGTCTGG CTTAAAGGGA TTGGTTTAAC GCCAGAAACC
ATCGGCCTGT TATTGGGGGC AGGTCTGGTT GCCCGTTTTC TCGGGAGTTT GCTCATCGCG
CCCCGCGTCA GCGATCCTTC CCGCCTGATT TTCGCCTTGC GCGTGCTGGC ACTGCTGACA
CTTCTCTTTG CTGTCGCCTT CTGGGCGGGG GCGCACGTAG CGTGGCTGAT GCTGGTGATG
ATTGGCTTTA ACCTCTTTTT CTCACCGCTG GTACCGTTGA CCGATGCACT GGCGAATACG
TGGCAAAAGC AGTTCCCGCT TGATTACGGC AAAGTGCGAC TGTGGGGCTC GGTGGCGTTT
GTCATTGGCT CGGCGCTGAC GGGCAAACTG GTCACTATGT TTGATTATCG GGTGATCCTC
GCGCTGTTGA CGTTGGGCGT GGCATCCATG CTGCTCGGCT TTCTCATCCG TCCGACGATT
CAGCCACAAG GGGCAAGCCG CCAGCAGGAG AGCACCGGTT GGTCAGCGTG GTTGGCGCTG
GTTCGCCAGA ACTGGCGCTT TCTGGCCTGC GTTTGTTTAT TGCAGGGGGC ACATGCGGCC
TATTACGGTT TTAGCGCCAT TTACTGGCAG GCAGCTGGCT ACTCGGCCTC GGCGGTGGGG
TATTTGTGGT CGCTGGGCGT GGTGGCGGAA GTCATTATCT TTGCGCTGAG TAATAAACTT
TTCCGCCGTT GTAGTGCACG CGATATGCTG TTGATCTCGG CGATTTGCGG CGTAGTGCGC
TGGGGCATTA TGGGAGCAAC TACGGCGTTG CCGTGGTTGA TAGTGGTGCA AATTCTGCAT
TGCGGCACCT TCACGGTCTG CCACCTGGCC GCCATGCGTT ATATTGCTGC TCGCCAGGGT
AGCGAAGTCA TCCGTTTACA GGCGGTTTAC TCTGCCGTCG CGATGGGCGG CAGTATCGCT
ATCATGACCG TTTTCGCCGG TTTCCTGTAT CAATATCTGG GCCACGGCGT GTTCTGGGTA
ATGGCGCTGG TGGCGCTGCC GGCAATGTTT TTGCGCCCGA AAGTTGTTCC CTCATGCTGA
 
Protein sequence
MVLQSTRWLA LGYFTYFFSY GIFLPFWSVW LKGIGLTPET IGLLLGAGLV ARFLGSLLIA 
PRVSDPSRLI FALRVLALLT LLFAVAFWAG AHVAWLMLVM IGFNLFFSPL VPLTDALANT
WQKQFPLDYG KVRLWGSVAF VIGSALTGKL VTMFDYRVIL ALLTLGVASM LLGFLIRPTI
QPQGASRQQE STGWSAWLAL VRQNWRFLAC VCLLQGAHAA YYGFSAIYWQ AAGYSASAVG
YLWSLGVVAE VIIFALSNKL FRRCSARDML LISAICGVVR WGIMGATTAL PWLIVVQILH
CGTFTVCHLA AMRYIAARQG SEVIRLQAVY SAVAMGGSIA IMTVFAGFLY QYLGHGVFWV
MALVALPAMF LRPKVVPSC