Gene B21_01864 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagB21_01864 
SymboltyrP 
ID8113737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21 
KingdomBacteria 
Replicon accessionNC_012892 
Strand
Start bp1934137 
End bp1935348 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content51% 
IMG OID644848083 
Producthypothetical protein 
Protein accessionYP_002999656 
Protein GI251785352 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00837] aromatic amino acid transport protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00693993 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAACA GAACTCTGGG AAGTGTTTTT ATCGTGGCGG GAACCACAAT TGGCGCAGGC 
ATGCTGGCAA TGCCGCTGGC TGCGGCCGGT GTTGGTTTTA GCGTTACGTT AATCTTGTTG
ATTGGGCTTT GGGCGTTGAT GTGCTACACG GCGCTATTAC TGCTGGAGGT GTACCAGCAT
GTTCCGGCAG ATACCGGTCT TGGCACGCTG GCAAAACGCT ATCTGGGACG CTACGGTCAG
TGGCTGACGG GCTTCAGTAT GATGTTCTTA ATGTATGCTC TGACTGCGGC ATACATCAGC
GGTGCCGGTG AATTGTTGGC CTCCAGCATC AGCGACTGGA CAGGTATTTC AATGTCGGCA
ACCGCTGGCG TGCTGTTGTT CACTTTTGTT GCTGGTGGCG TGGTTTGTGT CGGGACATCG
CTGGTCGATT TGTTTAACCG TTTTCTGTTC AGCGCCAAAA TTATTTTTCT GGTGGTGATG
CTGGTATTGC TGCTACCGCA TATTCACAAA GTGAATCTTT TAACCCTGCC GTTGCAACAG
GGGCTGGCTC TGTCCGCAAT CCCGGTGATT TTTACCTCGT TTGGTTTTCA CGGTAGCGTG
CCGAGTATTG TCAGCTATAT GGATGGCAAC GTTCGTAAGC TACGCTGGGT GTTTATAACC
GGTAGTGCTA TCCCCCTGGT GGCATATATT TTCTGGCAGG TGGCAACGCT TGGCAGCATT
GATTCAACAA CCTTTATGGG ATTGCTGGCT AATCATGCTG GATTAAACGG GCTGTTACAG
GCGTTACGCG AAATGGTGGC CTCTCCGCAT GTTGAGCTGG CAGTGCATTT ATTTGCTGAT
TTAGCCCTCG CCACGTCATT TCTCGGCGTT GCGTTAGGCT TATTTGATTA TCTGGCTGAT
TTGTTTCAGC GTTCAAATAC CGTTGGTGGA CGATTGCAAA CTGGTGCAAT TACCTTTCTG
CCGCCGTTGG CGTTTGCACT GTTTTATCCA CGAGGATTTG TGATGGCGCT GGGTTACGCC
GGTGTGGCGC TGGCGGTACT GGCATTGATT ATCCCTTCAC TATTGACCTG GCAAAGCAGA
AAGCACAATC CTCAGGCGGG TTACCGGGTG AAAGGTGGTC GTCCGGCGCT GGTGGTGGTG
TTTCTCTGTG GTATTGCTGT GATTGGCGTG CAATTTTTGA TTGCGGCAGG GTTGTTACCA
GAAGTGGGGT GA
 
Protein sequence
MKNRTLGSVF IVAGTTIGAG MLAMPLAAAG VGFSVTLILL IGLWALMCYT ALLLLEVYQH 
VPADTGLGTL AKRYLGRYGQ WLTGFSMMFL MYALTAAYIS GAGELLASSI SDWTGISMSA
TAGVLLFTFV AGGVVCVGTS LVDLFNRFLF SAKIIFLVVM LVLLLPHIHK VNLLTLPLQQ
GLALSAIPVI FTSFGFHGSV PSIVSYMDGN VRKLRWVFIT GSAIPLVAYI FWQVATLGSI
DSTTFMGLLA NHAGLNGLLQ ALREMVASPH VELAVHLFAD LALATSFLGV ALGLFDYLAD
LFQRSNTVGG RLQTGAITFL PPLAFALFYP RGFVMALGYA GVALAVLALI IPSLLTWQSR
KHNPQAGYRV KGGRPALVVV FLCGIAVIGV QFLIAAGLLP EVG