Gene SbBS512_E1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E1034 
SymboltyrP 
ID6272289 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp951951 
End bp953162 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content51% 
IMG OID641725176 
Producttyrosine-specific transport protein 
Protein accessionYP_001879698 
Protein GI187734006 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00837] aromatic amino acid transport protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000311327 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAACA GAACTCTGGG AAGTGTTTTT ATCGTGGCGG GAACCACAAT TGGCGCAGGC 
ATGCTGGCAA TGCCGCTGGC TGCGGCAGGT GTTGGTTTTA GCGTCACGTT AATCTTGTTG
ATTGGGCTTT GGGCGTTGAT GTGCTACACG GCGCTATTAC TGCTGGAGGT GTACCAGCAT
GTTCCGGCAG ATACCGGTCT GGGCACGCTG GCAAAACGCT ATCTGGGACG CTACGGTCAA
TGGCTGACGG GCTTCAGTAT GATGTTCTTA ATGTATGCTC TGACTGCGGC ATACATCAGC
GGTGCCGGTG AATTGTTGGC CTCCAGCATC AGCGACTGGA CAGGTATTTC AATGTCGGCA
ACCGCTGGCG TGCTGTTGTT CACTTTTGTT GCCGGTGGCG TGGTTTGTGT TGGAACATCG
CTGGTCGATT TATTTAACCG TTTTCTGTTC AGCGCCAAAA TTATTTTTCT GGTGGTGATG
CTGGTATTGC TGCTACCGCA TATTCACAAA GTGAATCTTT TAACCCTGCC GTTGCAACAG
GGGCTGGCTC TGTCAGCAAT CCCGGTGATT TTTACCTCAT TTGGTTTTCA CGGTAGCGTG
CCGAGTATTG TCAGCTATAT GGATGGCAAC GTTCGTAAGC TACGCAGGGT GTTTATAACC
GGTAGTGCGA TCCCCCTGGT GGCATATATT TTCTGGCAGG TGGCAACGCT TGGCAGCATT
GATTCAACAA CCTTTATGGG ATTGCTGGCT AATCATGCTG GATTAAACGG GCTGTTACAG
GCGTTACGCG AAATGGTGGC CTCTCCGCAT GTTGAGCTGG CAGTGCATTT ATTTGCTGAT
TTAGCCCTCG CCACGTCATT TCTCGGCGTT GCGTTAGGCT TATTTGATTA TCTGGCTGAT
TTATTTCAGC GTTCAAATAC CGTTGGTGGA CGGTTGCAAA CTGGTGCAAT TACCTTTCTG
CCGCCGTTGG CGTTTGCACT GTTTTATCCA CGAGGATTTG TGATGGCGCT GGGTTACGCC
GGTGTGGCGC TGGCGGTACT GGCATTGATT ATCCCTTCGC TGTTGACCTG GCAAAGCAGA
AAGCACAATC CTCAGGCGGG TTACCGGGTC AAAGGTGGTC GTCCGGCGCT GGTGGTGGTG
TTTCTCTGTG GTATTGCTGT GATTGGCGTG CAATTTTTGA TTGCGGCAGG GTTGTTACCA
GAAGTGGGGT GA
 
Protein sequence
MKNRTLGSVF IVAGTTIGAG MLAMPLAAAG VGFSVTLILL IGLWALMCYT ALLLLEVYQH 
VPADTGLGTL AKRYLGRYGQ WLTGFSMMFL MYALTAAYIS GAGELLASSI SDWTGISMSA
TAGVLLFTFV AGGVVCVGTS LVDLFNRFLF SAKIIFLVVM LVLLLPHIHK VNLLTLPLQQ
GLALSAIPVI FTSFGFHGSV PSIVSYMDGN VRKLRRVFIT GSAIPLVAYI FWQVATLGSI
DSTTFMGLLA NHAGLNGLLQ ALREMVASPH VELAVHLFAD LALATSFLGV ALGLFDYLAD
LFQRSNTVGG RLQTGAITFL PPLAFALFYP RGFVMALGYA GVALAVLALI IPSLLTWQSR
KHNPQAGYRV KGGRPALVVV FLCGIAVIGV QFLIAAGLLP EVG