Gene ECH74115_5140 details


Gene Information

Locus tag: ECH74115_5140
Symbol: tnaB
ID: 6968487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4782040
End bp: 4783287
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 47%
IMG OID: 643388811
Product: tryptophan permease TnaB
Protein accession: YP_002273237
Protein GI: 209399599
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0814] Amino acid permeases
TIGRFAM ID: [TIGR00837] aromatic amino acid transport protein
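
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(4782040, 4783287)
print(length)                  # 1248
print(protein_length(length))  # 415
```

Both results agree with the Gene Length and Protein Length fields listed in the record.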


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.406172
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGATC AAGCTGAAAA AAAGCACTCT GCATTTTGGG GTGTTATGGT TATAGCAGGT 
ACAGTAATCG GTGGAGGTAT GTTTGCTTTA CCTGTGGATC TTGCCGGTGC CTGGTTTTTC
TGGGGTGCCT TTATCCTTAT CATTGCCTGG TTTTCAATGC TTCATTCCGG GTTATTGTTA
TTAGAAGCAA ATTTAAATTA TCCCGTCGGC TCCAGTTTTA ACACCATCAC CAAAGATTTA
ATCGGTAACA CCTGGAACAT TATCAGCGGT ATTACCGTTG CCTTCGTTCT CTATATCCTC
ACTTATGCCT ATATCTCTGC TAATGGTGCG ATCATTAGTG AAACGATATC AATGAATTTG
GGTTATCACG CTAATCCACG TATTGTCGGG ATCTGCACAG CCATTTTCGT TGCCAGCGTA
TTGTGGATAA GCTCGTTAGC CGCCAGTCGT ATTACCTCAT TGTTCCTCGG GCTGAAGATT
ATCTCCTTTG TGATCGTGTT TGGTTCTTTC TTCTTCCTGG TCGATTACTC CATTCTGCGC
GATGCCACCA GCTCCACTGC GGGAACGTCT TACTTCCCGT ATATCTTTAT GGCTTTGCCG
GTGTGTCTGG CGTCATTTGG TTTCCACGGC AATATTCCCA GCCTGATTAT TTGCTATGGC
AAACGCAAAG ATAAGCTAAT CAAAAGCGTG GTCTTCGGTT CGCTGCTGGC GCTGGTGATT
TATCTCTTCT GGCTCTATTG CACCATGGGG AATATTCCGC GAGAAAGCTT TAAGGCGATT
ATTTCCTCAG GCGGCAACGT TGATTCACTG GTGAAATCGT TCCTCGGCAC CAAACAGCAC
GGCATTATCG AGTTTTGCCT GCTGGTGTTC TCCAACTTAG CTGTCGCCAG CTCGTTTTTT
GGTGTCACGC TGGGATTGTT CGATTATCTG GCGGACCTGT TTAAGATTGA TAACTCCCAC
GGCGGGCGTT TCAAAACCGT GCTGTTAACC TTCCTGCCAC CCGCGTTGTT GTATCTGATC
TTCCCGAACG GCTTTATTTA CGGGATCGGC GGTGCCGGAC TGTGCGCCAC TATTTGGGCG
GTCATTATTC CCGCAGTGCT GGCAATCAAA GCTCGCAAAA AGTTTCCCAA TCAGATGTTC
ACGGTCTGGG GCGGCAATCT TATTCCGGCG ATTGTCATTC TCTTTGGTAT AACCGTAATT
TTGTGCTGGT TCGGCAACGT CTTTAACGTG TTACCTAAAT TTGGCTAA
 
Protein sequence
MTDQAEKKHS AFWGVMVIAG TVIGGGMFAL PVDLAGAWFF WGAFILIIAW FSMLHSGLLL 
LEANLNYPVG SSFNTITKDL IGNTWNIISG ITVAFVLYIL TYAYISANGA IISETISMNL
GYHANPRIVG ICTAIFVASV LWISSLAASR ITSLFLGLKI ISFVIVFGSF FFLVDYSILR
DATSSTAGTS YFPYIFMALP VCLASFGFHG NIPSLIICYG KRKDKLIKSV VFGSLLALVI
YLFWLYCTMG NIPRESFKAI ISSGGNVDSL VKSFLGTKQH GIIEFCLLVF SNLAVASSFF
GVTLGLFDYL ADLFKIDNSH GGRFKTVLLT FLPPALLYLI FPNGFIYGIG GAGLCATIWA
VIIPAVLAIK ARKKFPNQMF TVWGGNLIPA IVILFGITVI LCWFGNVFNV LPKFG
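
The gene and protein sequences above can be compared by translating the coding sequence. The following is a minimal sketch, assuming the standard amino acid assignments used by translation table 11 and a coding sequence given in the 5' to 3' coding orientation, as listed. `translate` and `gc_content` are hypothetical helper names, and only short excerpts of the listed gene sequence are used as input.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the amino acid string
# for translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 9 bases of the gene sequence in this record.
print(translate("ATGACTGATC AAGCTGAAAA AAAGCACTCT"))  # MTDQAEKKHS
print(translate("TTTGGCTAA"))                         # FG
```

The two excerpts reproduce the first ten and last two residues of the listed protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content field.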