Gene EcolC_1729 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1729 
Symbol 
ID6067012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1925798 
End bp1927009 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content51% 
IMG OID641601144 
Productaromatic amino acid transporter 
Protein accessionYP_001724706 
Protein GI170019752 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0814] Amino acid permeases 
TIGRFAM ID[TIGR00837] aromatic amino acid transport protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000563721 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAAAACA GAACTCTGGG AAGTGTTTTT ATCGTGGCGG GAACCACAAT TGGCGCAGGC 
ATGCTGGCAA TGCCGCTGGC TGCGGCCGGT GTTGGTTTTA GCGTTACGTT AATCTTGTTG
ATTGGGCTTT GGGCGTTGAT GTGCTACACG GCGCTATTAC TGCTGGAGGT GTACCAGCAT
GTTCCGGCAG ATACCGGTCT TGGCACGCTG GCAAAACGCT ATCTGGGACG CTACGGTCAG
TGGCTGACGG GCTTCAGTAT GATGTTCTTA ATGTATGCTC TGACTGCGGC ATACATCAGC
GGTGCCGGTG AATTGTTGGC CTCCAGCATC AGCGACTGGA CAGGTATTTC AATGTCGGCA
ACCGCTGGCG TGCTGTTGTT CACTTTTGTT GCCGGTGGCG TGGTTTGTGT CGGGACATCG
CTGGTCGATT TGTTTAACCG TTTTCTGTTC AGCGCCAAAA TTATTTTTCT GGTGGTGATG
CTGGTATTGC TGCTACCGCA TATTCACAAA GTGAATCTTT TAACCCTGCC GTTGCAACAG
GGGCTGGCTC TGTCCGCAAT CCCGGTGATT TTTACCTCGT TTGGTTTTCA CGGTAGCGTG
CCGAGTATTG TCAGCTATAT GGATGGCAAC GTTCGTAAGC TACGCTGGGT GTTTATAACC
GGTAGTGCGA TCCCCCTGGT GGCATATATT TTCTGGCAGG TGGCAACGCT TGGCAGCATT
GATTCAACAA CCTTTATGGG ATTGCTGGCT AATCATGCTG GATTAAACGG GCTGTTACAG
GCGTTACGCG AAATGGTGGC CTCTCCGCAT GTTGAGCTGG CAGTGCATTT ATTTGCTGAT
TTAGCCCTCG CCACGTCATT TCTCGGCGTT GCGTTAGGCT TATTTGATTA TCTGGCTGAT
TTATTTCAGC GTTCAAATAC CGTTGGTGGA CGGTTGCAAA CTGGTGCAAT TACCTTTCTG
CCGCCGTTGG CGTTTGCACT GTTTTATCCA CGAGGATTTG TGATGGCGCT GGGTTACGCC
GGTGTGGCGC TGGCGGTACT GGCATTGATT ATCCCTTCGC TGTTGACCTG GCAAAGCAGA
AAGCACAATC CTCAGGCGGG TTACCGGGTG AAAGGTGGTC GTCCGGCGCT GGTGGTGGTG
TTTCTCTGTG GTATTGCTGT GATTGGCGTG CAATTTTTGA TTGCGGCAGG GTTGTTACCA
GAAGTGGGGT GA
 
Protein sequence
MKNRTLGSVF IVAGTTIGAG MLAMPLAAAG VGFSVTLILL IGLWALMCYT ALLLLEVYQH 
VPADTGLGTL AKRYLGRYGQ WLTGFSMMFL MYALTAAYIS GAGELLASSI SDWTGISMSA
TAGVLLFTFV AGGVVCVGTS LVDLFNRFLF SAKIIFLVVM LVLLLPHIHK VNLLTLPLQQ
GLALSAIPVI FTSFGFHGSV PSIVSYMDGN VRKLRWVFIT GSAIPLVAYI FWQVATLGSI
DSTTFMGLLA NHAGLNGLLQ ALREMVASPH VELAVHLFAD LALATSFLGV ALGLFDYLAD
LFQRSNTVGG RLQTGAITFL PPLAFALFYP RGFVMALGYA GVALAVLALI IPSLLTWQSR
KHNPQAGYRV KGGRPALVVV FLCGIAVIGV QFLIAAGLLP EVG