Gene EcHS_A1005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1005 
Symbol 
ID5592122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1007381 
End bp1009003 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content46% 
IMG OID640920175 
Productamino acid permease family protein 
Protein accessionYP_001457740 
Protein GI157160422 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAACTTAC TGTTTCACAC TCTGCTTTTT TGTTTCTTCT ATCTGACTTG CTTTATTCCA 
AATTTTATTC GTTTAAAAAT AAAATGTGCA GCAGGTTATA ATTTTGCATT TCGCTATTTC
CGCACTTCTT ATTTGCCGCG CATAATCCCT CGTTTTACAG ATGCCCCTTT AATTTTGGCG
AAGGATTTGT CTATGGCTGG GAATGTTCAG GAAAAACAGT TGCGATGGTA CAACATTGCG
CTGATGTCTT TTATCACTGT CTGGGGTTTT GGCAACGTTG TTAATAACTA TGCCAACCAG
GGGCTGGTGG TTGTTTTTTC ATGGGTGTTT ATCTTTGCAC TCTATTTCAC ACCTTATGCG
CTAATTGTTG GTCAGTTAGG CTCGACCTTC AAAGATGGGA AGGGCGGGGT CAGTACCTGG
ATTAAACACA CGATGGGACC CGGACTGGCT TATCTCGCCG CGTGGACCTA CTGGGTGGTG
CATATTCCCT ATCTGGCACA AAAACCCCAG GCAATTCTGA TTGCGCTCGG TTGGGCGATG
AAAGGCGACG GTTCGCTAAT CAAAGAATAT TCAGTCGTAG CGTTACAGGG GTTAACGCTG
GTGCTGTTTA TCTTCTTTAT GTGGGTTGCT TCACGCGGTA TGAAATCGCT GAAAATCGTC
GGTTCTGTGG CAGGGATTGC TATGTTTGTT ATGTCGCTCC TGTATGTGGC GATGGCGGTA
ACCGCGCCTG CAATTACTGA AGTGCATATT GCGACCACAA ACATTACCTG GGAAACGTTC
ATTCCTCATA TCGACTTTAC CTACATTACC ACTATTTCAA TGCTGGTTTT CGCGGTTGGC
GGAGCAGAGA AGATTTCTCC TTACGTTAAT CAAACGCGCA ACCCAGGAAA AGAATTTCCA
AAAGGGATGT TATGCCTGGC GGTGATGGTT GCGGTTTGTG CCATTCTGGG CTCGCTGGCG
ATGGGGATGA TGTTTGATTC GCGTAATATC CCGGATGACT TAATGACCAA CGGTCAGTAT
TACGCCTTTC AGAAGCTGGG CGAGTATTAC AACATGGGTA ATACTTTAAT GGTGATTTAC
GCCATTGCGA ATACCCTGGG ACAAGTAGCG GCGCTGGTAT TCTCGATTGA TGCCCCGCTT
AAAGTGCTAT TAGGTGATGC TGACAGCAAA TATATTCCAG CCAGTTTATG TCGTACCAAC
GCTTCTGGTA CGCCCGTTAA TGGCTATTTT CTGACCCTGG TACTGGTGGC GATTCTGATT
ATGCTGCCGA CTCTCGGCAT TGGTGATATG AACAATCTCT ATAAATGGCT GTTGAACCTT
AATTCGGTAG TGATGCCGCT GCGTTATTTA TGGGTATTTG TTGCATTTAT TGCAGTCGTT
CGCTTGGCGC AGAAATATAA ACCAGAATAT GTCTTTATTC GTAATAAGCC TCTGGCAATG
ACCGTCGGGA TTTGGTGTTT TGCCTTTACC GCCTTTGCCT GTTTGACGGG GATCTTCCCG
AAAATGGAAG CCTTCACTGC AGAGTGGACC TTCCAGTTGG CGCTGAATGT TGCAACGCCG
TTTGTGCTGG TAGGATTAGG ACTGATATTC CCGCTGCTGG CGCGTAAAGC GAATAGTAAA
TAA
 
Protein sequence
MNLLFHTLLF CFFYLTCFIP NFIRLKIKCA AGYNFAFRYF RTSYLPRIIP RFTDAPLILA 
KDLSMAGNVQ EKQLRWYNIA LMSFITVWGF GNVVNNYANQ GLVVVFSWVF IFALYFTPYA
LIVGQLGSTF KDGKGGVSTW IKHTMGPGLA YLAAWTYWVV HIPYLAQKPQ AILIALGWAM
KGDGSLIKEY SVVALQGLTL VLFIFFMWVA SRGMKSLKIV GSVAGIAMFV MSLLYVAMAV
TAPAITEVHI ATTNITWETF IPHIDFTYIT TISMLVFAVG GAEKISPYVN QTRNPGKEFP
KGMLCLAVMV AVCAILGSLA MGMMFDSRNI PDDLMTNGQY YAFQKLGEYY NMGNTLMVIY
AIANTLGQVA ALVFSIDAPL KVLLGDADSK YIPASLCRTN ASGTPVNGYF LTLVLVAILI
MLPTLGIGDM NNLYKWLLNL NSVVMPLRYL WVFVAFIAVV RLAQKYKPEY VFIRNKPLAM
TVGIWCFAFT AFACLTGIFP KMEAFTAEWT FQLALNVATP FVLVGLGLIF PLLARKANSK