Gene EcHS_A4400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4400 
Symbol 
ID5594680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4409135 
End bp4410637 
Gene Length1503 bp 
Protein Length500 aa 
Translation table11 
GC content47% 
IMG OID640923498 
Productamino acid permease family protein 
Protein accessionYP_001460942 
Protein GI157163624 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones48 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTCACA CGATAAAAAA GATGAGTCTG ATAGGACTCA TATTGATGAT CTTTACTTCC 
GTATTTGGAT TTGCCAATAG CCCATCGGCT TATTACTTAA TGGGTTATAG TGCGATTCCC
TTTTATATAT TTTCTGCATT GTTATTCTTT ATTCCATTCG CCTTAATGAT GGCTGAAATG
GGAGCTGCTT ATCGCAAAGA AGAAGGCGGT ATCTATTCCT GGATGAATAA TAGTGTCGGA
CCACGTTTTG CCTTCATTGG TACGTTTATG TGGTTTTCCT CTTATATCAT CTGGATGGTG
AGTACCTCCG CGAAAGTTTG GGTACCGTTC TCAACATTCC TCTATGGTAG CGACATGACC
CAGCACTGGC GTATTGCTGG ACTGGAGCCT ACGCAGGTGG TTGGTCTGCT GGCAGTGGCA
TGGATGATTC TGGTCACCGT CGTTGCTTCA AAGGGGATTA ATAAAATTGC CCGCATTACT
GCGGTGGGCG GTATTGCAGT AATGTGTCTG AATTTAGTAT TGCTGTTAGT AAGCATTACT
ATTTTGTTAT TAAATGGTGG GCATTTCGCG CAGGATATTA ATTTCCTTGC ATCACCGAAC
CCAGGTTATC AGTCCGGTCT GGCAATGCTA TCGTTTGTGG TATTTGCTAT TTTTGCCTAT
GGCGGAATTG AAGCGGTTGG TGGTCTGGTC GATAAAACGG AAAATCCAGA AAAGAACTTT
GCCAAAGGTA TTGTTTTTGC CGCTATTGTT ATTTCAATCG GTTATTCGCT GGCAATATTT
TTATGGGGTG TCAGCACAAA CTGGCAGCAG GTATTAAGTA ATGGTTCCGT TAACCTCGGC
AATATTACCT ATGTGCTGAT GAAGAGCCTC GGGGTGACGC TGGGTAACGC ACTGCATTTG
TCACCTGAAG CGTCATTGTC GCTGGGTGTA TGGTTTGCGC GTATTACCGG ACTTTCGATG
TTCCTCGCTT ATACCGGGGC GTTCTTTACG CTTTGCTATT CACCGCTGAA AGCCATCATC
CAGGGGACGC CGAAAGCGTT GTGGCCGGAA CCGATGACGC GCCTGAATGC GATGGGGATG
CCTTCTATCG CCATGTGGAT GCAGTGCGGG TTGGTTACTG TCTTCATCCT GCTGGTTTCG
TTTGGTGGCG GTACCGCATC GGCGTTCTTT AACAAGCTGA CGCTGATGGC GAACGTGTCT
ATGACGCTTC CTTACCTGTT CCTCGCGCTG GCTTTCCCGT TCTTTAAAGC ACGTCAGGAT
CTCGACAGAC CGTTTGTGAT TTTCAAAACG CATTTGTCGG CAATGATTGC GACAGTGGTT
GTCGTACTGG TGGTGACATT TGCGAACGTC TTCACCATCA TTCAACCTGT GGTTGAAGCC
GGAGACTGGG ACAGCACATT GTGGATGATT GGCGGCCCTG TCTTCTTCTC GCTGTTAGCG
ATGGCGATTT ACCAGAACTA TTGCAGTCGC ATGGCGAATA AACCTGAGTT AGCTCTCGAC
TGA
 
Protein sequence
MPHTIKKMSL IGLILMIFTS VFGFANSPSA YYLMGYSAIP FYIFSALLFF IPFALMMAEM 
GAAYRKEEGG IYSWMNNSVG PRFAFIGTFM WFSSYIIWMV STSAKVWVPF STFLYGSDMT
QHWRIAGLEP TQVVGLLAVA WMILVTVVAS KGINKIARIT AVGGIAVMCL NLVLLLVSIT
ILLLNGGHFA QDINFLASPN PGYQSGLAML SFVVFAIFAY GGIEAVGGLV DKTENPEKNF
AKGIVFAAIV ISIGYSLAIF LWGVSTNWQQ VLSNGSVNLG NITYVLMKSL GVTLGNALHL
SPEASLSLGV WFARITGLSM FLAYTGAFFT LCYSPLKAII QGTPKALWPE PMTRLNAMGM
PSIAMWMQCG LVTVFILLVS FGGGTASAFF NKLTLMANVS MTLPYLFLAL AFPFFKARQD
LDRPFVIFKT HLSAMIATVV VVLVVTFANV FTIIQPVVEA GDWDSTLWMI GGPVFFSLLA
MAIYQNYCSR MANKPELALD