Gene EcHS_A3696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3696 
Symbol 
ID5595303 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3683373 
End bp3684842 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content52% 
IMG OID640922810 
Productinner membrane transporter YhiP 
Protein accessionYP_001460290 
Protein GI157162972 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID[TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACAA CAACACCCAT GGGGATGCTG CAGCAACCTC GCCCATTTTT CATGATCTTT 
TTTGTCGAGT TATGGGAGCG ATTCGGCTAC TACGGCGTGC AGGGCGTACT GGCGGTTTTC
TTCGTTAAAC AGCTTGGATT CTCGCAAGAG CAGGCTTTTG TCACTTTTGG TGCTTTTGCT
GCGCTGGTCT ATGGCCTCAT TTCCATTGGC GGCTATGTCG GCGACCACCT GCTGGGGACC
AAACGCACCA TTGTTCTTGG TGCACTTGTG CTGGCGATTG GCTACTTCAT GACCGGCATG
TCGCTACTTA AGCCTGACCT GATTTTCATC GCCCTGGGGA CTATCGCTGT AGGTAACGGC
CTGTTTAAAG CTAACCCAGC CAGCTTGCTT TCGAAGTGCT ATCCGCCGAA AGATCCGCGG
CTTGATGGCG CATTCACCCT GTTCTATATG TCGATCAACA TCGGCTCGTT GATAGCGTTA
TCGCTGGCCC CTGTGATCGC TGATAGATTC GGTTATTCAG TCACCTACAA CCTGTGCGGG
GCGGGGTTAA TTATCGCATT ACTGGTTTAC ATCGCCTGTC GTGGAATGGT GAAAGACATT
GGTTCTGAAC CCGACTTCCG GCCAATGAGC TTCAGCAAAC TGTTGTACGT GTTACTTGGC
AGCGTGGTGA TGATCTTCGT ATGCGCATGG CTGATGCACA ACGTAGAAGT CGCCAATCTG
GTGCTGATTG TTCTCTCCAT CGTCGTCACC ATCATCTTCT TTCGTCAGGC ATTCAAGCTG
GATAAAACCG GGCGCAATAA AATGTTTGTC GCCTTTGTCC TGATGCTCGA AGCGGTGGTG
TTTTACATTC TCTACGCCCA GATGCCAACA TCGCTGAACT TCTTTGCCAT CAACAACGTG
CATCATGAAA TTCTCGGTTT TTCCATCAAC CCGGTCAGCT TCCAGGCGCT TAACCCGTTC
TGGGTGGTAC TCGCCAGCCC AATACTGGCA GGCATTTACA CGCATCTGGG TAACAAAGGC
AAAGACCTCT CGATGCCGAT GAAATTTACT CTCGGCATGT TTATGTGCTC ACTGGGCTTT
TTGACGGCGG CAGCTGCGGG AATGTGGTTT GCGGATGCAC AAGGGCTGAC ATCGCCATGG
TTTATCGTGC TGGTGTACTT ATTCCAGAGC TTAGGTGAAC TGTTTATTAG CGCCCTTGGC
CTGGCGATGA TTGCTGCCCT GGTGCCGCAG CATTTGATGG GCTTTATTCT CGGGATGTGG
TTCCTGACGC AGGCTGCCGC GTTCTTGCTG GGCGGCTATG TGGCAACATT TACCGCGGTG
CCGGACAACA TTACCGATCC GCTTGAGACG TTGCCCGTCT ATACCAACGT GTTTGGTAAG
ATTGGTCTGG TCACGCTGGG CGTTGCAGTA GTGATGCTGT TGATGGTGCC GTGGCTGAAA
CGCATGATTG CGACGCCGGA AAGCCATTAA
 
Protein sequence
MNTTTPMGML QQPRPFFMIF FVELWERFGY YGVQGVLAVF FVKQLGFSQE QAFVTFGAFA 
ALVYGLISIG GYVGDHLLGT KRTIVLGALV LAIGYFMTGM SLLKPDLIFI ALGTIAVGNG
LFKANPASLL SKCYPPKDPR LDGAFTLFYM SINIGSLIAL SLAPVIADRF GYSVTYNLCG
AGLIIALLVY IACRGMVKDI GSEPDFRPMS FSKLLYVLLG SVVMIFVCAW LMHNVEVANL
VLIVLSIVVT IIFFRQAFKL DKTGRNKMFV AFVLMLEAVV FYILYAQMPT SLNFFAINNV
HHEILGFSIN PVSFQALNPF WVVLASPILA GIYTHLGNKG KDLSMPMKFT LGMFMCSLGF
LTAAAAGMWF ADAQGLTSPW FIVLVYLFQS LGELFISALG LAMIAALVPQ HLMGFILGMW
FLTQAAAFLL GGYVATFTAV PDNITDPLET LPVYTNVFGK IGLVTLGVAV VMLLMVPWLK
RMIATPESH