Gene EcSMS35_3785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3785 
Symbol 
ID6146935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3852287 
End bp3853756 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content52% 
IMG OID641618611 
Productinner membrane transporter YhiP 
Protein accessionYP_001745751 
Protein GI170682598 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID[TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACAA CAACACCCAT GGGGATGCTG CAGCAACCTC GCCCATTTTT CATGATCTTT 
TTTGTCGAGT TATGGGAGCG ATTCGGCTAC TACGGCGTGC AGGGCGTACT GGCTGTTTTC
TTCGTTAAGC AGCTTGGATT CTCGCAAGAA CAGGCTTTTG TCACTTTTGG TGCTTTTGCT
GCGCTGGTCT ATGGCCTCAT TTCCATTGGC GGCTATGTCG GCGACCACCT GCTGGGGACC
AAACGCACCA TTGTTCTCGG AGCACTTGTG CTGGCGATTG GCTACTTCAT GACCGGCCTG
TCGCTACTTA AGCCTGACCT GATTTTCATC GCCCTGGGGA CTATCGCTGT CGGTAACGGC
CTGTTTAAAG CTAACCCAGC CAGCTTGCTT TCGAAGTGCT ATCCGCCGAA AGATCCGCGG
CTTGATGGCG CATTTACCCT GTTCTATATG TCGATCAACA TCGGCTCGTT GATAGCGTTA
TCGCTGGCCC CTGTGATCGC TGATAGATTC GGCTATTCAG TCACCTACAA CCTGTGCGGT
GCGGGATTAA TTATCGCGCT ATTGGTTTAC ATCGCCTGTC GCGGAATGGT GAAAGATATT
GGTTCTGAAC CCGACTTCTG CCCGATGAGC TTCAGTAAAC TGTTGTACGT GTTACTTGGC
AGCGTGGTGA TGATCTTCGT CTGTGCATGG CTGATGCACA ACGTAGAAGT CGCCAATCTG
GTGCTGATTG TTCTCTCCAT CGTCGTCACC ATCATTTTCT TTCGTCAGGC ATTCAAGCTG
GATAAAACTG GGCGCAATAA AATGTTTGTC GCCTTTGTCC TGATGCTCGA AGCGGTGGTG
TTTTACATTC TCTACGCCCA GATGCCAACG TCGCTGAACT TCTTTGCCAT CAACAACGTG
CATCATGAAA TTCTCGGCTT TTCCATCAAC CCGGTCAGCT TCCAGGCGCT TAACCCGTTC
TGGGTGGTAC TCGCCAGCCC AATACTGGCA GGCATTTACA CGCATCTGGG TAACAAAGGC
AAAGACCTCT CGATGCCGAT GAAATTTACT CTCGGAATGT TTATGTGCTC GCTGGGCTTT
TTGACGGCTG CAGCGGCTGG AATGTGGTTT GCGGATGCCC AGGGGCTGAC ATCGCCATGG
TTTATCGTGC TGGTGTACTT ATTCCAGAGC CTGGGTGAAC TGTTTATTAG CGCCCTGGGC
CTGGCGATGA TTGCTGCCCT GGTGCCGCAG CATTTGATGG GCTTTATTCT CGGGATGTGG
TTCCTGACGC AGGCTGCCGC GTTCTTGCTG GGCGGCTATG TGGCAACATT TACCGCAGTA
CCAGACAACA TTACCGATCC GCTTGAGACG TTGCCCGTCT ATACCAACGT GTTTGGTAAG
ATTGGCCTGG TTACGCTGGG CGTTGCGGTG GTGATGCTGC TGATGGTGCC GTGGCTGAAA
CGCATGATTG CGACACCCGA AAGCCATTAA
 
Protein sequence
MNTTTPMGML QQPRPFFMIF FVELWERFGY YGVQGVLAVF FVKQLGFSQE QAFVTFGAFA 
ALVYGLISIG GYVGDHLLGT KRTIVLGALV LAIGYFMTGL SLLKPDLIFI ALGTIAVGNG
LFKANPASLL SKCYPPKDPR LDGAFTLFYM SINIGSLIAL SLAPVIADRF GYSVTYNLCG
AGLIIALLVY IACRGMVKDI GSEPDFCPMS FSKLLYVLLG SVVMIFVCAW LMHNVEVANL
VLIVLSIVVT IIFFRQAFKL DKTGRNKMFV AFVLMLEAVV FYILYAQMPT SLNFFAINNV
HHEILGFSIN PVSFQALNPF WVVLASPILA GIYTHLGNKG KDLSMPMKFT LGMFMCSLGF
LTAAAAGMWF ADAQGLTSPW FIVLVYLFQS LGELFISALG LAMIAALVPQ HLMGFILGMW
FLTQAAAFLL GGYVATFTAV PDNITDPLET LPVYTNVFGK IGLVTLGVAV VMLLMVPWLK
RMIATPESH