Gene EcSMS35_0433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0433 
SymbolproY 
ID6144255 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp443258 
End bp444634 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content53% 
IMG OID641615329 
Productputative proline-specific permease 
Protein accessionYP_001742536 
Protein GI170680647 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATGGAAA GTAAGAACAA GCTAAAGCGT GGGCTAAGTA CCCGACACAT ACGCTTTATG 
GCACTGGGTT CAGCAATTGG CACCGGGCTG TTTTACGGTT CGGCAGATGC CATCAAAATG
GCCGGTCCAA GCGTGTTGTT GGCCTATATT ATCGGTGGTA TCGCGGCGTA TATCATTATG
CGCGCGCTGG GGGAAATGTC GGTACATAAC CCGGCCGCCA GCTCTTTCTC GCGTTATGCG
CAGGAAAACC TCGGACCGCT GGCAGGTTAC ATTACCGGCT GGACCTACTG CTTTGAAATC
CTAATTGTCG CCATCGCCGA TGTGACCGCT TTTGGTATCT ATATGGGGGT CTGGTTCCCG
ACGGTGCCGC ACTGGATTTG GGTACTGAGC GTGGTGCTGA TCATTTGCGC CGTAAACCTG
ATGAGCGTGA AGGTATTCGG TGAGCTGGAG TTCTGGTTCT CGTTCTTTAA AGTCGCCACT
ATCATCATCA TGATTGTCGC CGGTTTCGGC ATCATCATCT GGGGAATTGG CAACGGCGGG
CAACCGACCG GTATTCATAA CCTGTGGAGC AACGGCGGCT TCTTCAGTAA CGGCTGGCTT
GGTATGGTGA TGTCGTTGCA AATGGTGATG TTTGCTTACG GTGGGATCGA AATTATCGGG
ATTACCGCCG GTGAAGCGAA AGATCCTGAG AAATCGATTC CGCGTGCGAT TAACTCCGTG
CCGATGCGTA TTCTGGTGTT CTACGTCGGT ACGCTGTTCG TCATAATGTC TATCTACCCG
TGGAATCAGG TTGGCACTGC CGGTAGCCCG TTCGTGCTGA CGTTCCAGCA TATGGGCATT
ACCTTTGCCG CCAGCATTCT TAACTTTGTT GTGCTGACCG CTTCGCTGTC GGCAATTAAC
AGTGACGTAT TTGGCGTAGG CCGTATGCTC CACGGTATGG CAGAGCAGGG CAGCGCGCCA
AAAATTTTCA GCAAAACCTC GCGTCGCGGT ATTCCGTGGG TTACGGTGCT GGTGATGACT
ACCGCGCTGC TATTTGCGGT GTATCTGAAC TACATCATGC CGGAAAACGT CTTCCTGGTG
ATCGCTTCGC TGGCAACCTT CGCCACGGTG TGGGTGTGGA TTATGATCCT GCTGTCGCAA
ATCGCCTTCC GTCGCCGTTT GCCTCCAGAA GAAGTTAAGG CGCTGAAATT TAAAGTGCCG
GGTGGGGTAG CAACGACCAT CGGCGGTTTG ATTTTCCTGC TCTTTATTAT CGGGTTGATT
GGTTATCACC CGGATACGCG TATCTCGCTG TACGTTGGTT TCGCGTGGAT TGTTGTGCTG
TTGATTGGCT GGATGTTTAA ACGCCGCCAC GATCGTCAGC TGGCTGAAAA CCAGTAA
 
Protein sequence
MMESKNKLKR GLSTRHIRFM ALGSAIGTGL FYGSADAIKM AGPSVLLAYI IGGIAAYIIM 
RALGEMSVHN PAASSFSRYA QENLGPLAGY ITGWTYCFEI LIVAIADVTA FGIYMGVWFP
TVPHWIWVLS VVLIICAVNL MSVKVFGELE FWFSFFKVAT IIIMIVAGFG IIIWGIGNGG
QPTGIHNLWS NGGFFSNGWL GMVMSLQMVM FAYGGIEIIG ITAGEAKDPE KSIPRAINSV
PMRILVFYVG TLFVIMSIYP WNQVGTAGSP FVLTFQHMGI TFAASILNFV VLTASLSAIN
SDVFGVGRML HGMAEQGSAP KIFSKTSRRG IPWVTVLVMT TALLFAVYLN YIMPENVFLV
IASLATFATV WVWIMILLSQ IAFRRRLPPE EVKALKFKVP GGVATTIGGL IFLLFIIGLI
GYHPDTRISL YVGFAWIVVL LIGWMFKRRH DRQLAENQ