Gene EcHS_A0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0472 
SymbolproY 
ID5592827 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp484778 
End bp486151 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content53% 
IMG OID640919655 
Productputative proline-specific permease 
Protein accessionYP_001457240 
Protein GI157159922 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAGTA AGAACAAGCT AAAGCGTGGG CTAAGTACCC GCCACATACG CTTTATGGCA 
CTGGGTTCAG CAATTGGCAC CGGGCTGTTT TACGGTTCGG CAGACGCCAT CAAAATGGCC
GGTCCGAGCG TGTTGTTGGC CTATATTATC GGTGGTATCG CGGCGTATAT CATTATGCGT
GCGCTGGGGG AAATGTCGGT ACATAACCCG GCAGCCAGCT CTTTCTCGCG TTATGCGCAG
GAAAACCTCG GCCCGCTGGC AGGTTACATT ACCGGCTGGA CCTATTGCTT TGAAATCCTT
ATTGTCGCCA TCGCCGATGT GACCGCTTTT GGTATCTATA TGGGTGTCTG GTTCCCGACG
GTGCCGCACT GGATTTGGGT ACTGAGCGTG GTGCTGATCA TTTGCGCCGT AAACCTGATG
AGCGTGAAGG TCTTCGGTGA GCTGGAATTC TGGTTCTCGT TCTTTAAAGT CGCCACCATC
ATCATCATGA TTGTCGCCGG TTTCGGCATC ATCATCTGGG GGATTGGCAA CGGCGGGCAA
CCGACCGGTA TTCATAACCT GTGGAGCAAC GGCGGCTTCT TCAGTAACGG CTGGCTTGGG
ATGGTGATGT CGTTGCAAAT GGTGATGTTT GCTTACGGTG GGATCGAAAT TATCGGGATT
ACCGCCGGTG AAGCGAAAGA TCCTGAGAAA TCGATACCGC GTGCGATTAA CTCCGTGCCG
ATGCGTATTC TGGTGTTCTA CGTCGGTACG CTGTTCGTCA TTATGTCTAT CTACCCGTGG
AATCAGGTTG GCACTGCCGG TAGCCCGTTC GTGCTGACGT TCCAGCATAT GGGCATTACC
TTTGCCGCCA GCATTCTTAA CTTTGTTGTG CTGACTGCTT CGCTGTCGGC AATTAACAGT
GACGTATTTG GCGTAGGCCG TATGCTCCAC GGTATGGCAG AGCAGGGCAG CGCGCCGAAA
ATTTTCAGCA AAACGTCGCG TCGCGGTATT CCGTGGGTTA CGGTGCTGGT GATGACTACC
GCGCTGCTGT TTGCGGTGTA TCTGAACTAC ATCATGCCGG AAAACGTCTT CCTGGTGATC
GCTTCGCTGG CAACCTTCGC CACGGTGTGG GTGTGGATTA TGATCCTGCT GTCGCAAATT
GCCTTCCGTC GCCGTTTGCC GCCAGAAGAA GTTAAGGCGC TGAAATTTAA AGTGCCGGGT
GGGGTAGCAA CGACCATCGG CGGTTTGATT TTCCTGCTCT TTATTATCGG GTTGATTGGT
TATCACCCGG ATACGCGTAT CTCGCTGTAT GTCGGTTTCG CGTGGATTGT TGTGCTGTTG
ATTGGCTGGA TGTTTAAACG CCGCCACGAT CGTCAGCTGG CTGAAAACCA ATAA
 
Protein sequence
MESKNKLKRG LSTRHIRFMA LGSAIGTGLF YGSADAIKMA GPSVLLAYII GGIAAYIIMR 
ALGEMSVHNP AASSFSRYAQ ENLGPLAGYI TGWTYCFEIL IVAIADVTAF GIYMGVWFPT
VPHWIWVLSV VLIICAVNLM SVKVFGELEF WFSFFKVATI IIMIVAGFGI IIWGIGNGGQ
PTGIHNLWSN GGFFSNGWLG MVMSLQMVMF AYGGIEIIGI TAGEAKDPEK SIPRAINSVP
MRILVFYVGT LFVIMSIYPW NQVGTAGSPF VLTFQHMGIT FAASILNFVV LTASLSAINS
DVFGVGRMLH GMAEQGSAPK IFSKTSRRGI PWVTVLVMTT ALLFAVYLNY IMPENVFLVI
ASLATFATVW VWIMILLSQI AFRRRLPPEE VKALKFKVPG GVATTIGGLI FLLFIIGLIG
YHPDTRISLY VGFAWIVVLL IGWMFKRRHD RQLAENQ