Gene EcolC_3231 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3231 
SymbolproY 
ID6066766 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3537287 
End bp3538660 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content53% 
IMG OID641602646 
Productputative proline-specific permease 
Protein accessionYP_001726180 
Protein GI170021226 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1113] Gamma-aminobutyrate permease and related permeases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000934301 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGAAAGTA AGAACAAGCT AAAGCGTGGG CTAAGTACCC GCCACATACG CTTTATGGCA 
CTGGGTTCAG CAATTGGCAC CGGGCTGTTT TACGGTTCGG CAGACGCCAT CAAAATGGCC
GGTCCGAGCG TGTTGTTGGC CTATATTATC GGTGGTATCG CGGCGTATAT CATTATGCGT
GCGCTGGGGG AAATGTCGGT ACATAACCCG GCCGCCAGCT CTTTCTCGCG TTATGCGCAG
GAAAACCTCG GCCCGCTGGC AGGTTACATT ACCGGCTGGA CCTACTGCTT TGAAATCCTT
ATTGTCGCCA TCGCCGATGT GACCGCTTTT GGTATCTATA TGGGTGTCTG GTTCCCGACG
GTGCCGCACT GGATTTGGGT ACTGAGCGTG GTGCTGATCA TTTGCGCCGT AAACCTGATG
AGCGTGAAGG TATTCGGTGA GCTGGAATTC TGGTTCTCGT TCTTTAAAGT CGCCACCATC
ATCATCATGA TTGTCGCCGG TTTCGGCATC ATCATCTGGG GGATTGGCAA CGGCGGGCAA
CCGACCGGTA TTCATAACCT GTGGAGCAAC GGCGGCTTCT TCAGTAACGG CTGGCTTGGC
ATGGTAATGT CGTTGCAAAT GGTGATGTTT GCTTACGGTG GGATCGAAAT TATCGGGATT
ACCGCCGGTG AAGCGAAAGA TCCTGAGAAA TCGATACCGC GTGCGATTAA CTCCGTGCCG
ATGCGTATTC TGGTGTTCTA CGTCGGTACG CTGTTCGTCA TTATGTCTAT CTACCCGTGG
AATCAGGTTG GCACTGCCGG TAGCCCGTTC GTGCTGACGT TCCAGCATAT GGGCATTACC
TTTGCCGCCA GCATTCTTAA CTTTGTTGTG CTGACTGCTT CGCTGTCGGC AATTAACAGT
GATGTATTTG GCGTAGGCCG TATGCTCCAC GGTATGGCAG AGCAGGGCAG CGCGCCGAAA
ATTTTCAGCA AAACGTCGCG TCGCGGTATT CCGTGGGTTA CGGTGCTGGT GATGACTACC
GCGCTGCTGT TTGCGGTGTA TCTGAACTAC ATCATGCCGG AAAACGTCTT CCTGGTGATC
GCTTCGCTGG CAACCTTCGC CACGGTGTGG GTGTGGATTA TGATCCTGCT GTCGCAAATT
GCCTTCCGTC GCCGTTTGCC GCCAGAAGAA GTTAAGGCGC TGAAATTTAA AGTGCCGGGT
GGGGTAGCAA CGACCATCGG CGGGCTGATT TTCCTGCTCT TTATTATCGG GTTGATTGGT
TATCACCCGG ATACGCGTAT CTCGCTGTAT GTCGGTTTCG CGTGGATTGT TGTGCTGTTG
ATTGGCTGGA TGTTTAAGCG TCGCCACGAT CGTCAGCTGG CTGAAAACCA ATAA
 
Protein sequence
MESKNKLKRG LSTRHIRFMA LGSAIGTGLF YGSADAIKMA GPSVLLAYII GGIAAYIIMR 
ALGEMSVHNP AASSFSRYAQ ENLGPLAGYI TGWTYCFEIL IVAIADVTAF GIYMGVWFPT
VPHWIWVLSV VLIICAVNLM SVKVFGELEF WFSFFKVATI IIMIVAGFGI IIWGIGNGGQ
PTGIHNLWSN GGFFSNGWLG MVMSLQMVMF AYGGIEIIGI TAGEAKDPEK SIPRAINSVP
MRILVFYVGT LFVIMSIYPW NQVGTAGSPF VLTFQHMGIT FAASILNFVV LTASLSAINS
DVFGVGRMLH GMAEQGSAPK IFSKTSRRGI PWVTVLVMTT ALLFAVYLNY IMPENVFLVI
ASLATFATVW VWIMILLSQI AFRRRLPPEE VKALKFKVPG GVATTIGGLI FLLFIIGLIG
YHPDTRISLY VGFAWIVVLL IGWMFKRRHD RQLAENQ