Gene YpAngola_A2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A2039 
SymbolputP 
ID5800509 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp2126349 
End bp2127833 
Gene Length1485 bp 
Protein Length494 aa 
Translation table11 
GC content49% 
IMG OID641339959 
Productproline permease 
Protein accessionYP_001606509 
Protein GI162420458 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID[TIGR00813] transporter, SSS family
[TIGR02121] sodium/proline symporter 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.241899 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGA ATACGCCGAT GCTGGTGACA TTTTTAGTTT ACATTTTGGG GATGATATTA 
ATCGGGTTGA TCGCCTATCG GGCGACCAAT AACTTCGATG ACTACATCTT GGGTGGCCGG
CGTTTAGGCA GTGTCGTCAC CGCGTTATCC GCGGGTGCTT CCGATATGAG TGGCTGGTTA
TTAATGGGGT TACCGGGTGC GGTTTTTTTG TCCGGTATTT CCGAAAGTTG GATAGCCATA
GGCTTGACAA TCGGTGCTTA TCTTAATTGG AAATGGGTGG CAGGCCGTTT ACGGGTTCAT
ACCGAAGCCA ATAATAATGC CCTGACATTA CCCGATTATT TCACCAGTCG CTTTGAAGAC
AAGAGTAAAT TACTGCGGGT CATATCCGCT GTGGTTATCT TAGTCTTCTT TACTATTTAC
TGTGCTTCAG GGATTGTGGC CGGTGCGCGT TTGTTCGAAA GCACTTTCGG TATGGACTAC
GGAACCGCAT TATGGGCAGG GGCCGCCGCG ACCATCATCT ATACCTTTAT TGGGGGGTTC
CTGGCCGTAA GCTGGACAGA TACTGTACAG GCGACGTTAA TGATTTTCGC CCTGATCCTG
ACGCCAATTA TCGTGATTCT TGCTGTTGGG GGGATAGATA CCTCAATGAT GGTGATCGCG
GCGAAAAACC CCGCGAATAT TGATATGTTC AAAGGGTTGA ATCTGGTGGC GATCCTCTCG
CTGTTGGGAT GGGGGTTGGG CTACTTTGGT CAGCCACATA TTTTGGCTCG CTTCATGGCG
GCAGACTCTC ATCGTACTAT CCGTAGTGCC CGCCGTATTA GTATGACCTG GATGATCCTT
TGTCTGGCGG GGACTATCGC AGTAGGCTTC TTTGGCATTG CTTACTTTGA AAATAACCCT
GAGCAAGCAG GGAATGTGAG CCAAAATGGT GAGCGGGTAT TTATCGAATT AGCTAAGCTG
CTGTTCAACC CGTGGATTGC CGGTATTCTG TTGTCAGCCA TTCTGGCTGC GGTTATGAGT
ACCTTGAGTT GTCAATTATT GGTGTGCTCT AGCGCACTGA CCGAAGATTT ATATAAAGCA
TTCTTGCGTA AAAAAGCCAG CCAAAAAGAA CTCGTCTGGG TGGGCCGTGC CATGGTGTTA
TTAGTTGCAT TGATAGCTAT TGCACTGGCG GCAGATCCTG ATAATCGGGT CCTCGGATTA
GTCAGCTATG CATGGGCAGG CTTCGGTGCA GCTTTTGGGC CAGTGGTCCT TATCTCGGTG
CTGTGGCCAC GAATGACACG TAATGGTGCG TTGGTGGGAA TGCTGGTCGG TGCCATCACG
GTGCTTGTCT GGAAACAGTA TGGCTGGTTA GATCTGTATG AAATTATTCC AGGCTTCTTG
TTTGCCAGTT TGGCTATTTT TGTCGTCAGC CTGATGGGCC GCGAGCCAAG TAATGCCATC
ACCGAACGTT TCCATAAAGC CGAGGCGGAG TTTAAAACAG TTTAA
 
Protein sequence
MTMNTPMLVT FLVYILGMIL IGLIAYRATN NFDDYILGGR RLGSVVTALS AGASDMSGWL 
LMGLPGAVFL SGISESWIAI GLTIGAYLNW KWVAGRLRVH TEANNNALTL PDYFTSRFED
KSKLLRVISA VVILVFFTIY CASGIVAGAR LFESTFGMDY GTALWAGAAA TIIYTFIGGF
LAVSWTDTVQ ATLMIFALIL TPIIVILAVG GIDTSMMVIA AKNPANIDMF KGLNLVAILS
LLGWGLGYFG QPHILARFMA ADSHRTIRSA RRISMTWMIL CLAGTIAVGF FGIAYFENNP
EQAGNVSQNG ERVFIELAKL LFNPWIAGIL LSAILAAVMS TLSCQLLVCS SALTEDLYKA
FLRKKASQKE LVWVGRAMVL LVALIAIALA ADPDNRVLGL VSYAWAGFGA AFGPVVLISV
LWPRMTRNGA LVGMLVGAIT VLVWKQYGWL DLYEIIPGFL FASLAIFVVS LMGREPSNAI
TERFHKAEAE FKTV