Gene YpsIP31758_1993 details

Gene Information

Locus tag: YpsIP31758_1993
Symbol:
ID: 5386430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 2294953
End bp: 2296071
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 51%
IMG OID: 640864977
Product: integral membrane protein
Protein accession: YP_001400966
Protein GI: 153947451
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
              [R] General function prediction only
COG ID: [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily
TIGRFAM ID:
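
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2294953, 2296071)
print(length)                     # 1119
print(protein_length_aa(length))  # 372
```

Both values match the Gene Length and Protein Length fields reported for this locus.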


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value: 0.191087
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATGGT ATGATGGACT ACGGCATCCG GGTGTATTGG CGGCATTGTC TGCTGCAATA 
CTGTTTGGTG CCGGTACCCC TCTGGCAAAA CAATTACTGA ATACCGTTAG CCCCTGGTTA
TTGGCCGGTT TACTTTACCT TGGCTCGGGT ATTGGCTTAA CACTCTACCG TTTGATCACT
CGCCCGGCGG CGGTGAGCCT GCCCCGTAAT GAATTATTGT GGTTTATTGG TGCCATCCTC
TCCGGGGGGA TCATCGCACC CGTGCTGCTA ATGGTCGGCC TCACGGGTAT GCCCGCCTCT
GGCGCATCAC TGTTACTCAA TGCTGAAGGG GTATTCACCG CCCTTTTAGC CTGGTTTGCC
TTCAAAGAGA ATGTTGACCG TCGTATTGCT CTGGGGATGG TGATTATCAT CGCTGGCGCA
GTTGTGCTTA GCTGGCCAGA AGAAGTACTT AACTGGTGGC CAAAAGAGGC TCAATTTGCC
GGATTATGGC CGACGCTGGC CATTTTAGGT GCCTGCTTTG CCTGGGGAAT TGATAACAAT
CTGACCCGTA AAGTCTCGCT GAACGATGCA ACCTGGATTG CCGCCGTCAA AGGGGGCGTT
GCCGGAGTGG TTAATCTGGC GCTGGCCTTC GCCCTCGGAG CAACATTGCC CCCTTTGGCA
AATCTCGCTG GCGCATTGTT GGTTGGATTT TTGGCTTATG GTGTCAGTTT GGCGCTATTT
GTCATTGGAT TACGTCACCT CGGTACTGCC CGCACCGGTG CCTATTTTTC TATTGCTCCG
TTCTTAGGCG CAGTGTTGGC TGTCGCCTTA GGTGACACTG TCACCATTCC GTTGCTCATC
GCGGGTATTT TGATGGCGAT AGGGATCGGG TTACATCTTA CGGAGCAGCA TGAACATCAA
CATACCCATG ATGAAATGAT ACATGAGCAT GAACATATTC ATGACGAACA TCATCAACAT
CGCCATGACT TTCCGGTAGA CGCGGGTACC GCGCATAAGC ATCGTCATCA GCACCTACCG
ATGGCACACT CTCATTCGCA TTTTCCTGAT TCACACCATC AGCATAAACA TCCTCGACAT
AAGCATAATC AACATAAGTA TCATCAACAT AAGCACTAG
 
Protein sequence
MKWYDGLRHP GVLAALSAAI LFGAGTPLAK QLLNTVSPWL LAGLLYLGSG IGLTLYRLIT 
RPAAVSLPRN ELLWFIGAIL SGGIIAPVLL MVGLTGMPAS GASLLLNAEG VFTALLAWFA
FKENVDRRIA LGMVIIIAGA VVLSWPEEVL NWWPKEAQFA GLWPTLAILG ACFAWGIDNN
LTRKVSLNDA TWIAAVKGGV AGVVNLALAF ALGATLPPLA NLAGALLVGF LAYGVSLALF
VIGLRHLGTA RTGAYFSIAP FLGAVLAVAL GDTVTIPLLI AGILMAIGIG LHLTEQHEHQ
HTHDEMIHEH EHIHDEHHQH RHDFPVDAGT AHKHRHQHLP MAHSHSHFPD SHHQHKHPRH
KHNQHKYHQH KH
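
The gene and protein sequences can be cross-checked by translating the DNA. The sketch below is a minimal example, not the tool that produced this record. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate` and `gc_fraction` are hypothetical. The example input is the last line of the gene sequence, copied from this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: first base T, C, A, G, then second, then third.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Drop the spaces and line breaks used for 10-base blocks; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string; stop codons are rendered as '*'."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# Last line of the gene sequence in this record.
tail_dna = "AAGCATAATC AACATAAGTA TCATCAACAT AAGCACTAG"
print(translate(tail_dna))  # KHNQHKYHQHKH*
```

The output matches the last line of the protein sequence (KHNQHKYHQH KH) followed by the stop codon. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.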