Gene YpsIP31758_2142 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_2142 
Symbol 
ID5384650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp2464149 
End bp2465789 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content52% 
IMG OID640865128 
Productmajor facilitator transporter 
Protein accessionYP_001401115 
Protein GI153950075 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATC AATCAGTTTC TCGTACAGGT ACGGAAGTTA AGGAGTCCTG GGTGCCGATG 
ATTACCATCG CGCTGGCCCA GATCCTGATG TCATTCAACG TGGCCTCCCT ACCTGTTGCG
TTGGGCGGGA TGGTAAAGAG CTTTAATGTG CCACCAACCA CTATTGCCAC GGCGATAGTC
ATGTACTCAT TGTCGGTTGC AGGCTTTGTG ATGTTGGGTG CCAAACTCAA CCAACGCTTT
GGGCCATTGA TAGTATTCCG CTGTACAGTT CTGTTATTCG GCCTGGCTCA GACCATGATG
ACATTCAGCC CGAATGTCAC TGTAATGATC GGTGCGCAGG CACTGAGTGG TCTGGCAGGT
GCGGCATTGG TACCGGCACT GGTGGCGTTA ATTGCTGAAA ACTACCGTGG GCCTCAACAG
GCCACCGCAC TGGGGGCATT AGGTTCTGCT CGCGCAGGGG CGGGTGTTGC TGCGTTCCTG
ATCGGCGGTA TTTTGGGAAC CCATATCGGC TGGCGTCCAG CGTTCGGTAT TTTGATTGTG
TTGTCTGTTA TCGTTTTTGT ACTGAGTTTC CGTCTGAAAG CAGATAAAGG CCGCCCAGAA
GTGGGTATTG ATGTTATCGG TGTCGTTTTA GCTGCATCTG CGATTATTTT GTTGTCGTTC
GGCTTTAATA ACCTGAACCG TTGGGGCTTT GGTCTGGTTC GTGACGGCGC GCCGTTTGAC
CTGCTCGGTT TCTCACCAGC GCCGTTTATG ATCGTGTTGG GTATCGTCTT GGGTCAGGCA
TTTGTGGTCT GGACCCGCCG CCGTCAGGAG CAAGGTAAAA CGCCATTACT GGCGCTGGAC
GTGATTACAT CACCCAGTGA GCGGGCGGCG GTATTTGCCA TGTTTGCGGT GGTTGCGCTG
GAAGCCATGC TGAACTTTTC TGTTCCGCTG TATATACAAA TCGTGCAGGG CAGTTCGCCG
ATGGCAACAG CTATCGCTAT GATGCCGTTT AACCTATCGG TATTCTTCTC TGCGATGTTG
ATTGTACGTT TCTACAAGAA ACTGACTCCG CGTAAAATTG GTCGTTACGG TTTCATCACT
TGTACTTTGG CGCTGTTGTG GCTGGCATTC GTGGTACGTA ATAACTGGAG TGAGTGGTCG
GTCTTGATTG GTTTGGTGGT GTTCGGTATC GGACAAGGCT CACTCGTCAC CTTGCTGTTC
AACGTACTGG TCAGTGCATC ACCAAAAGAA TTGGCAGGCG ATGTGGGTTC TTTACGTGGT
ACGACCAACA ACCTGGCAAG CGCCATCGGT ACAGCGGTTG CAGGTGCCTT GCTGGTGGGC
TTGTTAAGTG CCAACGTGAT GCGCGGTGTC GCTGAAACGC CGATTCTGAC GGATGAGATC
CAAGCTCAGG TCAATATGGA TAGCATCAAC TTCGTCAGTA ATGATCGCCT GAACAGTGTA
TTGGCTCAAA CCTCCGCGAC CCAAGAACAG GTTGCCGAAG CGGTGCGGGT GAATGAAGAA
GCACGGTTAC GTGCGCTGAA ATTCGGTTTG CTGGTTATGG CGCTGCTATC GCTGTTGGCT
ATCTTCCCTG CTGGCCGCTT ACCTGACTAT CTGCCGGGTG AACTGCCTGC TGATAATCTG
GATAAAAAAG CCAGCAAGTA A
 
Protein sequence
MANQSVSRTG TEVKESWVPM ITIALAQILM SFNVASLPVA LGGMVKSFNV PPTTIATAIV 
MYSLSVAGFV MLGAKLNQRF GPLIVFRCTV LLFGLAQTMM TFSPNVTVMI GAQALSGLAG
AALVPALVAL IAENYRGPQQ ATALGALGSA RAGAGVAAFL IGGILGTHIG WRPAFGILIV
LSVIVFVLSF RLKADKGRPE VGIDVIGVVL AASAIILLSF GFNNLNRWGF GLVRDGAPFD
LLGFSPAPFM IVLGIVLGQA FVVWTRRRQE QGKTPLLALD VITSPSERAA VFAMFAVVAL
EAMLNFSVPL YIQIVQGSSP MATAIAMMPF NLSVFFSAML IVRFYKKLTP RKIGRYGFIT
CTLALLWLAF VVRNNWSEWS VLIGLVVFGI GQGSLVTLLF NVLVSASPKE LAGDVGSLRG
TTNNLASAIG TAVAGALLVG LLSANVMRGV AETPILTDEI QAQVNMDSIN FVSNDRLNSV
LAQTSATQEQ VAEAVRVNEE ARLRALKFGL LVMALLSLLA IFPAGRLPDY LPGELPADNL
DKKASK