Gene YpsIP31758_2762 details


Gene Information

Locus tag: YpsIP31758_2762
Symbol:
ID: 5387346
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 3115016
End bp: 3116170
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 50%
IMG OID: 640865755
Product: major facilitator transporter
Protein accession: YP_001401726
Protein GI: 153949416
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
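
The coordinate and length fields above can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the record above.
start_bp = 3115016
end_bp = 3116170

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue plus one terminal stop codon.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the computed values agree with the Gene Length and Protein Length fields of the record.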


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.00913666
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCACCA AAATACACCA ACAAGCAGTA CAGCCCGGTA TTAGCCAACA AGTTTCTACC 
CGGTTAGCTT TTTTTATTGC CGGGTTAGGC ATGGCCGCTT GGGCACCACT TGTTCCCTTT
GCAAAAGCGC GCATTGGTCT TAATGATGCC TCATTGGGTT TATTACTGTT ATGCATTGGT
ATTGGATCGA TGCTGGCGAT GCCGCTCACT GGCGTGCTTA CCGCGAAGTG GGGCTGTCGG
GCCGTCATTT TACTGGCAGG CGCAGTGCTC TGTTTAGATT TGCCTTTACT CGTATTGATG
AATACTCCCG CGACGATGGC TATCGCACTA TTAGTATTCG GTGCAGCTAT GGGCATAATA
GATGTGGCGA TGAACATTCA GGCTGTCATT GTTGAAAAAG CCAGTGGCCG GGCGATGATG
TCTGGCTTCC ACGGTTTATT CAGTGTCGGT GGGATTGTTG GTGCAGGAGG TGTCAGTGCT
CTATTGTGGC TAGGCCTCAA CCCACTGACA GCGATTATGG CTACCGTAGT ACTCATGATT
ATTTTGCTGC TGGCAGCCAA TAAGAATCTG TTACGTGGCA GCGGTGAACC CCATGATGGG
CCATTGTTTG TTTTTCCCCG TGGCTGGGTG ATGTTCATCG GCTTTTTATG TTTTGTCATG
TTTTTGGCAG AAGGCTCGAT GCTTGACTGG AGTGCCGTCT TCCTGACGAC GCTACGCGGC
ATGTCGCCAT CACAAGCAGG TATGGGCTAC GCCGTATTCG CCATCGCTAT GACACTTGGC
CGCCTAAACG GTGATCGGAT TGTCAATGGG CTGGGCCGTT ACAAGGTCTT ATTAGGTGGC
AGTTTATGTT CTGCCATCGG GATTATTATC GCAATCAGTA TTGATAGCTC AATGGCTGCC
ATTATTGGCT TCATGTTAGT GGGTTTCGGC GCATCGAATG TGGTACCGAT CTTGTTTACC
GCCGCAGGTA ATCAAACCGT TATGCCTGCC AACCTGGCGG TTGCGTCAAT TACAACGATC
GGTTACGCGG GAATTTTGGC TGGCCCGGCA GCTATCGGCT TTATTGCACA ATTAAGTAGT
CTATCGGTTG CTTTTGGCTG TGTAGCACTT CTGTTATTAA CCGTTGCTGC CAGCGCCAGA
GCCGTCACGC GCTAA
 
Protein sequence
MSTKIHQQAV QPGISQQVST RLAFFIAGLG MAAWAPLVPF AKARIGLNDA SLGLLLLCIG 
IGSMLAMPLT GVLTAKWGCR AVILLAGAVL CLDLPLLVLM NTPATMAIAL LVFGAAMGII
DVAMNIQAVI VEKASGRAMM SGFHGLFSVG GIVGAGGVSA LLWLGLNPLT AIMATVVLMI
ILLLAANKNL LRGSGEPHDG PLFVFPRGWV MFIGFLCFVM FLAEGSMLDW SAVFLTTLRG
MSPSQAGMGY AVFAIAMTLG RLNGDRIVNG LGRYKVLLGG SLCSAIGIII AISIDSSMAA
IIGFMLVGFG ASNVVPILFT AAGNQTVMPA NLAVASITTI GYAGILAGPA AIGFIAQLSS
LSVAFGCVAL LLLTVAASAR AVTR
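
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own tooling. It builds the codon table from the standard amino-acid string in TCAG order, which translation table 11 shares with the standard code for sense and stop codons (the two differ only in permitted start codons). The `translate` helper is a hypothetical name for this example, and only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in TCAG order; table 11 assigns the same amino acids
# and stop codons as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, until the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGAGCACCA AAATACACCA ACAAGCAGTA"))
# Final 15 bases of the gene sequence above, including the stop codon.
print(translate("GCCGTCACGC GCTAA"))
```

The two calls reproduce the first ten residues (MSTKIHQQAV) and the last four residues (AVTR) of the protein sequence listed in the record.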