Gene YpsIP31758_3033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_3033 
Symbolfsr 
ID5387199 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009708 
Strand
Start bp3413089 
End bp3414303 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content45% 
IMG OID640866039 
Productfosmidomycin resistance protein 
Protein accessionYP_001401993 
Protein GI153949257 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.00068897 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCGATC GTTCTGATAC CGGATGTCAG CAGCCCGTTA ATGTTTCAGT CAAACGTACA 
TCTTTCTCTA TTTTGGGTGC TATCAGCGTA TCTCACCTAC TCAACGATAT GATCCAGTCG
CTGATCCTGG CGATTTATCC TTTATTACAA GCTGAGTTCT CATTGAGTTT TGCTCAAATA
GGATTGATCA CGCTAAGTTA TCAATTGACG GCTTCGTTAC TACAACCGCT GATTGGTTTA
TATACAGATA AACACCCTCA GCCATACTCA TTACCCATCG GTATGGGGTT CACGCTGTCT
GGCATATTAC TGCTGGCAGT AGCCACAACC TTCCCGGTGG TATTATTAGC GGCAGCATTG
GTCGGTACTG GCTCTTCTGT ATTCCACCCA GAATCATCAC GAGTCGCCCG TATGGCATCC
GGTGGCCGCC ATGGTTTGGC TCAATCTGTT TTTCAGGTAG GGGGGAATTT TGGCAGCGCA
CTCGGTCCGT TACTGGCCGC CATCATTATT GCCCCTTACG GTAAAGGCAA TGTGGGCTGG
TTTTCGCTCG CAGCCCTACT GGCAATTGTC GTGCTGTTAC AGGTGAGTAA ATGGTACAAG
CTTCAGCAAC GTGCTTCGTA TGGCAAAGTG TTAAAAACCT CATCAGCCAA AACACTACCA
AAAAATAAAA TTATCAGTAC GTTAGCTATC TTGATGGTGC TGATATTCTC TAAATACTTC
TATTTGACCA GCATTAGCAG CTATTACACC TTTTATTTAA TACATAAGTT TGGCGTTTCG
GTACAAAGTG CCCAGATACA CCTATTTGTT TTCTTATTTG CGGTTGCCGC CGGCACCATT
ATTGGTGGGC CTCTTGGTGA TAAGATTGGT AGAAAATATG TTATTTGGGG TTCTATACTC
GGTGTTGCCC CCTTTACCCT TGCTTTACCC TACGCGTCTT TGTATTGGAC GGGTATTTTA
ACCGTATTTA TTGGTGTTAT TCTCGCTTCC GCCTTTTCAG CAATACTGGT GTATGCTCAA
GAGCTTATCC CAGGGAAAGT CGGCATGGTA TCGGGTTTAT TCTTCGGTTT CGCTTTCGGT
ATGGGGGGAA TAGGTGCAGC AGTACTGGGG TATGTTGCTG ATTTAACCAG TATTGAACTG
GTTTATCAAA TATGTGCGTT CTTGCCATTA CTGGGAATAT TCACGGCCTT ACTGCCTAAT
TTGGACGATA AGTAA
 
Protein sequence
MTDRSDTGCQ QPVNVSVKRT SFSILGAISV SHLLNDMIQS LILAIYPLLQ AEFSLSFAQI 
GLITLSYQLT ASLLQPLIGL YTDKHPQPYS LPIGMGFTLS GILLLAVATT FPVVLLAAAL
VGTGSSVFHP ESSRVARMAS GGRHGLAQSV FQVGGNFGSA LGPLLAAIII APYGKGNVGW
FSLAALLAIV VLLQVSKWYK LQQRASYGKV LKTSSAKTLP KNKIISTLAI LMVLIFSKYF
YLTSISSYYT FYLIHKFGVS VQSAQIHLFV FLFAVAAGTI IGGPLGDKIG RKYVIWGSIL
GVAPFTLALP YASLYWTGIL TVFIGVILAS AFSAILVYAQ ELIPGKVGMV SGLFFGFAFG
MGGIGAAVLG YVADLTSIEL VYQICAFLPL LGIFTALLPN LDDK