Gene YpsIP31758_1552 details

Gene Information

Locus tag: YpsIP31758_1552
Symbol: mdoG
ID: 5388071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pseudotuberculosis IP 31758
Kingdom: Bacteria
Replicon accession: NC_009708
Strand:
Start bp: 1808388
End bp: 1809923
Gene Length: 1536 bp
Protein Length: 511 aa
Translation table: 11
GC content: 49%
IMG OID: 640864533
Product: glucan biosynthesis protein G
Protein accession: YP_001400529
Protein GI: 153947555
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3131] Periplasmic glucans biosynthesis protein
TIGRFAM ID:
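
The start and end coordinates, the gene length, and the protein length in this record are mutually consistent, and a short Python sketch can confirm that. It assumes 1-based inclusive coordinates and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1808388, 1809923)
print(length)                     # 1536
print(protein_length_aa(length))  # 511
```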


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACTGCG CCAGATGGTT GGGTGCTACG CTCTTGCTGT TATTCTCCAC CTCTCAGGCA 
CAGGCTTTTT CGTTAGATGA TGTTGCCAAG CAGGCGCAGG CACTGGCGGG TAAAAGCTTC
GAAGCACCGA AGAGTAACCT TCCAGCGCAA TTTCGCGATA TGAAGTTTAC TGATTATCAA
CAAATCCAGT TTAACCACGA TAAGTCGTAT TGGGGTAAGC TGGATACGCC CTTTAAGCTA
GAGTTTTATC ATCAGGGCAT GTATTTCGAT ACGCCAGTAC AGATCAATGA AGTGACTGCG
ACTCGGGTTA AGCCCATCAG TTATAGCCCG GACTATTTCG ATTTCGGCTC GGTCAAACAT
GATTCGGAAT CCACAAAGAC GTTAGGCTTC GCGGGTTTCA AAGTGCTTTA CCCTATTAAT
AAGGCTGATA AAAAAGACGA AATCGTCAGT ATGCTCGGTG CCAGCTATTT TCGTGTGATT
GGTAAAGGTC AGGTTTATGG TCTTTCTGCT CGCGGGCTGG CTATTGATAC CGCATTACCT
TCTGGTGAGG AATTCCCGCG CTTTCGCGAG TTTTGGATTG AGCAGCCCAA ACCGAATGAC
AAGCATTTGG TGATCTATGC ATTGCTGGAC TCCCCCCGCG CCACCGGTGC TTATCGTTTT
ATCATCTCTC CAGGGCGTGA TACGACCATT GATGTTCAGG CTAAAGTCTT CCTGCGTGAC
AAAGTGGATA AACTGGGGAT TGCGCCGTTA ACCAGTATGT TTTTGTTCGG CCCCCATCAA
CCGTCTACGG TCGTGAATTA CCGTCCAGCA TTACATGATT CCAATGGCCT TTCTATTCAT
GCCGGTAATG GTGAATGGAT CTGGCGGCCA CTGAATAACC CTAAACATTT ATCGGTCAGT
GTCTATACCG TTGAGAATCC AAAAGGTTTC GGCTTATTAC AACGTGGGCG TGAATTCTCT
CAATATGAGG ATTTGGATGA TCGCTATGAC CTGCGCCCAA GCGCGTGGGT TGAACCTCGG
GGCGAGTGGG GGAAAGGCAA AGTGGAATTG GTGGAGATCC CAACGGCTGA TGAGACAAAC
GATAATATCG TGGCTTTCTG GACACCGGAT GTGTTGCCAA ATGCGAAGGA GTCACTCGAT
CTTAATTATC GCCTGCACTT TACCCGTGAT GAAGAGAAGT TACACTCCCC GGATGTGGCC
TATGTGAAAC AGACATTGCG CTCAACGGGG GAGGTGAAAC AGTCGAATTT AGTTCGTGAG
CCTGATGGCA GTATTGCTTT CTTGGTCGAT TTTGTTGGGC CAGCACTGAC ATCATTGGAT
GAAAACACGC CGTTGGCTTC TCAGGTTAGC GTGGATGACA ACGGTGAGTT GCTCGAAAAT
ACGGTGCGCT ATAACCCGGT GACCAAGGGG TGGCGTTTGA CACTGCGGCT GAAAGTTAAG
GATGCGAAGA AACCGATAGA GATGCGCGCC GCGCTGGCTA ACGGTGATAA AACCCTGACT
GAAACCTGGA GCTACCAGTT ACCTGCCAAT GAATAA
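
The GC content value in the gene record can be recomputed from the listing above. The sketch below uses hypothetical function names: it strips the spaces and line breaks that separate the 10-base blocks, then reports the percentage of G and C bases. The example call uses only the first block, so the printed value applies to those 10 bases and not to the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove block spacing and line breaks, and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First 10-base block of the gene sequence: 4 G + 2 C out of 10 bases.
print(gc_percent("GTGAACTGCG"))  # 60.0
```

Passing the full listing as one multi-line string should work the same way, since all whitespace is removed before counting.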
 
Protein sequence
MNCARWLGAT LLLLFSTSQA QAFSLDDVAK QAQALAGKSF EAPKSNLPAQ FRDMKFTDYQ 
QIQFNHDKSY WGKLDTPFKL EFYHQGMYFD TPVQINEVTA TRVKPISYSP DYFDFGSVKH
DSESTKTLGF AGFKVLYPIN KADKKDEIVS MLGASYFRVI GKGQVYGLSA RGLAIDTALP
SGEEFPRFRE FWIEQPKPND KHLVIYALLD SPRATGAYRF IISPGRDTTI DVQAKVFLRD
KVDKLGIAPL TSMFLFGPHQ PSTVVNYRPA LHDSNGLSIH AGNGEWIWRP LNNPKHLSVS
VYTVENPKGF GLLQRGREFS QYEDLDDRYD LRPSAWVEPR GEWGKGKVEL VEIPTADETN
DNIVAFWTPD VLPNAKESLD LNYRLHFTRD EEKLHSPDVA YVKQTLRSTG EVKQSNLVRE
PDGSIAFLVD FVGPALTSLD ENTPLASQVS VDDNGELLEN TVRYNPVTKG WRLTLRLKVK
DAKKPIEMRA ALANGDKTLT ETWSYQLPAN E
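
The protein sequence corresponds to the gene sequence translated with translation table 11, as stated in the gene record. A minimal sketch of that translation follows. It assumes the standard codon assignments in NCBI's TCAG ordering and the table 11 rule that an alternative start codon such as GTG is read as methionine at the first position. The helper names are hypothetical, and this is an illustration rather than the pipeline that produced the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(raw: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = "".join(raw.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons initiate with methionine
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence; GTG is the start codon here.
print(translate_cds("GTGAACTGCG CCAGATGGTT GGGTGCTACG"))  # MNCARWLGAT
```

The output matches the first 10 residues of the protein sequence, and the same function applied to the last line of the gene listing gives `ETWSYQLPANE`, the final residues before the TAA stop codon.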