Gene YpsIP31758_B0116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_B0116 
Symbol 
ID5384151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009705 
Strand
Start bp128505 
End bp129893 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content42% 
IMG OID640857225 
Productputative type IV secretion system protein IcmE/DotG 
Protein accessionYP_001393414 
Protein GI153930641 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones57 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGATG AAAACGATAT TGGTGAAACT ATTCCCCCAG AAAATGAAGA TCATTTTGCT 
CCCGATTATG AAGAAGAATA TCAACCTTCA GCCGCAGAAA TTCAGAGAAC AACCAAAGAG
CGGCGTAGAG CAGAAAAAAT ACGAAATCTC AAATCTGTCT TTGGTAGCGG AGTAGGGCGC
ATATCCATCA TATGTGTCGT ATTTGTTGTG GTTATCTTTC TCGCGTTAGG CATCCGTAAT
ATGCGTTCTC CACAACCTCT TGGGACATCT ACAACGGTAG ATGTCCCCGC TGCGCCCCAA
ATCAAAACAG ACAATAGCCC AGTAACAGAG AGCGAAGCTG CACGTCGCAG CCAGATGAAT
GCAAATCAGG CGAGAGAAGC GGCCAAGCTA GGTGACTCAT TTCAGCCCTC TTTTGATACC
AATATCATTG CGGATCAAAA ATCCAAACCC AACTACAACC AAAACAGCAC CGTAATTCCT
GATGCTATTA ACTTCAATAA CCGAAACCAA GCATCTTCAG ATACGCAAAA TCCCAACACG
GGATCACAAA GAAGTGATTC AAATAGTAAT CAAGGAAATA GTCAAAAAAA CCAAGATGGC
CAACAACTAC TGAAAGAATA CACCACTGAG GTAAACAAGC GCGATAAACA TGTAGAAGAT
ATGAAGAGTG AAATTATTAA GCAGTTTTCT CAGGTTCTCG ATAAAGATAC TTTAAATAAT
CAAGGTTCAT ACAGCACGGT CATTTTTAAT GACACTAACA AAAGCAATAA CGACAGAAAA
CCAGAAGAGT CGGTTAAAAC AGTGGCCAGC AATTCATCTG AAAAAAATGC GGCAAAGCCC
CCCCTATTTA AGGCTGGTAG TACCTTATAT GCCGAGACAG GCTCTGCTGC GAATACCGAC
AATGGTGTTG ATACTTTCGC TACCGTTCGT GGGGGAAAAT GGAATGGCAG TGTACTGATC
GGTAAAGTCG TTCAAACAAA TAATAATATC CTTTTTCAAT ACACTCTGCT GGCCCCTCAA
GACAATAGAC CATCGGTGAA AATCAACGCG ATTGCACTGA GAGAGGAAGA CGCCAGCCAA
GGGATGGCTG ATGATGTAGA TCACCACATA TTGATGCGCT ATGGTTCATT AGGTGCGGCT
TCATTGCTAT CTGGTTACGG TAAATCCTAT GAGAACATTG GAACAACAAC CAATAATGGA
AGCACGACAA CTCAAACAAC GAATACACCC AGTAATAAGC AAATAATCGG TCAAGCCGTT
GGTGAACTTG GTTCGAATTT TGCAAATGAG ATAAAACGTG GATTTGATAC ACCGACCACT
TATAGCACTA AAGCAAATAC TGGCTTTGCT TTATTATTTA TGTCTGATGT ACCTGATCCT
GATAAATAA
 
Protein sequence
MADENDIGET IPPENEDHFA PDYEEEYQPS AAEIQRTTKE RRRAEKIRNL KSVFGSGVGR 
ISIICVVFVV VIFLALGIRN MRSPQPLGTS TTVDVPAAPQ IKTDNSPVTE SEAARRSQMN
ANQAREAAKL GDSFQPSFDT NIIADQKSKP NYNQNSTVIP DAINFNNRNQ ASSDTQNPNT
GSQRSDSNSN QGNSQKNQDG QQLLKEYTTE VNKRDKHVED MKSEIIKQFS QVLDKDTLNN
QGSYSTVIFN DTNKSNNDRK PEESVKTVAS NSSEKNAAKP PLFKAGSTLY AETGSAANTD
NGVDTFATVR GGKWNGSVLI GKVVQTNNNI LFQYTLLAPQ DNRPSVKINA IALREEDASQ
GMADDVDHHI LMRYGSLGAA SLLSGYGKSY ENIGTTTNNG STTTQTTNTP SNKQIIGQAV
GELGSNFANE IKRGFDTPTT YSTKANTGFA LLFMSDVPDP DK