Gene YpsIP31758_B0097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpsIP31758_B0097 
Symbol 
ID5384183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pseudotuberculosis IP 31758 
KingdomBacteria 
Replicon accessionNC_009705 
Strand
Start bp107749 
End bp110271 
Gene Length2523 bp 
Protein Length840 aa 
Translation table11 
GC content45% 
IMG OID640857206 
Producttype IV secretion system protein DotA-like protein 
Protein accessionYP_001393395 
Protein GI153930723 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value0.551932 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTATGC CATTTTCACC GTGCGAGATT AGTGATGCTG CGGGGCAATG TTCTGATGTT 
TCGGTAAATC ATTTAAAAGC TGTTTTTGGT GGTGTTATTG ATAGCTTGGT TCTTGGAACT
GACGCTAATA GTGTTGCTGC GAGTTCAAAT ATCCTCGCAA TGATGTTTAG CTTCTTTAAT
AGCGGGTTGT TGATTGTTGC CTCTATGCTT GTTTCTTATG TGGCCATTGT CGGTGTAACT
AATACTGCAA ACGATGGCGA AGCAATGGGC AAACAATGGT CTACAGTCTG GACAACTCTG
CGCTTAGTTT CTGGAGCCGC AGTTTTGTTA CCGACCGCAA GCGGATATTC TATTATCCAG
ATCATTGTAC TTATGATTAC ATTATGGGGG GTAGGATTTG CCAATGGCAT ATATAAAGCT
GGTGTTTCTA TTGGGATATT ATCCCCCAAT GCCATCGTTG GCGGGATAAA TAAGTCGGGA
GAGTATTACG GCTTACGTGA CTTCGCTAAC AATTATATCT CTGTGGCATA TTGTACCCAT
GTTGCAAATA GTATTTTTAA AGATGACACA ACAACCCCCA ATGTCGGGCA GGAACAAATA
AACAACATTA ATGGTGATAT CTCATTTGTT TATCGTGACA GTAACGCCGC GACAAATTTG
GGCGGGGGTT TACCCTTCTG TGGTTCCGTC ACTTTGACTC ACTATAAAGC CGTCAGTAAG
ACTTCTGCTT CGGATCAGAT CTTAGAGACC TTACATGAAA CTTTGCAGTC GGAAAAACAG
CAAGTCGCCC AACAGGTAAT GAACAGCATT AATGCTTGGG TTTCTACATG GCCAGTCACG
CTAAATGATG AGGGGTGGAG TCAGGTAGAT GCTAATAAAT TAAATGAAAT CGTTGTTCAA
GCTGAAAATC AAATTGCTAC TGATGCCAGT GGTCAAGGTG AAGCCGGGGG TAATCAGCTA
AGCAGAACAA TGGATATGTA TGTTGAGACG ATTATCAAGA ATGGTTGGGC TGAGTCTGCC
GCATGGTTCC AAAAGGTGGG AGCAATCAGG GGAGCGATTG CCGGTGTGAT GTCTGCTCAA
GTAGGCAAAG TTTCATCCCC CAGTTTTGTG GCCATGCCTG TCGATGCACG AACAACACAA
CTCGTTTCAT CGGTTAATAC TGTCGCTGAG CGAATTATAA AAAGTGCCGA GTCTAAAGAT
GCCTACAGTA ACAAAGTTGC CGTCCCTCAA GACATTGCTA ATGCCATTCC CAAAAGTGTT
GATTCTATCG ATATAGGATC GTTACAAAAG GATATGGATG ACAAGATGTC TTCTTTTACT
AATGGTGTGA TGCGCACGGT TATTGATACA GCGATTGATG CGGGGGGTAA CAGTAAATCC
TTAGCCTGTG GCACTGCTGG CCAGATGGGA GGATCAGTCA ATAGAATGAA ATGCATCGGT
GATTATCTGT CGGTTATGTC AGGCAGTGTT CAGACTTCAA TTTACATATT AGAAACTGCG
ACAACATCTG TCAGGATTTT ATCTGGTGCT GCCTCATCAG TAAAAATCAT GGGGATCGGG
GTTGATACGG ATAAAGTCAC AACGCCTCTA TGGGATTGGG TTATTGCTGT TCCCGTTGCG
CAGCTTACCT CAATATTGCG TTTTCTGACG CCGATGGCGA TGTATTTCTC AGTGTTACTG
CCTGCCATGC CAAATACCAT TTTTATGATT GTTTATGCAG GCTGGACTTT AGCTGTGTTG
CAAACCGTTA TCGCAGTCTG TTTATGGGCG ATGATGCATA TGACCCCTGA GCGGTCATTC
ATTGGCTCAC AAACTCAAGG TTACTTGCTG CTGTTGGCTC TTTTTGTTCG ACCAGCGCTT
GCGGTTATCG GTTTATTTGC GGCTATTCTT GTCATAGATC CAATGATCGA TTTTACTGCC
CAAGCATTTC TTGATACTCA AGCGGCTATT GGTACATCTA CGGGTGTTAT AGGCTCAATT
AGCCAATTCT TTAGTTTCGG GTGGTGGATG ATGTTGTTCA GTGTAGTGCT GTTGGCCATC
ATGTATATGT GTTTTTCGTT GCCACAAATC TTGCCTGATC ACGTTTTGAA ATGGATTAAC
ATTGGTATCA GTGATCTGGG GGAAACCAGC GCCAGTTCGA ATATGCGTGG AGGATTTGAG
CGTATTGGGC GTGATGATAT GTTGCCACCA AATAGTCCAC TGGCGCGCAT TGGTTCCCGT
GGCGGGTTAG AGAAGAGTGG GGGGGGAGGA ACTGACAGCA GGTTAGAAAA ACCGATCAGA
AGAGATGAAC ATATTGCTTC ATCTGAAGTT ATCTCACCGC ACGGTGCTAA TTCACCTGAT
ACAGGGGGTG GTGGTAAAGG AGGTGGTTCA AGTCATGCTG GGCGTGGTAG CTCATCATCA
TCTTCATCTG GCAATCCAGT TGATACTAGG AGCACTGACT CTAGAGGAAG TGCGAGCGGG
ATTAGTAAGG ATGAGACGTT TTACCAAAGT GAAACACCCG ATGCAGGATC AGGCGAATAT
TAA
 
Protein sequence
MGMPFSPCEI SDAAGQCSDV SVNHLKAVFG GVIDSLVLGT DANSVAASSN ILAMMFSFFN 
SGLLIVASML VSYVAIVGVT NTANDGEAMG KQWSTVWTTL RLVSGAAVLL PTASGYSIIQ
IIVLMITLWG VGFANGIYKA GVSIGILSPN AIVGGINKSG EYYGLRDFAN NYISVAYCTH
VANSIFKDDT TTPNVGQEQI NNINGDISFV YRDSNAATNL GGGLPFCGSV TLTHYKAVSK
TSASDQILET LHETLQSEKQ QVAQQVMNSI NAWVSTWPVT LNDEGWSQVD ANKLNEIVVQ
AENQIATDAS GQGEAGGNQL SRTMDMYVET IIKNGWAESA AWFQKVGAIR GAIAGVMSAQ
VGKVSSPSFV AMPVDARTTQ LVSSVNTVAE RIIKSAESKD AYSNKVAVPQ DIANAIPKSV
DSIDIGSLQK DMDDKMSSFT NGVMRTVIDT AIDAGGNSKS LACGTAGQMG GSVNRMKCIG
DYLSVMSGSV QTSIYILETA TTSVRILSGA ASSVKIMGIG VDTDKVTTPL WDWVIAVPVA
QLTSILRFLT PMAMYFSVLL PAMPNTIFMI VYAGWTLAVL QTVIAVCLWA MMHMTPERSF
IGSQTQGYLL LLALFVRPAL AVIGLFAAIL VIDPMIDFTA QAFLDTQAAI GTSTGVIGSI
SQFFSFGWWM MLFSVVLLAI MYMCFSLPQI LPDHVLKWIN IGISDLGETS ASSNMRGGFE
RIGRDDMLPP NSPLARIGSR GGLEKSGGGG TDSRLEKPIR RDEHIASSEV ISPHGANSPD
TGGGGKGGGS SHAGRGSSSS SSSGNPVDTR STDSRGSASG ISKDETFYQS ETPDAGSGEY