Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | YpsIP31758_0514 |
Symbol | |
ID | 5384425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Yersinia pseudotuberculosis IP 31758 |
Kingdom | Bacteria |
Replicon accession | NC_009708 |
Strand | + |
Start bp | 603786 |
End bp | 607592 |
Gene Length | 3807 bp |
Protein Length | 1268 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640863485 |
Product | hypothetical protein |
Protein accession | YP_001399507 |
Protein GI | 153949116 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.140024 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGTACGA GGTTAAGGGA GCGCCTGAAC ACCCAGTACA CATTGAGTAT CACTGCCAAA ACCCAACGCG ATTTTATCCA GCAGGGCGCG GGCATCAATA TTGGCAAGGA TCTCCAGGTG AACACCGGTG GCGACTGGCT TCTCAGTACG GTGCAACGTA GCGACCAGAT CAGTGCGCAG TATGGCGGCG GCAGTGCGAC CAGTGGCTCT CTCCGCCATC TGGGCAGCGA GGTGAAGGTC GGTGGTGCGC TAAGCGCCAA CGTAGACAAT TTGACCGCCG TGGGGGCGCG GGTGAATGCC GGTACCATCG ATGTGCGGGC GCAGAATATC ACTCTCAGCG CGGCCACTGA CAGCCTGTCT GTTACCGGTG GATCCTCAAG CAAGCGCCAT ACCGCGGCGG TGAACCTCTA TGATGAAACG CTTCTCGGCA GCCAGTTGAA CGCCACGGGC GATATCAATC TGCAGACAGC AAACGATATC ACCCTCAGTG CCAGCGCGGT GCAAACGGAT GGCGCGTTGA AACTGGCGGC GGGTGGCGAT GTCACCCTCA TCTCCCAGAC TGAACAGCAT GACGAGCAGC GCAATCACAC CGGGACTAAA AAAGGGCTGG TCTCCAGCAC CACCGCCCGC AGCGAAGAGG GCCGGAGCCA GACGCTGGCG GTGGGTTCGA TGCTCTCGGC GGGTTCCATT GATGTCAGTA GCCAAAATAT CGCGGTCGCG GGCAGCAGCG TGGTGGCTGA TAAGGATATT CGCCTGCGTG CGCAGGAAAA CCTGACCGTG AGCACCGCGC AGCAGAGCGA GAGCGGGTCT CAGCTATTCG AGCAGAAAAA ATCCGGCCTG ATGAGCACCG GCGGTATCGG TGTCTTTATC GGGACTTCCC GGCAGAAAAC CACCGACCAG ACCCAAACGG TGAGCCATGT TGGCAGCACG GTGGGCAGCC TGACGGGCAA TGTGCGTCTC GAGGCGGGCA ATCAGTTAAC CCTTCACGGC AGCGAAGTGG TGGCGGGTAA AGACCTCGCC CTGACAGGGG CGGATGTCGC GATCAGCGCC GCAGAAAATA GCCGTTCTCA ACAGTATACT GCCGAGAGTA AACAGCGTGG CCTGACGGTG GCGCTGTCCG GACCGGTCGG CAGTGCCGTC AATACGGCGG TCACCACCGC CAAAGCGGCC CGAGAAGAAA ACACCGGCCG GCTGGCGGGA TTACAAGGGG TTAAAGCGGC GCTGTCGGGG GTGCAAGCGG TGCAGGCCGG GCAACTGGTG CAGGCCCAGG GAGGCGGTAT CACTGAGATG GTGGGCGTCA GTGTGTCGTT AGGCTCACAA AAATCGTCCT CGCAGCAACA GCAGGAACAG ACCCAGGTGA GCGGTTCGGC CCTGACGGCG GGCAATAACC TGAGCATCAA GGCCACCGGT GGCGGGAATG CGGCAAACAG CGGCGATATT CTGATCGCCG GCAGCCAGCT TAAGGCCGGG GGCGATACCC GGCTTGATGC GGCGCGTGAC GTGCAGTTAC TCGGCGCGGC CAATAGGCAA AAAACCGACG GTAGCAACAG CAGCCGTGGC GGTAGCATTG GCGTCAGTGT GGGGGGCAGC GGTCTGAGCG TCTTTGCCAA TGCCAACAAG GGGCAGGGCA ATGAGCGCGG TGACGGCACC TTCTGGACGG AAACCACCGT CGACAGTGGC GGGATGTTCT CGTTGCGCAG CGGTCGCGAT ACGGCACTGA CCGGCGCGCA GGTCAGCGCT GAAACGGTCA AGGCCGATGT GGGGCGTAAT CTTACTCTGC AAAGCCAGCA GGACCGCGAT AATTATGACG CGAAGCAGAG CCGTGCCAGC GGCGGTATCA GTGTCCCGGT GGCGGGGGGC GGTGCCGCGG TCAACCTGAG CATGAGCCGT GACAGGCTAT CCAGCCAGTA TGACTCGGTG CAGGCGCAGA CGGGTATTTT TGCCGGTTCT GGCGGTGTTG ATATCCGGGT GGGGGAGCAC ACCCAACTGG ATGGCGCGGT GATTGCCAGC ACGGCGGCAG CCGATAAAAA CACGCTGGAT ACCGGCACAC TGGGCTTCAG TGATATCAAA AATAAAGCGG TATTCACGGT GGAGCATCAG GGCGGCAGCC TGAGCACCGG TGGCCCGGTG GGGTCAGACC TGCTGAGTAA TCTGAGCGGC ATGGTGCTCG CGGGGCTGGG CAATGGCGGA TATGCTGAAG GCACCACGCA GGCGGCAGTG AGTGAGGGCA CGATTACCGT TCGCGACACG GAGAATCAAC AGCAGAATGT TGATGACCTG AGCCGGGACA CCGGGAATGC CAACGGCAGT ATCGGGCCGA TTTTTGATAA AGAGAAAGAG CAGAACCGGC TGAAAGAAGT GCAGCTGATT GGCGAGATAG GCGGTCAGGC GCTGGATATT GCCTCCACGC AGGGCAAGAT AATTGCCACT CACGCGGCAA ACGACAAGAT GAAGGCGGTG AAGCCGGAAG ATATCGCCGC GGCGGAAAAA CAGTGGGAGA AAGCCCATCC GGGCAAGGCG GCCACGGCAG AGGACATCAA CCAGCAGATT TACCAGACGG CATATAATCA GGCATTTAAC GAGTCAGGAT TTGGGACAGG CGGCCCGGTG CAACGCGGTA TGCAGGCGGC GACAGCCGCC GTGCAGGGGC TGGCCGGCGG GAATCTGGGT GCGGCCCTGA CGGGTGCCAG TGCGCCGTAT CTGGCGGGGG TGATTAAGCA AAGTACAGGC GATAATCCGG CGGCTAACAC AATGGCACAC GCGGTATTGG GCGCGGTGAC GGCCTATGCC AGCGGCAACC ATGCGCTGGC GGGTGCGGCT GGCGCGGCCA CGGCGGAGTT GATGGCCCCC ACGATTATCA GCGCGCTGGG CTGGGACAAG AACACACTCA CCGAAGGTCA GAAACAGGCT GTCAGCGCAC TGAGTACATT AGCCGCCGGG CTGGCCGGTG GTTTGACAGG TGACAGCACG GCGGATGCGC TAGCCGGGGG ACAGGCGGGG AAAAATGCGG TAGAGAATAA CTTGCTGGGT GGTAATGAAG AGACCCAGAC CAAGTTCGTG CAGGAGCATG GTAAGAACAT CATGTCCTGT AGCACTGACC CCGGCTCAGT GTCCTGCCAG AAAGGACTGG CGATGCAGGA TGCGTTAATG GTAGCGCTTC CGGTGGGTCT GGGTGGTGGC TTACTGGCTG CAGCCTCTCC AGAAATCGCA GCGGCAGCTA AAGCAGCGAT ACAGGCGTGT GCGGGTAATG TAGTACTGTG CCTGAATAAT GGCGGTATCC AGATGTCAGA AGCGATAGTG CCGGGTGGTG TGGGAGCCAG TGGTGCAATC GGTATAGGTA AGACAGCTGC AGAGGCGACG GTGGCGAAAG CTGAGGCTCT TGCTGTAAAT ACTGCTAAAC CTGGATGGCT GGCGAAAATA GAAGCTGGGA ATAGATTTAA TACCGAGCAG TCTAAAAAAT ACCCATATAA TGAGTTATAT GTAAATAGGC CTAATGGGAG TGGTTATTAT CGTGTTGATT CTTATAACCC GACAACAGGT GAGATAATCT CTCGAAAACT CACTCAACTG TCTGATATAT CAGAAAAGAC GGCAAATAAC TATATTCGGG AAGTGGTGAA TAAATATCCG GCGGGTGCTA CTATTGCTAA TGTCCCATCA AGTGGAAATC TGGGCGGGGA ATTGCTTAAA GGTAGTAATA TACTTGAAGT TCCACCTCAG ATTAAACCAG TACCTCAATC TGTTTTGGAT GCAGCAAAAA GTGCTGACGT TATCATTCGA GATACATATG GTAAGGTATA TAAATGA
|
Protein sequence | MGTRLRERLN TQYTLSITAK TQRDFIQQGA GINIGKDLQV NTGGDWLLST VQRSDQISAQ YGGGSATSGS LRHLGSEVKV GGALSANVDN LTAVGARVNA GTIDVRAQNI TLSAATDSLS VTGGSSSKRH TAAVNLYDET LLGSQLNATG DINLQTANDI TLSASAVQTD GALKLAAGGD VTLISQTEQH DEQRNHTGTK KGLVSSTTAR SEEGRSQTLA VGSMLSAGSI DVSSQNIAVA GSSVVADKDI RLRAQENLTV STAQQSESGS QLFEQKKSGL MSTGGIGVFI GTSRQKTTDQ TQTVSHVGST VGSLTGNVRL EAGNQLTLHG SEVVAGKDLA LTGADVAISA AENSRSQQYT AESKQRGLTV ALSGPVGSAV NTAVTTAKAA REENTGRLAG LQGVKAALSG VQAVQAGQLV QAQGGGITEM VGVSVSLGSQ KSSSQQQQEQ TQVSGSALTA GNNLSIKATG GGNAANSGDI LIAGSQLKAG GDTRLDAARD VQLLGAANRQ KTDGSNSSRG GSIGVSVGGS GLSVFANANK GQGNERGDGT FWTETTVDSG GMFSLRSGRD TALTGAQVSA ETVKADVGRN LTLQSQQDRD NYDAKQSRAS GGISVPVAGG GAAVNLSMSR DRLSSQYDSV QAQTGIFAGS GGVDIRVGEH TQLDGAVIAS TAAADKNTLD TGTLGFSDIK NKAVFTVEHQ GGSLSTGGPV GSDLLSNLSG MVLAGLGNGG YAEGTTQAAV SEGTITVRDT ENQQQNVDDL SRDTGNANGS IGPIFDKEKE QNRLKEVQLI GEIGGQALDI ASTQGKIIAT HAANDKMKAV KPEDIAAAEK QWEKAHPGKA ATAEDINQQI YQTAYNQAFN ESGFGTGGPV QRGMQAATAA VQGLAGGNLG AALTGASAPY LAGVIKQSTG DNPAANTMAH AVLGAVTAYA SGNHALAGAA GAATAELMAP TIISALGWDK NTLTEGQKQA VSALSTLAAG LAGGLTGDST ADALAGGQAG KNAVENNLLG GNEETQTKFV QEHGKNIMSC STDPGSVSCQ KGLAMQDALM VALPVGLGGG LLAAASPEIA AAAKAAIQAC AGNVVLCLNN GGIQMSEAIV PGGVGASGAI GIGKTAAEAT VAKAEALAVN TAKPGWLAKI EAGNRFNTEQ SKKYPYNELY VNRPNGSGYY RVDSYNPTTG EIISRKLTQL SDISEKTANN YIREVVNKYP AGATIANVPS SGNLGGELLK GSNILEVPPQ IKPVPQSVLD AAKSADVIIR DTYGKVYK
|
| |