Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_4001 |
Symbol | |
ID | 8599445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | - |
Start bp | 4232791 |
End bp | 4235670 |
Gene Length | 2880 bp |
Protein Length | 959 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | |
Product | hypothetical protein |
Protein accession | YP_003310764 |
Protein GI | 269122587 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000175491 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAAAA AGTTATTGTT AGCATTTTTT ATGATTTTAG GAGTAATTAT CCGGGGAAAT GAAATAAGCA GTATCAGGCT TGAAATAAAA AATCCCGGTG AAAAAGCTGT TGTTTTGGAT ACAATGTCTG TTAATGTGGA AATAAACGGC GATATAAGTG TTACCACATA TGACATGACG TTTTATAATC CAAACACCAG AATTCTGGAG GGAGAATTCT CTTTTCCTCT TCAGGAGGGA CAGAAGGTAA CAAGGTATGC TCTTGATGTG AACGGAAAGC TCAGAGAAGG TGTGGCTGCG GAAAAGGAAA AAGCAAGAAC AGCATATGAA AATACAATAA GACAGAAGAT AGATCCCGGA ATAATAGAAA AAACAGCTGG AAATAATTAC AAAACCAGAA TATATCCAAT ACCGTCAAAC GGCTATAAAA GAGTGGTAAT AGCCTATACG GAGGTATTGA AAAACAAAAA CGGAAGCCTT GATTATTTTC TTCCGCTAAA TTACAGTCAG AAAGTAAATA ATTTTTCTCT TGAGATAAAA ATGCTTAAGC AGGAAAGCAG ACCAAACTGG ATAGAAAAAA TAGACAGACT GGAATTTGAT AAAATGGAAA GCGGTTATTA TGCGAAAACT TCACTGAAAA ACTACACGCC TGATAAAAAT GTAAGGATCA GTCTGCCGCT TGATAAAAAG GATAAGGTTT ATACTGAGAA AGCTGATGAC ACTGCTTATT TTACAGCTAA GCTGAATCTG GAAAATAATT ATTATACAGA AAAACCAAAG GCGAAAAATA TTGTGCTTAT CTGGGATACT TCAAATTCAG GTGAAAAAAG AGATACTGAA AAAGAGCTTA CACTGCTGGG AAAATATTTT TCCTATCTGG GAAATGTAAA TATCAGCTTG TATTCAATTG ATAATGATTT TTTGAGCAGA GGAAGCTTTC AGATAAAGAA CGGCAGCTGG GATCAGCTGA AAAATACAAT AAATAATTTT GTATACGACG GAGGGACGCG GTTTAATAAG ATAAGCTTTA AAGATAATGC CGATGAGGTC ATTTTTGTTA CAGACGGTGT GAATACCATA GATTCCAGTG AATTTAAGCT GGCAAATATA CCGTTTATAG TTATAAATTC ATCTAAAGAA TCAGACAGCG GGTTTATAAA ATATATGGCT GATACCAGTA ATGGAAAAGT AATAGATCTG AACAGAGAAG ATATTGATAC GGAATTTGAT AAAATGAAAT ATAACTACTT GAATCTGATA TCATATGAAT ATAATAAATC TGAGATAGAT GAGGTATATC CGAAATCAGA ATCTGATATC AGAGGAAGCT TTGATTTTTC AGGTATACTT AAGGGCAGCA GGGCAGAAAT AACTGTAAAT CTCGGCTTTG GGAATATTAT TACAGAAACA AGAAAAATTC TTATTTCAGC AGATACTAAT TCGCATAATA TAAGTAAGAT ATGGGCAGAG AAAAAAATAG AGAATTTAAG CGGAAATTAT GAAAAAAATA AAAAGGAAAT ATTGAAAACT GCCAAGAAAT ATACTCTTGT TACAAACGAA ACTTCCCTTA TAGTACTGGA CAGAGTAGAG GATTATGTCA GCTATGAAAT AATCCCGCCT GCAGAGCTGC TGGATGAATA TAACAGACAG ATGGCATACA GAAGGAAAAA TGAACAGGAT GAAAAGAAAA ATGGTTTGAA GGAAAGTGTG GAAGTGCTTG AAAGAAGAAA AGAATGGTAT GTCAAACCTG TGCTAAAGTC TGAAAGAGCA AGAGAAAATG TTACGAAAGT CGAAAAAAAT ACAGCTGTAA ATGATTATAT ACCGCAGCCT TCAATACCTG CTGCCGAAAT GCAGAGTGAA CCGGCTTTAA ATAAATCGGC TGTGACTTCA AGCGGAAAGG CTGCTTATTC TGATATGAAA CAGGATGAAA CAGCTGTCGG CAGTAAAGAC ACAGGCGGAA AAAATCTGAG AGTAATTGTT TATGAGGATA AGGACAGAAA CAGTGAATAT ATTAATGAAT ATAAAAAAGT AAAACCGGAA AATATTTATG AAAAATATCT GGAAATGAAA ACAAAATATG GAAAAAATCC TTTCTTCTAC ATAGATACAG CTGATTATCT GATTAAGAAC AATCAGAGAA ATACGGCTCT GAAAGTATTG ACAAATATAC CGGAGCTGTC TTTGGAAAAC CATGAGTATT ACAGAATTCT CGGATATAAG CTTTTGGAAA CAGGAGATAA CGGACTTGCG GTAAAAATAT TTGAAAAAGT TCTGGATCTA AAGGGAGAGG ATCTCCAGTC GATACGGGAT CTGGCCATAG CCTATGAAAT AAACGGAGAA AAGCAGAAAG CGCTTGAGCT GATGAACAGC ATACTGGAGA AAAGAACTCC GAATGAACTG GAATTAAAAG GGATAGTAAT AAATGAAATG AATAACCTTA TAGAAAAAAA TAAAAACAAG CTGAATACAA ACATGATAGA TAAAAGACTG ATTTATCCGA TGCCTGTAGA TTACAGGGTG GTGCTGGACT GGTCAAAGGA TAACCAGGAA ATAGATTTGT GGCTGACACA GCCTGACGGA GTGAGAGTTT ATTACAGTAA TTCACAGACG CCTGATAATA ATGCAGTTAT TTATAACCAT ACTACATTCA GATACGGGCC TGAAGAGCTT CTTATAAAGA AAGCCAAAAA AGGTGAGTAT GAAATAAAGG TGGAGTATTA TGCAGATCAG TCGCAGACTT TGAGAGAGCC TGTAATAATC AGACTGGAAA TCATTACAAA CTACGGGTCA AAAAATGAAA AAAGACAGAT AATAACAAGA AGAGTAGAAA ATGTAAGAGA GTTTATTGAT ATAGGAAAAT TTATGTATGA CGGAAAGTAA
|
Protein sequence | MSKKLLLAFF MILGVIIRGN EISSIRLEIK NPGEKAVVLD TMSVNVEING DISVTTYDMT FYNPNTRILE GEFSFPLQEG QKVTRYALDV NGKLREGVAA EKEKARTAYE NTIRQKIDPG IIEKTAGNNY KTRIYPIPSN GYKRVVIAYT EVLKNKNGSL DYFLPLNYSQ KVNNFSLEIK MLKQESRPNW IEKIDRLEFD KMESGYYAKT SLKNYTPDKN VRISLPLDKK DKVYTEKADD TAYFTAKLNL ENNYYTEKPK AKNIVLIWDT SNSGEKRDTE KELTLLGKYF SYLGNVNISL YSIDNDFLSR GSFQIKNGSW DQLKNTINNF VYDGGTRFNK ISFKDNADEV IFVTDGVNTI DSSEFKLANI PFIVINSSKE SDSGFIKYMA DTSNGKVIDL NREDIDTEFD KMKYNYLNLI SYEYNKSEID EVYPKSESDI RGSFDFSGIL KGSRAEITVN LGFGNIITET RKILISADTN SHNISKIWAE KKIENLSGNY EKNKKEILKT AKKYTLVTNE TSLIVLDRVE DYVSYEIIPP AELLDEYNRQ MAYRRKNEQD EKKNGLKESV EVLERRKEWY VKPVLKSERA RENVTKVEKN TAVNDYIPQP SIPAAEMQSE PALNKSAVTS SGKAAYSDMK QDETAVGSKD TGGKNLRVIV YEDKDRNSEY INEYKKVKPE NIYEKYLEMK TKYGKNPFFY IDTADYLIKN NQRNTALKVL TNIPELSLEN HEYYRILGYK LLETGDNGLA VKIFEKVLDL KGEDLQSIRD LAIAYEINGE KQKALELMNS ILEKRTPNEL ELKGIVINEM NNLIEKNKNK LNTNMIDKRL IYPMPVDYRV VLDWSKDNQE IDLWLTQPDG VRVYYSNSQT PDNNAVIYNH TTFRYGPEEL LIKKAKKGEY EIKVEYYADQ SQTLREPVII RLEIITNYGS KNEKRQIITR RVENVREFID IGKFMYDGK
|
| |