Gene Sterm_3655 details

Gene Information

Locus tag: Sterm_3655
Symbol:
ID: 8599101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 3880475
End bp: 3882817
Gene Length: 2343 bp
Protein Length: 780 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: MutS2 family protein
Protein accession: YP_003310420
Protein GI: 269122243
COG category:
COG ID:
TIGRFAM ID:
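
The coordinate and length fields above are consistent with each other, and the arithmetic can be sketched as follows. The sketch assumes that the start and end positions are inclusive, 1-based coordinates and that the CDS ends in a single stop codon; neither is stated in the record itself. The helper names `span_length` and `coding_codons` are illustrative, not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 3880475
end_bp = 3882817
gene_length_bp = 2343
protein_length_aa = 780


def span_length(start, end):
    """Length of an interval, ASSUMING inclusive 1-based coordinates."""
    return end - start + 1


def coding_codons(length_bp):
    """Amino-acid codons in a CDS, ASSUMING one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)        # True
print(coding_codons(gene_length_bp) == protein_length_aa)     # True
```

Under those assumptions, 3882817 - 3880475 + 1 gives 2343 bp, and 2343 / 3 gives 781 codons, one of which is the stop codon, leaving 780 amino acids.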


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAATA GAAGTTATGA AGTTTTGGAA TTTCATAAAG TTATTAATAA AATAATAGAT 
TTGGCAAAGC TGGAAGCTAC CAAGGAAAAG TTTCTAGATT TGGACATTAT GAAAAATAAA
GGTGAACTGG ATAAAGAGCT TGCCCTTTTG GTAGAGTTTA TTGATTTTTA TAAATATGAC
GACGGACTGG AGCTGACTAA TCTTTCAGAT ATAGGAAAAT TTCTTAGAAC GATAGATTTG
ATAGGTTCCT ATTTGTCTGT AGAAGATCTG GCGGAATTAA GAAAAAATCT TGCTGTTTAC
AGAATTTCCA AAAGCCGTGC TAAGAATATA AAGGATAAGT ACGGTCTTGT ATGGAATATT
TTTTCTGATA CAGAAGATTT GAAAGATCTG GAAGACTTTA TTTCCGAAGT GGTGGATGAC
GAGGGAAATA TGAAGGACAC AGCTTCGCTG GGCTTAAGAG ATATCAGAAG ACAAAAAAGC
AATATAAATG TCAATATAAA AGAAAAGTTT GATGAGATAA TAAATAACAG AGATTTGCAG
AAGGCAATAC AGGAAAAGAT CGTTACTAAA AGGAACGAGC GTTATGTAAT ACCTGTGAAA
ACAGAGTTCA AATCTCTTGT AAAGGGAATA GAGCATGACA GATCATCAAC AGGAAGTACT
GTATACATAG AACCCCTGAA TACAGTTTCA TTAAATAATA AACTCAGAGA ATACGAGGCA
AAGGAAAGAG AGGAAATAAG AAAAGTCCTT ATCCGTATTA CAGAGCTTAT AAGAAATAAA
AAAGACGAAA TAGCATTAAT AAAGGATCTT CTCGAAAGAC TGGATTTTAT CAATGCCAAA
GTTCTTTATT CAATAGAAAA TGAGTGCAGA GTGCCTAAGG TAGTCAATAA AGAGTATCTG
AAGCTAGTAG TAGCAAGACA TCCTCTTATA GACAGGGAAA AAATGGTTCC TATTAACTTT
GAACTCGGGG ACAATGACAA TATTATGCTT ATAACCGGTC CTAATACAGG GGGGAAGACT
GTAACGCAAA AGATAGCAGG ACTTCTTACA ATAATGGCTT TATCAGGAAT TCCTATTCCG
GCAGATGAAA AAACGGAAAT AGGATTTTTT GGCAGTGTTC TTGCGGATAT AGGGGATGAA
CAGAGTATAG AACAGAATCT GTCGTCGTTT TCTGCACATA TAAAGAATAT AAAAGAAATA
CTTGAAGCAG CCAACAGAAG ATCGCTTGTT CTTATAGATG AAATCGGGAG CGGAACTGAT
CCTATGGAGG GAGCAGCTTT TGCAATGTCA GTAATTGATT ATCTGAATCA GAAGAATGTG
AAATCAATAA TCACTACACA CTACAGTGAA GTAAAGGCAC ATGCCTTTAA TACTGACGGG
ATAAAGAGTG CTTCCATGGA ATTTAACGTA GAAACGCTGC TTCCCACATA CAGACTTCTG
GAAGGTATTC CGGGAGAAAG CAATGCTTTG ATTATTGCCG GGAAATACGG TATAAACGAA
GAGATCATAA ATAATGCAAA ATCATATATA AGTGAAGAAA ATCAAAAAGT GGAGAAAATG
CTTATATCCA TAAAGGAAAA AACAGATGAA GTAGAAAAAC TGAAGATAGA GCTGGAAAAT
GCAAAAGAAG AAATGGAGAG CAGAAAGCAG AAGTATGAAG CTGACATAAT CACTCTGGAA
AATGAAAAAA ATCAAATAGT AAAAGAAGCA TATGATGAAG CTGACAAGTA TCTGCGTGAG
GTTCAGGCAA AGGCTAAGAA TCTTGTGGAT AAAATAAGCC AGGATGAGAT GAAAAAAGAA
GAGGCAAAGG ATGCACAGAG AAGTCTGAAT ATGCTTCGTG AATCTTTCAG GCTTGAAAAA
GAGCAGAATG TAAAGAAAAA AGTAAAAACA AACAAAAAGA CAGATTTTCA GCTTGGTGAG
GAAGTTTTTG TAAAGTCGAT AAATCAGAAC GGAAAAATCC TAAGAATAAT CGGAGAATCA
GACAGTGTTC AGATACAGGC AGGAATATTG AAACTTGTGG TAAGTACTGA TGACATACAG
AAGATAGAAA AGAAAAATAA AAAGAAATTA GGCGGATTTG CATCATTAAA ATCAACTAAT
GTAAAAGGAG AAGTGGATTT GAGAGGAATG ACCGGTGATG AAGCAATGAC TGAGCTAGAG
CTGTATCTGG ACAGGGCAAT GCTGACAGGG TATTCAGAAG TATACATAAT TCACGGAAAA
GGAACTATGG CATTAAGAAC CAGAATACAG GAGTATTTGA AAAAGTCTAA ATATATATCG
GAGTATAGAG ATGCAAATCA GAATGAGGGA GGGCTTGGCT GTACTGTCGC TAAGTTAAAA
TAA
 
Protein sequence
MENRSYEVLE FHKVINKIID LAKLEATKEK FLDLDIMKNK GELDKELALL VEFIDFYKYD 
DGLELTNLSD IGKFLRTIDL IGSYLSVEDL AELRKNLAVY RISKSRAKNI KDKYGLVWNI
FSDTEDLKDL EDFISEVVDD EGNMKDTASL GLRDIRRQKS NINVNIKEKF DEIINNRDLQ
KAIQEKIVTK RNERYVIPVK TEFKSLVKGI EHDRSSTGST VYIEPLNTVS LNNKLREYEA
KEREEIRKVL IRITELIRNK KDEIALIKDL LERLDFINAK VLYSIENECR VPKVVNKEYL
KLVVARHPLI DREKMVPINF ELGDNDNIML ITGPNTGGKT VTQKIAGLLT IMALSGIPIP
ADEKTEIGFF GSVLADIGDE QSIEQNLSSF SAHIKNIKEI LEAANRRSLV LIDEIGSGTD
PMEGAAFAMS VIDYLNQKNV KSIITTHYSE VKAHAFNTDG IKSASMEFNV ETLLPTYRLL
EGIPGESNAL IIAGKYGINE EIINNAKSYI SEENQKVEKM LISIKEKTDE VEKLKIELEN
AKEEMESRKQ KYEADIITLE NEKNQIVKEA YDEADKYLRE VQAKAKNLVD KISQDEMKKE
EAKDAQRSLN MLRESFRLEK EQNVKKKVKT NKKTDFQLGE EVFVKSINQN GKILRIIGES
DSVQIQAGIL KLVVSTDDIQ KIEKKNKKKL GGFASLKSTN VKGEVDLRGM TGDEAMTELE
LYLDRAMLTG YSEVYIIHGK GTMALRTRIQ EYLKKSKYIS EYRDANQNEG GLGCTVAKLK
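
The relationship between the gene sequence and the protein sequence above can be checked with a short translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which codons may act as start codons), so a standard codon table is used here. The function names `clean`, `translate`, and `gc_fraction` are illustrative, and only the first line of each sequence is used as input.

```python
# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 60 bp of the gene and first 20 residues of the protein, as listed.
first_gene_line = ("ATGGAAAATA GAAGTTATGA AGTTTTGGAA "
                   "TTTCATAAAG TTATTAATAA AATAATAGAT")
first_protein_line = "MENRSYEVLE FHKVINKIID"

print(translate(first_gene_line) == clean(first_protein_line))  # True
```

Passing the full gene sequence to `translate` and `gc_fraction` would extend the same check to the whole record.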