Gene Sterm_3801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3801 
Symbol 
ID8599247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4038390 
End bp4039727 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content37% 
IMG OID 
ProductRNA modification enzyme, MiaB family 
Protein accessionYP_003310566 
Protein GI269122389 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.920944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAGGAT GGTTTAAAAA TTTGGAAAAG AAAGCAACAA TAATTACTTA CGGCTGTCAG 
ATGAATGTGA ATGAAAGCGC TAAGATGAAA AAAATGCTGC AGTCCATCGG GTATAAGATT
GTTGATGATA TAAAAATTTC TGATCTTGTT CTTCTGAATA CATGTACTGT ACGAGAAGGA
GCAGCGGTAA AAGTCTACGG AAAATTAGGA GAACTGAAGA AATTAAAAGA AAAAAGAAAC
AACATGATAA TAGGTGTAAC TGGGTGTCTT GCCCAGGAGG TCAGAGAAGA ATTTATTAAA
AGAACTCCTT TTGTAGATCT GGTAATAGGA AATCAGAATA TTGCCAAGCT TCCTGACATC
ATAGAAAAAA TTCAAAAAGG AACAGTAGAT CATATAGTAA TGGTAGAAGA TGAAGATGAG
CTTCCAAAAA GGGTAGATGC TGATTTCGGA GATGATATAG TAGCATCTGT TTCAATAACT
TACGGCTGTA ATAATTACTG CACATTCTGT ATAGTGCCTT ACGTACGGGG AATGGAGAGA
TCGGTTCCAA TGAGGGAAAT ACTTGATGAT GTAAAGCAGT ATGCAGATAA AGGTTACAAA
GAAATATTAT TTTTAGGACA AAATGTTAAT TCTTACGGAA GTGACAGAAT CGAAATGGGA
GAAGATTTTG CCGGGCTTCT TACAAAGGCT GCCAATATAG AAGGAGACTT CTGGCTGAAA
TATATTTCGC CGCATCCGAA AGATTTTACT GATTCGGTAA TAAAAGCAAT AGCAGAAAAT
CCCAAGGTAG CAAGAATGCT TCATCTGCCT CTGCAGTCAG GCTCTACTAA GATACTCGGG
GCAATGAACC GAGGATATAC AAAGGAAGAA TTTATAGAAC TTGCTCTTAA AATAAAAAAA
GAGATTCCTG ATATAGGTAT AACAACAGAT ATTATCGTAG GATTTCCGGG AGAGACTGAC
GAGGATTTTC AGGATACTCT GGATGTAGTG GAGCAGGTAG GTTTTGAAAA CGCATTCATG
TTTATGTATT CCAAAAGAAG CGGAACTCCT GCAGCAGTGC TGGAAGAACA GGTGCCTGAA
CAGGTAAAGA AAGAAAGACT TCAGCAGCTG ATGAGACTTC AGAATGCAAG AGCAAAAGAA
GAGAGCAAAA AATATTATGG TCAGACTTTG AAGGTTCTTG TAGAGGGACC GAGCAGCAAA
AATCCTGATA TGCTTACAGG AAGAACCTCT ACTCATAAAA TAGTGCTTTT TAAAGGTGAT
GAAGAGCTTT CGGGGAAATT TGTAAATGTA AAAATATATG AAACAAAAAC ATGGACATTA
TATGGTGAAT TAGTCTAG
 
Protein sequence
MKGWFKNLEK KATIITYGCQ MNVNESAKMK KMLQSIGYKI VDDIKISDLV LLNTCTVREG 
AAVKVYGKLG ELKKLKEKRN NMIIGVTGCL AQEVREEFIK RTPFVDLVIG NQNIAKLPDI
IEKIQKGTVD HIVMVEDEDE LPKRVDADFG DDIVASVSIT YGCNNYCTFC IVPYVRGMER
SVPMREILDD VKQYADKGYK EILFLGQNVN SYGSDRIEMG EDFAGLLTKA ANIEGDFWLK
YISPHPKDFT DSVIKAIAEN PKVARMLHLP LQSGSTKILG AMNRGYTKEE FIELALKIKK
EIPDIGITTD IIVGFPGETD EDFQDTLDVV EQVGFENAFM FMYSKRSGTP AAVLEEQVPE
QVKKERLQQL MRLQNARAKE ESKKYYGQTL KVLVEGPSSK NPDMLTGRTS THKIVLFKGD
EELSGKFVNV KIYETKTWTL YGELV