Gene Sterm_1684 details


Gene Information

Locus tag: Sterm_1684
Symbol:
ID: 8597153
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 1798186
End bp: 1799580
Gene Length: 1395 bp
Protein Length: 464 aa
Translation table: 11
GC content: 38%
IMG OID:
Product: dihydropyrimidinase
Protein accession: YP_003308473
Protein GI: 269120296
COG category:
COG ID:
TIGRFAM ID:
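
The coordinate and length fields above are related by simple arithmetic, sketched below. The sketch assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon. The record does not state either convention, although its numbers are consistent with both. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon (assumed present).
    return gene_len_bp // 3 - 1


length_bp = gene_length(1798186, 1799580)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the two results can be compared directly against the Gene Length and Protein Length fields.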


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTAATAA AAAACGGGTT GATTGCAACA GCAGGAGATT TATATCAGGG GGATATTTAT 
ATTGAGGACG GAATTATCAA AGAGATAGGA AAAGATCTGA ATATAGAGGA TTCTGAAATT
ATAGATGCGG ACGGGAAATA TGTTATCCCC GGAGGAATTG ACGTACATAC ACACTTTCAT
CTTGATGTGG GAATAGCTGT TTCCAGCGAT GACTTCAGAA CAGGAACAAT TGCGGCAGCG
TGCGGAGGAA CTACATCTAT TGTAGATCAT ATAGGACAGG GACCGAGAGG AACTACACTT
CATGATCCGA TTAATCATTA TCATAAACTG GCTGACGGAA AAGCAGTAAT AGATTACGGC
TTTCACGGGG TAATTCCATA TGAAGTGGAT GATGACAGGC TGAAAGAAAT GGATCAGCTT
TTGGAAGACG GTATAGAAAG CTTTAAGATA TATATGACTT ATGGTCAGAT GGTACATGAT
GAGGATTCCA TAAAGGTCTT GAAGAAGGCT AAGGAAAAAG GCGGGATTAT TGCAGTTCAT
CCCGAAAATA ACGATACTGT AAATTATTTG AAAAAATATT ACAGTGAAAA CGGAATGACT
GCACCGATAT ATCATGCCAA AAGCAGACCG GAAGAATGTG AAGGAGAGGC AATAAACAGA
ATTCTGAATA TAGCTCACCT TGTGGGTGAT GCCCCTATCT ATATTGTCCA TCTTTCAGCT
AAGCTTGGTC TTGATTATAT AAAAATGGCA AGAGACAGGG GACAGGAAAA TATATATGCC
GAAACATGCC CGCAGTATCT TGTTCTGGAT GAAGAAAAAT ATAATCTTCC GGGAACTGAA
GGGCTAAAAT ATGTAATCAG TCCGCCGCTA AGAAATAAAG CAAATCAGGA ACCGCTGTGG
AGAGCTGTAA GAGAAGGAGA TATACAGGTA ATAGCCACAG ATCACTGCCC GTTTCTTTTT
GAAAAAGAAA AGGAAGCAAT GGGAAAGGAT GATTTTACAA AGTGTCCTAA CGGGGCTCCG
GGTGTGGAAA CAAGAATGCC TGTGATTTTT TCTGAGGGAG TAATGAAGGG CAGAATATCT
ATAAATAAAT TTGTAGAGGT AACAAGTACC AATCCTGCGA AAATATACGG GATGTATCCC
CAAAAAGGGA CAATTGCCGT GGGAAGTGAT GCAGATATAG TAATTATTGA TAAGGATAAG
GAAGTTACCA TAACGAAATC AATGCTTCAT GAAAATGTAG ATTACACTCC TTATGAAGGA
ATGAAAGTAA AAGGCTATCC TGTAATGACC ATAGTCAGAG GAAAGGTAAT AGTTAAGGAT
AATGAATTCA TAGGTGAAGA AGGATACGGA AAATTTATCA AAAGATATAA AAACGATGAA
TTAGTAAGAT TTTAG
 
Protein sequence
MLIKNGLIAT AGDLYQGDIY IEDGIIKEIG KDLNIEDSEI IDADGKYVIP GGIDVHTHFH 
LDVGIAVSSD DFRTGTIAAA CGGTTSIVDH IGQGPRGTTL HDPINHYHKL ADGKAVIDYG
FHGVIPYEVD DDRLKEMDQL LEDGIESFKI YMTYGQMVHD EDSIKVLKKA KEKGGIIAVH
PENNDTVNYL KKYYSENGMT APIYHAKSRP EECEGEAINR ILNIAHLVGD APIYIVHLSA
KLGLDYIKMA RDRGQENIYA ETCPQYLVLD EEKYNLPGTE GLKYVISPPL RNKANQEPLW
RAVREGDIQV IATDHCPFLF EKEKEAMGKD DFTKCPNGAP GVETRMPVIF SEGVMKGRIS
INKFVEVTST NPAKIYGMYP QKGTIAVGSD ADIVIIDKDK EVTITKSMLH ENVDYTPYEG
MKVKGYPVMT IVRGKVIVKD NEFIGEEGYG KFIKRYKNDE LVRF