Gene Sterm_0102 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_0102 
Symbol 
ID8595598 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp108462 
End bp110120 
Gene Length1659 bp 
Protein Length552 aa 
Translation table11 
GC content43% 
IMG OID 
Productdihydroxy-acid dehydratase 
Protein accessionYP_003306918 
Protein GI269118741 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0234635 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGTG ATAATTTGAA AAAGGGAGAC AGAAGAGCAC CGCACAGATC CCTGCTTAAG 
GGATTAGGTT TTGTAAATGA GGAAATGGAC AAGCCTATTA TAGGAATTGC CAATTCATTT
AATGAGATAA TACCGGGACA TGTCCATCTG CAGACTCTTG TACAGTCTGT AAAAGACGGT
ATAAGAATGG CAGGAGGAGT TCCTATGGAA TTTAATACGA TAGGAATCTG CGACGGGCTG
GCAATGAATC ACATAGGAAT GAAATATTCG CTGGTAACAA GACAGATAGT GGCTGATTCG
ATAGAAGCTA CTGCGATGGC AACACCATTT GATGCTATAG TTTTTATACC AAACTGTGAT
AAGGTAGTTC CCGGAATGCT TATGGCAGCA GCAAGACTGA ATATACCAAG TATATTTATA
AGCGGAGGAG CAATGCTCGC AGGTGTCTAT AAAGGGAAAA AAATAGGATT AAGCAATGTT
TTTGAATATG TTGGGCAGTT TGAATCAGGG AAAATGACTG CAAAAGAACT GAATATGGTA
GAAGATATGG CGTGTCCTAC ATGCGGGTCA TGTTCGGGAA TGTACACTGC AAATACAATG
AACTGTCTGA CTGAAGCTCT GGGAATGGGA CTGCCCGGGA ACGGAACTGT GCCTGCGGTA
TTTTCGGAAA GACTCAGACT TGCTAAAAAA GCAGGAATGC AGATACTGGA AATACTAAAA
GCTGATCTGA AACCAAAAGA TATAATGACA AAGGAAGCAT TTGTAAATGC AGTGGCAGTG
GATATGGCAC TCGGAGGATC TACAAATACA GCACTTCATC TGCCGGCAGT AGCACATGAT
GCAGGAGTAA AACTTACTAT AGATGATTTT AACGAAATTG CGGCGAGAGT ACCTCAGCTG
TGTAAGCTGT CACCTTCAGG AGAGTATTTC ATAGAGGATT TATACAGAGC AGGCGGAGTT
ACTGCGGTAA TGAGAAGACT GCTTGAAAAC GGAGAGCTGG ATGGAACTCA GAAAACAGTT
GCACTGAAAA CACAGGAAGA GCTGTGTAAG GAAGCATATA TAAATGACGA GGATGTAATA
AAGCCGTGGG ATAAGCCGGC GTATGCAGGC GGAGGACTGG CAGTGCTGAA AGGAAATCTT
GCCGAGCTGG GATCAGTGGT AAAAGCCGGG GCAGTGGCAG ATGAAATGCA GGTACATTCA
GGACCGGCAA AGGTGTATAA TTCTGAAGAG GAGGCCGTGG ACGGAATTCT CGGCGGAAAA
GTAAAAAGCG GAGATGTAGT GGTAATAAGA TATGAAGGAC CTAAAGGCGG ACCGGGAATG
AGAGAAATGC TTACTCCGAC ATCTGTAATA GCAGGTATGG GACTGGATAA AGAAGTAGCA
CTTCTTACTG ACGGAAGATT TTCAGGGGCG ACAAGAGGAG CTTCAATAGG GCATGTGTGT
CCTGAGGCAG CAGTAGGAGG AACTATAGCA GTAGTAAGAG ACGGGGATAT TATAGAAATA
GATATACCAA ACAGAACTCT GAATGTAAAA CTAAGCGACG AGGAAATTGC AGCCAGAAAA
GCCGAGCTGA AACCATATGA GCCTGAAGTA ACAGGATATC TGAAAAAATA TGCACTGCAT
GTAGGATCGG CAGTTAACGG AGCAATAGAA GAATATTAA
 
Protein sequence
MRSDNLKKGD RRAPHRSLLK GLGFVNEEMD KPIIGIANSF NEIIPGHVHL QTLVQSVKDG 
IRMAGGVPME FNTIGICDGL AMNHIGMKYS LVTRQIVADS IEATAMATPF DAIVFIPNCD
KVVPGMLMAA ARLNIPSIFI SGGAMLAGVY KGKKIGLSNV FEYVGQFESG KMTAKELNMV
EDMACPTCGS CSGMYTANTM NCLTEALGMG LPGNGTVPAV FSERLRLAKK AGMQILEILK
ADLKPKDIMT KEAFVNAVAV DMALGGSTNT ALHLPAVAHD AGVKLTIDDF NEIAARVPQL
CKLSPSGEYF IEDLYRAGGV TAVMRRLLEN GELDGTQKTV ALKTQEELCK EAYINDEDVI
KPWDKPAYAG GGLAVLKGNL AELGSVVKAG AVADEMQVHS GPAKVYNSEE EAVDGILGGK
VKSGDVVVIR YEGPKGGPGM REMLTPTSVI AGMGLDKEVA LLTDGRFSGA TRGASIGHVC
PEAAVGGTIA VVRDGDIIEI DIPNRTLNVK LSDEEIAARK AELKPYEPEV TGYLKKYALH
VGSAVNGAIE EY