Gene Sterm_4016 details

Gene Information

Locus tag: Sterm_4016
Symbol:
ID: 8599460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 4254068
End bp: 4255573
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 40%
IMG OID:
Product: methylmalonate-semialdehyde dehydrogenase
Protein accession: YP_003310779
Protein GI: 269122602
COG category:
COG ID:
TIGRFAM ID:
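
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4254068
end_bp = 4255573
gene_length_bp = 1506
protein_length_aa = 501

# Assumption: start and end are inclusive coordinates on the replicon.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```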


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.44182
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAAAAT TGAAATTTTT TATAAACGGT GAGTGGAGAG AATCAAAAAC TGATAAGTAT 
TATGATATAT ACAATCCGAG TACTGGTGAA GTAATAGCAC AAACTCCGTG CTGTACAAAG
GAAGAAGTAG AGGAAGCCGT ACAGGCAGCC AAGGACGCAT TTGAATCATG GGCGGCAATT
CCTGTAATGA AAAGAGTTCA GATTTTATAT AAATTTAGAG ATCTGCTGGA TCAAAGAATG
GACGAGCTTA CAGAAATACT GTGTAAAGAA CATGGTAAAA ACTGGGCGGA AGGACAGGGA
GATTTATTAA AGGTAAAAGA GCCCGTAGAG CTTGCATGCA GTGCTCCGTT ATTAATGATG
GGAGAATCTC TGATGAATAC TTCTACAGGT TATGATACTG TACTTTACAG AGAACCGCTG
GGGGTATTTG CAGGAATAGC TCCGTTTAAT TTCCCGGGGA TGATACCAAT GGGATGGATG
GTTCCGTTAT GTATCGCTAC AGGAAATACA ATGGTGCTGA AAGCATCAAG TACTACTCCT
ATGACAAGCT ACAAGCTTGC AGAGCTTTTT GTGGAAGCAG GACTGCCAAA AGGAGTATTA
AATATAGTGA CAAGTTCAAG AAACGAAGCA GAAATTCTGT TATCGCACCC TGATGTAAAA
GGAATCTCAT TCGTAGGATC GACATCTGTG GGACTTCATG TATATTCTAC AGGAGCTGCA
CACGGAAAAA GAGTACAGGC ATTATGCGAG GCAAAAAACC ATGCATTGGT ACTGGAAGAC
TGTGTACTTG AAAGATCTGT AAGAGGAATT ATAAACTCGG CATTCGGATG TGCAGGGGAA
AGATGTATGG CTCTTCCTAC AATATGTGTT CAGGAAAGCA TAGCTGATAA GTTTGTGGCA
AAATTAACAG AAGTGGCAAA AGAACTGAAA ATAGGACCTG CATATGATAA AACAACTGAT
CTCGGACCTG TAGTTACAGC AGATCACAGA AAATATGTGG AAGGATGGAT ACAAAAAGGT
ATAGACGAAG GAGCAAAGCT TGTTCTTGAC GGAAGAGGAG TAAGCGTACC CGGATATGAA
AACGGATTCT ATATGGGGCC TACAATTTTT GATTATGTAA CAGAAGAGAT GGAAGTAGGA
CAAAAAGAGA TATTCGGTCC GGTGTTATGT ATAAAAAGAG TAAAAGACTT CGAAGAAGGA
ATAACAATAA TGAATGCAAA TGAATTTGCA AACGGCTCTG TTATTTACAC TAGCAGCGGA
TATTACAGCC GTGAATTTGC AAGACGTACT GACGGAGGAA TGGTAGGAAT AAACGTAGGA
GTGCCGGTGC CGGTAGGATT GTTTCCGTTT AACGGACACA AGCGTTCTTT CTTTGGAGAC
CTTCATACAC TTGGAAAAGA CGGGGTAAAA TTCTTTACTG ATGCCAAAGT AGTAACAAGT
ACATGGTTTA CAGAAGAAGA TAATAAAGCT AAGGTAGACA CATGGGACGG TTCAGTAGTA
AAATAG
 
Protein sequence
MQKLKFFING EWRESKTDKY YDIYNPSTGE VIAQTPCCTK EEVEEAVQAA KDAFESWAAI 
PVMKRVQILY KFRDLLDQRM DELTEILCKE HGKNWAEGQG DLLKVKEPVE LACSAPLLMM
GESLMNTSTG YDTVLYREPL GVFAGIAPFN FPGMIPMGWM VPLCIATGNT MVLKASSTTP
MTSYKLAELF VEAGLPKGVL NIVTSSRNEA EILLSHPDVK GISFVGSTSV GLHVYSTGAA
HGKRVQALCE AKNHALVLED CVLERSVRGI INSAFGCAGE RCMALPTICV QESIADKFVA
KLTEVAKELK IGPAYDKTTD LGPVVTADHR KYVEGWIQKG IDEGAKLVLD GRGVSVPGYE
NGFYMGPTIF DYVTEEMEVG QKEIFGPVLC IKRVKDFEEG ITIMNANEFA NGSVIYTSSG
YYSREFARRT DGGMVGINVG VPVPVGLFPF NGHKRSFFGD LHTLGKDGVK FFTDAKVVTS
TWFTEEDNKA KVDTWDGSVV K
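
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned, translated, and checked against the GC content and protein fields. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. It uses the standard codon assignments, on the assumption that translation table 11 differs from the standard code only in its permitted start codons.

```python
# Standard codon assignments, listed in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses these same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(blocked):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and the final line of the gene sequence above.
first_blocks = "ATGCAAAAAT TGAAATTTTT TATAAACGGT"
last_line = "AAATAG"

print(translate(first_blocks))  # MQKLKFFING
print(translate(last_line))     # K*
```

Passing the full gene sequence to `gc_fraction` and `translate` gives values that can be compared with the GC content field and the protein sequence listed in this record.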