Gene Sterm_3765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_3765 
Symbol 
ID8599211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4001664 
End bp4003643 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content41% 
IMG OID 
Producttransketolase 
Protein accessionYP_003310530 
Protein GI269122353 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones40 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAA ATGTAAAACA ATTGTCAGTA GATGCAATAA GAATGCTGGG AGTGGATGCG 
ATAGAGAAGT CAAAGTCAGG ACATCCCGGA ATAGTATTAG GTGCGGCACC TATGGCGTAC
ACACTTTGGA GCGAACACCT TAACGTGAAT CCAAAAGAAC CTGAATGGCT GAACAGAGAC
AGATTTGTAC TGTCAGCAGG ACACGGATCG ATGTTAATAT ATTCATTATT ACATTTAAGC
GGTTTTGATG TGTTTATGGA AGACATAAAG AATTTCAGAC AGTGGGGTTC TAAGACACCA
GGACATCCTG AGTTCGGACA TACAAAAGGA GTGGACACGA CAACAGGTCC CCTTGGACAG
GGAATTGCCA CAGCAGTGGG AATGGCACTG GCAGAAACAC ATCTTGCGAA AAAATATAAT
AAAGAAGATA TGAATATAAT AGATCACTTT ACATATGTAA TCTGCGGAGA CGGAGATCTT
ATGGAAGGTG TGAGCGGAGA GGCAAGTTCA TTTGCAGGAG TACAGAAGCT GGGGAAACTG
GTAGTACTTT ATGATTCCAA TGATATATGT CTTGATGGTG AAACAAGAGA AACATTTACA
GAAGATGTGG CTAAGAGATA TGAAGCTTAC GGCTGGCAGG TACTAACGGT AAAAGACGGC
AATGATCTTG GAGCAATAGA TGCGGCAATA AAAGAAGCAA AGAAAGATGT AACAAAGCCG
ACATTAATAG AGATAAAAAC AGTAATAGGA TACGGAGCAC CGACAAAAGC AGGAAAGAAC
AGTTCACACG GGGCACCTTT GGGAGCAGAA GAAACAAAAG GATTAAGAGA ATATCTGAAA
TGGGACTATG AAGCATTTGA AGTTCCGGCA GAGGTATATG AGGACTATAA GAAATCAGTA
GCAGAAAGAG GAACAGCAAA GTCAGAGGAA TGGAAAGCCC TTGTAGGAAA GTATAAGGAA
AAATATCCTG AGCTGGGAAA AGAGATAGAA GAAATAGCGG CAGGAACATT ATTTGATAAT
ATAAAGATAG AATTCCCGGC ATATGAAGCA GGACATTCAC AGGCAACAAG AAATGCATCA
AATGATGCGA TAAATGCAAT AGCAGGACAG ATACCGAATT TCATAGGAGG CTCGGCAGAC
TTAGCACATT CAAATATGAC AATGATAAAA GGAGAAGGAC TGTTTGACGC AGAGCACAGA
GAAAACAGAA ATATTCAGTT TGGAGTAAGA GAATTTGCAA TGGGAGCGAT ACTGAACGGA
ATGGTATTAC ACGGAGGACT GAAGACATTC GGGGGAACAT TCTTTGTATT CAGTGACTAT
GTGAAGGCAG CAATAAGATT ATCAGCATTA ATGGGATTAC CGGTAACTTA TGTACTTACT
CATGACAGTA TAGCAGTAGG GGAAGACGGT CCGACACATG AACCGATAGA ACAGCTTGCA
GGTTTAAGAG CAATTCCGAA TATAAATGTA ATAAGACCAG CAGACAGCAG AGAAACACAG
GGAGCATGGA AGGTAGCTGC AGAAAGTAAG AAGACACCGA CACTTTTAGT ATTAAGCAGA
CAGAATCTGG ATGTAACAGA AGGTTCGTCA ATGGAAGATG TAGCAAAGGG AGCATATGTA
TCTTACGAGA CAAACAAAGA TTTTGGAAGA ATAATAATAG CAACAGGATC AGAAGTATCT
CTGGCAGTGG GAGCAGCAAA GGAACTGGAA AAAGCAGGAG AATCAGTAAG GGTAGTAAGT
ATGCCGAGTA TGGAGCTTTT TGAAAGACAG AGCTGTGAGT ATAAGGAAAG TATCCTTCCA
AAAGGAGTAA GAAACAGAGT ATCACTTGAG ATGGGATCGA CATTCGGATG GCATAAATAT
GTAGGAATGG ACGGACTGGC AATAGGAATA GATACATTCG GAGCATCAGC ACCGGCAGGA
AAAGTAATAG AAGAATATGG ATTTACAGTA GAAAAAATCG TTAATAAAAT AAAAGGATAA
 
Protein sequence
MEKNVKQLSV DAIRMLGVDA IEKSKSGHPG IVLGAAPMAY TLWSEHLNVN PKEPEWLNRD 
RFVLSAGHGS MLIYSLLHLS GFDVFMEDIK NFRQWGSKTP GHPEFGHTKG VDTTTGPLGQ
GIATAVGMAL AETHLAKKYN KEDMNIIDHF TYVICGDGDL MEGVSGEASS FAGVQKLGKL
VVLYDSNDIC LDGETRETFT EDVAKRYEAY GWQVLTVKDG NDLGAIDAAI KEAKKDVTKP
TLIEIKTVIG YGAPTKAGKN SSHGAPLGAE ETKGLREYLK WDYEAFEVPA EVYEDYKKSV
AERGTAKSEE WKALVGKYKE KYPELGKEIE EIAAGTLFDN IKIEFPAYEA GHSQATRNAS
NDAINAIAGQ IPNFIGGSAD LAHSNMTMIK GEGLFDAEHR ENRNIQFGVR EFAMGAILNG
MVLHGGLKTF GGTFFVFSDY VKAAIRLSAL MGLPVTYVLT HDSIAVGEDG PTHEPIEQLA
GLRAIPNINV IRPADSRETQ GAWKVAAESK KTPTLLVLSR QNLDVTEGSS MEDVAKGAYV
SYETNKDFGR IIIATGSEVS LAVGAAKELE KAGESVRVVS MPSMELFERQ SCEYKESILP
KGVRNRVSLE MGSTFGWHKY VGMDGLAIGI DTFGASAPAG KVIEEYGFTV EKIVNKIKG