Gene Sterm_4128 details


Gene Information

Locus tag: Sterm_4128
Symbol:
ID: 8599693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 4407078
End bp: 4410116
Gene Length: 3039 bp
Protein Length: 1012 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: alpha amylase catalytic region
Protein accession: YP_003310891
Protein GI: 269122714
COG category:
COG ID:
TIGRFAM ID:
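The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 4407078
END_BP = 4410116
GENE_LENGTH_BP = 3039
PROTEIN_LENGTH_AA = 1012


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes a complete, unspliced CDS whose length is a multiple of three.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both checks agree with the record: the span covers 3039 bp, and 3039 / 3 - 1 gives 1012 residues.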


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0625116
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTTATA AATTAATAAT ATATAGAAAT GACAGTTTAT ACAGAGAAGA AGAACTTACA 
CAGCTGAAAG ATACTACATA TACTTATAAA TGGGAAAAAG TAGAGAGCGG AAGCTATTCA
TTTGAAGTAG TGGATGAAAA TAACATTACT TATGCAGTAA CATATAATCA CACTACTCCC
TTTGCCCATA CATTCGATAC ATATCTGGAT AAAGATGCAA AACCAAAGCC CATAACAGGT
TTTCAGAGTA ATATAGATAT ACTTATAAAA TATAATTCAA GGGAAAATAC TTTTTCCCTT
GTGAAAACAC GGTTTAAAAG ACTTATAGTA GATATAACAG ATTATGGATA TGAGAAAATA
AACAGGCTGC AGATAACAGG AACATTTAAT AACTGGGAAA TAGAAGAGGT TCCGTTAAAA
AGTGCAGGGG AAAATAAGTA TGAGATAGTA CTCGCAGTAA AAGAAGGAAA TTATGAATAT
AAGCTGATTT TTGATGAAAA ATGGGTTCCC GAAAAAGATA ATCTTATACT GATAGTAGGA
GAAAGCGGTG CTTTATTTCC AAAAGGAGAA CTGGGAACCG GGAAGCTTTC ATATGACGCG
CTGGATAAGA ATCAGACTGA AAAAGCCATA AAACATGATT TCAGAAAATT AAATTATCTG
AATAAAATAT CCGAGAATGA AGTGGAATTT ACAATAAGAA CCCAGCTCCA TGATGTAGAG
AGGGCTTATA TAAGTGTAGA TCTGAATGAA AAGGATATTT ATGAAAAGAT ATACGAACTG
GAAAGACATA GCGATTTTAC ACACAGTTTT GATTACTTTA AAAGAAGTAT TTCGTTTGAA
GAAGATGTAA AAGAGTTTTC ATATGTATTT ATATTGGAGG ATGGAAATAC AAAATATTAT
TTTGACGGAA AGCTGTCCAA TAAAAAGGGA AGAAAAATAA AAATAAATTT TGAAAAAGAT
AAAATCGAAA TATTTTATAT ACCGAACTGG GCAAAGGAAG CAATCTGGTA TAATATATTT
CCAGATAGAT TTTATAACCA CAGCTGCTAT AATAATCCTA TATTTAATGA ATTTGGACCG
GAAAATTTTG AAATAAACAA GCTTCATGAG AGCAATTTTG AAGAGATGTA TAAGTGGAAT
ACAGAAGAAG AAACTTTAGG AAAATTCGAT ACCAACAGGT GGACAAGTGA TTTTTCCGAA
AAGACAGACT GGGAGATAAA AGGTGAGAGC GGAAAAAATA CTTCACTGAA ATATGCAAGA
ATGTACGGTG GAGATCTGCA GGGCATAAGA GAAAAAATAC CTTATATGAA AGAGCTGGGC
ATCAATGCCG TATGGCTGAA TCCGGTATTT TATTCATACC AGAACCATAA ATACGGTACG
AATGATTTCA GACATATATC GCCGGATCTG GGAACGATAC GTACAAGCGG CAGCAGATAT
AATGCAGAGA TAGATGAAAA TAATCCTTAT GGAGACAAAA GCTATGTTGA TATATTAAAA
AAGAATGCCA AAGATAACAG TGAACTAAAG CTTCTGGAGC TAAAACTGAC CGGAGAAAAT
AAAGGAAAAA ACGGTTACGG CGAAACAGAG GATCCTTCGA CATGGATATG GACGGAGTCA
GATCTTATAA TGGTAGATTT AATAAAAGAA CTGCACAGAA ACGGGATAAG AGTTATATTT
GACGGTGTAT TTAACCACAG CAGCAACAGA CACTGGAGTT TTAACCAGGT TCTTATGGAG
GGAGAAAATT CCAAGTATAA AAACTGGTAT AAATTCAGTG ATTTTTCAAA GCATATAAAG
ATAGAAGATG GAATGAGCGA GGAAGAAGCC TACAAAATAT TAAATAAAAA CAGGGAAAAT
ATAAAATATT CAGGCTGGGC AGGATTTGAT TCATTGCCGG AGTTTAACAG CTATAATCTG
GAATTCAAAA GTTATATTTT CAATATAACC AAAAAATGGC TTCTTGGTCC TGATGCAAAA
GTTTCCAAAA ATTGGTACGA AGATGACGGA ATAGACGGTT TCAGACTGGA TGTTCCTAAT
TGCCTTGAGA ATCAGGATTT TTGGATAGAG TGGCGTGAGG TAGTAAAAAG TACTAAAAAA
GATACATATA TAACTGCAGA ACTGTGGGGA AATGCAAGCT ATGATATAAA TCAGGGGAAT
AAATTCGATG CGGTTATGAA TTATGAATGG CTGAAAACAG TAATAGGTTA TTTTATAAAT
CAGGGTTATG AGAAAAATAA AAGCTATAAG CTGAAAGCCG GAGAATTTTT AAATGAGCTG
AGAGAAAAAA GAACATGGTA TCCTAAACAG GCGATACAGG CCTCACAAAA TCTGAACGGT
TCGCATGATA CAGACAGACT GCTGTCAAGA ATAGTAAATG ACAGGGTGGG GAGAGATCTG
GAAGAGGGCA AACAGCTGGA ACAGGGGTAT AATGGAATAC GGCCGGATCT CGCCTCAAAT
TACCATCCTA ATACTACAAT AGACTGGAGA AGCTCTTTTA TAAAACCAAA GGATGTTTTG
AAGCTGATAT CTGTATTTCA GATGACATAT GTAGGAGCTC CGATGCTTTT TTACGGAGAT
GAAATAGGAA TGTGGGGAGC TACAGATCCT TACTGCAGAA AGCCTATGCT CTGGGATGAG
TTTACATATG ACAATGAAAA AAACCCGTCG CTGACAAATG AAAATGAGGT ATATTCGCAG
GAGCCCGACC AGGATCTGCT GTTCTGGTAT AAAAAAGTAA TAAAAATAAG AAAAGAGAAT
CCTGTACTGG TATATGGGAA ATTCAAAGAG CTGTACTGGG ATGACGGAAG GGATATAGTA
GCCTATGTAA GGTCAAATCC AAACAGTGTA ATAATAACAG TACTGAATAA TTCATTTAAT
GACTATGAAG ATTTGGAAAT AGTGACAGAT GAACCAGAAG AAAGATACAT AGATCTTCTT
ACAGGGAAAA ATATATACAG CAGAAAAGAC GGAAAAATCA TACTTAGTAT AAGGGCCAAA
CAGGGAATGA TCCTGAAAAA ATGGAAAAAA AGTATATAG
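
The gene sequence above is printed in space-separated blocks of 10 bases. As a minimal sketch, the helpers below join such blocks into one string and compute a GC percentage. They are demonstrated only on the first row of the sequence, so the result is not comparable to the whole-gene GC content reported in the record. The function names are illustrative.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100 * gc / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence as printed in the record.
    first_row = (
        "ATGATTTATA AATTAATAAT ATATAGAAAT "
        "GACAGTTTAT ACAGAGAAGA AGAACTTACA"
    )
    seq = join_blocks(first_row)
    print(len(seq), gc_percent(seq))
```

Passing the full text of the sequence block to `join_blocks` should yield a single string whose length can be compared with the Gene Length field.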
 
Protein sequence
MIYKLIIYRN DSLYREEELT QLKDTTYTYK WEKVESGSYS FEVVDENNIT YAVTYNHTTP 
FAHTFDTYLD KDAKPKPITG FQSNIDILIK YNSRENTFSL VKTRFKRLIV DITDYGYEKI
NRLQITGTFN NWEIEEVPLK SAGENKYEIV LAVKEGNYEY KLIFDEKWVP EKDNLILIVG
ESGALFPKGE LGTGKLSYDA LDKNQTEKAI KHDFRKLNYL NKISENEVEF TIRTQLHDVE
RAYISVDLNE KDIYEKIYEL ERHSDFTHSF DYFKRSISFE EDVKEFSYVF ILEDGNTKYY
FDGKLSNKKG RKIKINFEKD KIEIFYIPNW AKEAIWYNIF PDRFYNHSCY NNPIFNEFGP
ENFEINKLHE SNFEEMYKWN TEEETLGKFD TNRWTSDFSE KTDWEIKGES GKNTSLKYAR
MYGGDLQGIR EKIPYMKELG INAVWLNPVF YSYQNHKYGT NDFRHISPDL GTIRTSGSRY
NAEIDENNPY GDKSYVDILK KNAKDNSELK LLELKLTGEN KGKNGYGETE DPSTWIWTES
DLIMVDLIKE LHRNGIRVIF DGVFNHSSNR HWSFNQVLME GENSKYKNWY KFSDFSKHIK
IEDGMSEEEA YKILNKNREN IKYSGWAGFD SLPEFNSYNL EFKSYIFNIT KKWLLGPDAK
VSKNWYEDDG IDGFRLDVPN CLENQDFWIE WREVVKSTKK DTYITAELWG NASYDINQGN
KFDAVMNYEW LKTVIGYFIN QGYEKNKSYK LKAGEFLNEL REKRTWYPKQ AIQASQNLNG
SHDTDRLLSR IVNDRVGRDL EEGKQLEQGY NGIRPDLASN YHPNTTIDWR SSFIKPKDVL
KLISVFQMTY VGAPMLFYGD EIGMWGATDP YCRKPMLWDE FTYDNEKNPS LTNENEVYSQ
EPDQDLLFWY KKVIKIRKEN PVLVYGKFKE LYWDDGRDIV AYVRSNPNSV IITVLNNSFN
DYEDLEIVTD EPEERYIDLL TGKNIYSRKD GKIILSIRAK QGMILKKWKK SI