Gene Sterm_4067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_4067 
Symbol 
ID8599511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp4331077 
End bp4332384 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content38% 
IMG OID 
Productenolase 
Protein accessionYP_003310830 
Protein GI269122653 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.122819 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTATTA TTGAAGATAT CCACGCTAGG GAGATATTGG ACTCAAGAGG GAACCCTACT 
GTAGAAGTGG AAGTTTATCT GTCAGGAGGT GCGATGGGAA GAGCTTCTGT ACCATCTGGA
GCATCTACAG GTGAACATGA GGCTGTGGAA TTAAGAGATA AGGATGCAAG CAGATACCTT
GGACAAGGAG TTCTAAAAGC AGTAGAAAAT GTAAATACAG TTATAGCAGA TCATTTAATG
GGAATGGATG CTTTAGACCA AGTAGCTGTG GATAAAGCTA TGATAGAATT AGACGGGACA
CCAAATAAAG GAAAATTAGG AGCAAATGCT ATTTTAGGTG TATCTTTGGC AGTGGCTAAA
GCAGCTGCAA ACCAATTAGG ATTACCTTTA TACAGATATT TAGGTGGAGT AAATTCTAAA
GAATTACCAG TACCAATGAT GAATATCCTG AATGGAGGAT CACATGCTGA TTCGGCTGTA
GATGTACAGG AATTCATGGT TCAGCCGGTA GGAGCAAAAA CATACAAAGA AGGATTAAGA
ATGGGTGCTG AAATTTTCCA TCACTTAGGT AAAATCCTTA AAGAAAACGG AGATTCTACA
AATGTTGGTA ATGAAGGTGG ATATGCACCT GCTAAAATCA ACGGAACAGA AGGAGCACTT
GATATTATTT CTCAGGCAGT GGAAAGAGCA GGATATAAAT TAGGTGAAGA AATCACATTC
GCACTGGATG CAGCTTCAAG TGAATTCGCT AAAAAAGACG GAGATAAGTA TACTTACCAC
TTCACAAGAG AAGGCGGAGT AGAAAGAACA TCAGAAGAAA TGGTAGAATG GTATGCAGGT
CTTGTTGCTA AATATCCTAT TATTTCAATA GAAGACGGTT TAGCAGAAGA TGACTGGGAC
GGGTTCAAAA AACTTACTGA AAAATTAGGG AAAACTGTAC AGTTAGTAGG AGACGATTTA
TTCGTAACTA ATACTGAAAG ATTATCAAGA GGAATCAGAG AGGGAATAGC TAACTCTATC
CTTATCAAAG TTAACCAAAT AGGAACATTA ACAGAAACTC TTGATGCTAT AGAAATGGCT
AAAAAAGCAG GATATACAGC AGTTATCTCT CACAGATCAG GAGAAACTGA AGATGATACT
ATAGCTGATA TAGCAGTAGC TACAAATGCA GGTCAGATCA AAACTGGTTC TGCTTCAAGA
ACAGACAGAA TGGCTAAATA CAACCAGTTA TTAAGAATCG AAGATGACTT GGCTGAAGAG
GCTATCTATG AAGGAATTAA CACTTTCTAT AACATAAGAA AAAAATAG
 
Protein sequence
MTIIEDIHAR EILDSRGNPT VEVEVYLSGG AMGRASVPSG ASTGEHEAVE LRDKDASRYL 
GQGVLKAVEN VNTVIADHLM GMDALDQVAV DKAMIELDGT PNKGKLGANA ILGVSLAVAK
AAANQLGLPL YRYLGGVNSK ELPVPMMNIL NGGSHADSAV DVQEFMVQPV GAKTYKEGLR
MGAEIFHHLG KILKENGDST NVGNEGGYAP AKINGTEGAL DIISQAVERA GYKLGEEITF
ALDAASSEFA KKDGDKYTYH FTREGGVERT SEEMVEWYAG LVAKYPIISI EDGLAEDDWD
GFKKLTEKLG KTVQLVGDDL FVTNTERLSR GIREGIANSI LIKVNQIGTL TETLDAIEMA
KKAGYTAVIS HRSGETEDDT IADIAVATNA GQIKTGSASR TDRMAKYNQL LRIEDDLAEE
AIYEGINTFY NIRKK