Gene Sterm_1104 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_1104 
Symbol 
ID8596583 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp1207920 
End bp1209443 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content29% 
IMG OID 
ProductExopolyphosphatase-like protein 
Protein accessionYP_003307903 
Protein GI269119726 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGATA TGTTGTGTGT CATTAGTATA GCGTCACACG GTATTTACAT GAAAATATTT 
CAGAAAAAAG GAACTGCTGT AAAGATTATA GATAAGGCAG TACATGTTTC ACTGATAGGG
AAAGAAGTCT TTTTCGAGGA TAAGCTTACA TTTAATAAAA TAAAAGAAAT AATAGACACG
GTAAAAAAAA TGAAGATCAT AGCAGAAGGT TATCAGACTG ATAAAATTGT TGTTATAGGA
ACTACAGCAG TACGTAAGGC AAAAAATAAT TACTATCTTC AGGATCAGCT GAAAATAAAT
ACAGGTTTGG ATTTACTGGT AATGGATAAG CTTGATGAAA ATTATCTGGC GTATGAAAAA
ATATCTGTCA GTCTTGAGAA GCATCTTGAT GATTATAATG AAAAAAATAA CCTGATCGTT
TACGTAGGTT CGGGAAATAT ATCCTTTTCG GTTATAAGTA ACGGAATTAA CATATATAAC
ACCAATATAG AGATAGGCTC TCTTGTTTTA TCAGATATAT CGGATAAGCT GAATCTCGGA
ACCAAAAAGA GAAATATAGT AATTGACGAA TATATAAAAA GGCATTTGAG CGGGATCTTC
CGAAATATTT CCGATATAAA AATAGAAAGG GTTATTCTTG CCGGAAAGTT TTTTGATATT
TATATAGAAA GAGTAAAAAA TAAAAAAAGA ATTGACTTTA TAGAAGAGCT TACACGTGAG
GAGATATTCG AATATAACAA AAAGCTTTAT GAAAAATCTC CGGAAGAAAT AGCTAAAACT
TATAATATAA AAAAAATCGA AGCTGAGATG CTTACTCAGA AAGTAAATAT CATAAAAAAT
ATAATACAAA AATTCAACAG CAACAAAATT ATATTTGTAA ATTTCAAATT ATCCAATTCA
ATTGCAGAGT TTCACTTTTT TAAAAATGAA AAAATAAGAA GAAAAATAGA AAAAGATTCT
GTGGAAAGTG CCAGAAAAAT AGCCAAAAGA TACTATTATA TGGAGGAACA TGTAGATAAA
CTCGAAGAAA TAATAGATAA GATTTTTAAT AAGGTTCATA AAACACACGG ACTGACAAAA
AAGGACAGAT ATTATCTGAC TCTTGCGAAT ATTTTCAGGG AAACAGGGAA GTATATTACA
CTTGAAAATT ATATTAATTT TTCCAGAGAT ATTCTGGAAG CATCAGGAAT ATTCGGAATC
AGTACAAGAG AACATTTTTT TATTTCAAAA ATCCTGAAGT ATTTTGAAGA GGATATTCTG
GAATTGGAAA ACAGTAAATC TATAAGCAAA GAAGAAAAAC TGTCAGTGGC AAAACTGGCA
GCTATATTGA AAATAGCCGA ATCTCTTGAT TCCAGCATGG AACAGAAAAT AGCCGATCTG
GAAATTACAA GTGATGAAAA TGATATTTAT TTTAACGTAA AGCTCAAAAA GATGATTTTT
TTGGAAAAGC TGGAATTTGA AGAGAAAAAA GAAGTTTTTG AAAATGTATT CGGATTGAAA
ACACATTTAG TTATAAATAA ATAA
 
Protein sequence
MKDMLCVISI ASHGIYMKIF QKKGTAVKII DKAVHVSLIG KEVFFEDKLT FNKIKEIIDT 
VKKMKIIAEG YQTDKIVVIG TTAVRKAKNN YYLQDQLKIN TGLDLLVMDK LDENYLAYEK
ISVSLEKHLD DYNEKNNLIV YVGSGNISFS VISNGINIYN TNIEIGSLVL SDISDKLNLG
TKKRNIVIDE YIKRHLSGIF RNISDIKIER VILAGKFFDI YIERVKNKKR IDFIEELTRE
EIFEYNKKLY EKSPEEIAKT YNIKKIEAEM LTQKVNIIKN IIQKFNSNKI IFVNFKLSNS
IAEFHFFKNE KIRRKIEKDS VESARKIAKR YYYMEEHVDK LEEIIDKIFN KVHKTHGLTK
KDRYYLTLAN IFRETGKYIT LENYINFSRD ILEASGIFGI STREHFFISK ILKYFEEDIL
ELENSKSISK EEKLSVAKLA AILKIAESLD SSMEQKIADL EITSDENDIY FNVKLKKMIF
LEKLEFEEKK EVFENVFGLK THLVINK