Gene Sterm_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSterm_2010 
Symbol 
ID8597476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSebaldella termitidis ATCC 33386 
KingdomBacteria 
Replicon accessionNC_013517 
Strand
Start bp2139373 
End bp2140878 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content33% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003308796 
Protein GI269120619 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGAAA ATAAAGAAAA TATAGAGAAA AAAGAATTAG CAGAGGAAAA AGAAAATAAT 
ACGGAAATTT CTGAAGAAAA TACTGAGATA TCTCAGGAAA ATACAGGAAA AAATATTTCG
GAAGAAATAG AACCGGAAAC AGGAGAGGAT AAAAATCCCG AAATTTTTAT CCATGAAACT
GAAGAAGAAG CTAAAATTGA AGAAAATCCT GAAGAGGAGC TTTTGGATGA AAATATTAAG
GATAAATTTG ACCCCGCAAA GTATGTTCCT TATGATCCAG GGGTAAAGAG CCGCGGAAAT
AGCAGCAAGC TTCCTTATAT AATATTTGGA ATTATAGTGG CTTTGGGAAT ACTTCTTTAT
ATTATAATTG GAAACATGGG AGGCGGTAAT AGAATAGGCG ACCTCGCTCA GGAAGAAGAA
CAGACTGAAA TTGAACAGAC TGCTGCTCCA AGCACAGATA TTAATGATTA TAAAAAGAAA
TACTGGAGCG GCGATTCGAG CATAACGATA ACAGATGCTT TCTCAGGATA TAAAAATGCT
AAAAATATAG AATATCTTTT GTATGAAAAA GATGGAAGAA CGGTTTTTCA GGTTAAGGCA
GAGCTGGAGG TAAAGCAGAT TCTCGATTAT AACGGTCCTG ATATAAAAAT CGGAAATGAC
GGAGACCTGT CTGAAACAGC ATATCTGTAT TTCAGAAAGC ATCAGAATGA TATAAAGATT
TATGATAACA GTTATTTTTA TATACCAAAG GAAAATACAG GAAGTGAAAA TCTGGATTTA
TCCAAAAGAG AAGTAGAAAT AACAAACGGG AACGAAGTAT ATAAAGCTGA GTATTCCAAT
GGTTTGAATG ATATATATAA TGACAGTTTT AATTATGGTG CAATGCTGAA AAATATGAAT
AATTTTTATT TAAAAGTAGT TCCTAAAAAA TCAACTAGTA TAAGCGTAAA ATTTGAAACT
AAGGAAAAGG CTGATGAGGA ATTAAATAAG GTATATCAGC ACCTCACGGG AATACTTAAT
GATGACCAGA AGTCAAAGCT TACATCTTCC CAAAAAGCAT GGCTGGATTA CAGAGACAGC
GAGTTTAAAT TTTTGAATTC TGTATTCTTT ATAAAAGATA TACCAAATTC TGCAGAAATT
TCGAATAGAT TTTCTGAAAA ATACAAAATA AAAATAATAG AAAACAGAAT CAGCGAATTG
AATACATATA AAGAACTGGT AGATAAAAAA GGTACAGTAA AACTTGATGA AACAGAAATA
AACAAGCAGA TGGAAAATCT GAAGCAGAGA TATGCAACAC TGCTTACACA TCTGAGCGGT
GACAGTCTGC AGTTTATGAA AGATTCGGAA ACAAAATGGT CTTCTTTTGC AGATACTGAT
CTGATATTCG TTCAGAGCCT TTCTGCTGTA CTTCCGGAAG GAGAAACATC ACAGTTTTCA
ATAGGATTTG AGCCTTACAG TATAAGACTG AAGATGCTTC AGGTATATGA TGATATCTTA
TTTTAA
 
Protein sequence
MSENKENIEK KELAEEKENN TEISEENTEI SQENTGKNIS EEIEPETGED KNPEIFIHET 
EEEAKIEENP EEELLDENIK DKFDPAKYVP YDPGVKSRGN SSKLPYIIFG IIVALGILLY
IIIGNMGGGN RIGDLAQEEE QTEIEQTAAP STDINDYKKK YWSGDSSITI TDAFSGYKNA
KNIEYLLYEK DGRTVFQVKA ELEVKQILDY NGPDIKIGND GDLSETAYLY FRKHQNDIKI
YDNSYFYIPK ENTGSENLDL SKREVEITNG NEVYKAEYSN GLNDIYNDSF NYGAMLKNMN
NFYLKVVPKK STSISVKFET KEKADEELNK VYQHLTGILN DDQKSKLTSS QKAWLDYRDS
EFKFLNSVFF IKDIPNSAEI SNRFSEKYKI KIIENRISEL NTYKELVDKK GTVKLDETEI
NKQMENLKQR YATLLTHLSG DSLQFMKDSE TKWSSFADTD LIFVQSLSAV LPEGETSQFS
IGFEPYSIRL KMLQVYDDIL F