Gene Sterm_3345 details


Gene Information

Locus tag: Sterm_3345
Symbol:
ID: 8598797
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 3519845
End bp: 3521170
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 37%
IMG OID:
Product: glycoside hydrolase family 4
Protein accession: YP_003310116
Protein GI: 269121939
COG category:
COG ID:
TIGRFAM ID:
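
The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only and not part of the record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
START_BP = 3519845
END_BP = 3521170


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1326 441
```

Under those assumptions the result matches the listed Gene Length (1326 bp) and Protein Length (441 aa).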


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAAAAA TTAAAATTGT AACTATTGGC GGAGGTTCCA GCTACACACC GGAACTGATT 
GAGGGATTTA TTAAGAGAAG TGCGGAGCTT CCAATAAGGG AAATTTGGCT TGTTGACATA
GAAGAGGGGA AAGAGAAGCT TGAAATAGTG GGGAATCTGG CAAAACGTAT GGTAGAAAAA
GCTGGACTGG ACTGGCAGAT ACATCTGACT CTGGACAGAG AAGAAGCTTT GAAAGGTGCT
GATTTCGTAA CAACACAATT CAGGGTTGGT TTCCTTGATG CTCGTATAAA AGATGAAAGA
ATACCTTTTG AAAATGGTCT GCTTGGTCAG GAAACAAACG GTCCCGGCGG AATGCTGAAA
GCATTTCGTA CAATTCCGGT AATTCTTTCT ATAGTTGAAG ACATGAAAAG ACTTTGTCCT
GATGCATGGC TCGTAAACTT TACAAATCCG GCAGGAATGG TAACAGAAGC AGTATTGAAG
TATGGGAAAT ATGAAAAAGT GGTGGGACTG TGCAATGTTC CTGTAAACCA TATGATGAGC
GAATCAAAGC TTCTTGGCAA GGATGCCAGT GAATTGTTTT TCCACTTTGC AGGATTAAAC
CACTTTGTAT GGCACAAGGT ATATGATAAT AAAGGTAATG ATATAACAGG AGAAGTAGCT
GCGAAAGTAA TAAGCGAAGA AGAAGCAGGA GTGGCTAATA TAGAAGTAAT GCATTTTCTT
CAGGATCAGC TTGATCACTT AGGAATGATA CCGTGTTATT ATCACAGATA CTATTATCTT
CAGGACGATA TGCTTCAAAA AGGACTTGAA AGCTATAAAA ATGAAGGAAC TCGTGGTGAA
GTGGTAAAAA GAGTGGAAGA AGAATTATTT GAACTATATA AAAATCCAGA CTTAAAAGAT
AAGCCTACAC AGCTTGAAAA AAGAGGAGGA GCATATTATT CAGATGCTGC ATGTGAATTG
ATAAATTCAA TACATAATGA CAAAAAAATA TTAATGGTAG TAAATACGCG TAATAACGGA
ACAATAGATG ATCTTCCTTA TGACTGTGCT ATAGAAACTA CTGCATATAT AACTGCATCC
GGTCCAAGAC CTCTTAATTT CGGGAAATTT CCTACTGCAC AAAGAGGATA TATCCAGATA
ATGAAAGCAA TGGAAGAACT TACAATAGAA GCGGCTGTAA CTGGAGATTA TAAAATAGCA
TTAGAAGCAT TCATTACTAA TCCTTTAGTA CCTGGAAGCA CTATCGGTAA AAAGGTATTA
GATGAATTAT TAATAGCTCA CAAAAAATAT CTTCCTCAGT TTAAAGATTT TTTTGACAAA
CAATAG
 
Protein sequence
MSKIKIVTIG GGSSYTPELI EGFIKRSAEL PIREIWLVDI EEGKEKLEIV GNLAKRMVEK 
AGLDWQIHLT LDREEALKGA DFVTTQFRVG FLDARIKDER IPFENGLLGQ ETNGPGGMLK
AFRTIPVILS IVEDMKRLCP DAWLVNFTNP AGMVTEAVLK YGKYEKVVGL CNVPVNHMMS
ESKLLGKDAS ELFFHFAGLN HFVWHKVYDN KGNDITGEVA AKVISEEEAG VANIEVMHFL
QDQLDHLGMI PCYYHRYYYL QDDMLQKGLE SYKNEGTRGE VVKRVEEELF ELYKNPDLKD
KPTQLEKRGG AYYSDAACEL INSIHNDKKI LMVVNTRNNG TIDDLPYDCA IETTAYITAS
GPRPLNFGKF PTAQRGYIQI MKAMEELTIE AAVTGDYKIA LEAFITNPLV PGSTIGKKVL
DELLIAHKKY LPQFKDFFDK Q
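
The two sequence blocks above are printed in groups of ten characters. The following minimal sketch, which is not part of the record, shows one way to join such blocks, compute a GC fraction, and translate a coding sequence. All function names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for internal codons; the alternative start codons that table 11 permits are not handled here, on the assumption that the gene starts with ATG as this one does.

```python
# Standard codon-to-amino-acid assignment in TCAG order.
# Translation table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence listed above.
    first_blocks = join_blocks("ATGTCAAAAA TTAAAATTGT AACTATTGGC")
    print(translate(first_blocks))  # prints: MSKIKIVTIG
```

The translated fragment agrees with the first block of the protein sequence. Passing the full gene sequence through `join_blocks` and `gc_content` would give a figure to compare with the GC content field.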