Gene Sterm_3871 details


Gene Information

Locus tag: Sterm_3871
Symbol:
ID: 8599317
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 4116270
End bp: 4117580
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: glycoside hydrolase family 4
Protein accession: YP_003310636
Protein GI: 269122459
COG category:
COG ID:
TIGRFAM ID:
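
The coordinate and length fields above are consistent with one another. The following minimal sketch shows the arithmetic, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4116270, 4117580)
print(length)                     # 1311
print(protein_length_aa(length))  # 436
```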


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGGAC TAAAAATTAC TACTATCGGC GGAGGTTCCA GTTATACTCC TGAATTAATA 
GAAGGATTTA TCAGAAGATA TGACGAACTG CCGGTTACTG ATTACTATCT GCTTGATATC
GAGGAAGGGA AAGAAAAGCT TGAGATTGTT GGCGAATTTG CCAGAAGAAT GGTCAAAAAA
GCCGGAGTTC CCATAAATAT TCATCTTACT CTGGACAGAG AAGAAGCCCT GAAAGATGCG
GATTTTGTGA CAACTCAGTT AAGAGTGGGA CTTCTTGATG CCAGAATCAA TGATGAAAAA
ATCCCTTTAA AATATAATGT GCTTGGTCAG GAAACTACAG GACCCGGCGG ATTTATGAAA
GCCCAGAGAA CTATTCCTGT TCTTTTAGAT ATATGCGAAG ATATGAAGAG ACTTTGTCCC
GATGCATGGC TTATTAATTT TACCAATCCT GCAGGTATAG TAACAGAAGC AATAAAAAAA
TACAGCAGTA TAAAAACTAT AGGTATTTGC AGCGGTGCAA ACAGCATGCT TATGGATATT
GCAAAAGCTT ATGATGTACA AAAAGACGAC ATCTACACCA GAATAATCGG GCTGAATCAT
CTGATTTTTG CAGATAAAAT ATTTCTAAAA GGCGAAGATA TTACTGATGA TTTTATAAAA
AAATTATCTG CCGGAAAAGC GGATAACAGT TTGAAAAATA TTCCGGATAT AGGCTTTTCC
GCTAAATTCA CAGAAGCACT GCATATGTAT CCTATTTCAT ATCTGAAATA TTTTTTCCTG
AACAGAGAAA TGGTTGAAAT TGCAAAAAAA GACGAAGCTG AAAAAGGTAC AAGAGGTGAA
CAGACAAAGG CAATAGAACA TAATCTATTT GAGCTTTATA AAGATAAAAA TCTGGACACC
AAACCTAAAG AATTGGAAAA ACGCGGAGGA GCATATTACT CTGAAACAGC ATGCTCTATA
ATAAGCTCTA TTTATAATAA TAAAAAAGAA ATACATGTGG TAAACACTCT TAATAACGGT
ACCACTTCTG ATCTTCCCGA TAATGTGGTA ATTGAAACAA ATGCGGTAAT AGATAAGGAT
GGTGCACATC CTGTCACATA TGGAAAACTG CCTGTAAAAA TAAGGGGACT TATCCAGAGT
GTGAAAGCAT ACGAAGAGCT TACAGTGGAA GCTGCTGTAA CAGGCAGCTA TGATACGGCT
CTCCTTGCCC TGAGCATTAA TCCTCTTGTT CCGTCTGCAA ATGTAGCAGA AAGTATACTA
AATGAGCTTC TTGAGGTTAA TAAAAAATAC CTGCCTCAAT ATTTTAAATA A
 
Protein sequence
MKGLKITTIG GGSSYTPELI EGFIRRYDEL PVTDYYLLDI EEGKEKLEIV GEFARRMVKK 
AGVPINIHLT LDREEALKDA DFVTTQLRVG LLDARINDEK IPLKYNVLGQ ETTGPGGFMK
AQRTIPVLLD ICEDMKRLCP DAWLINFTNP AGIVTEAIKK YSSIKTIGIC SGANSMLMDI
AKAYDVQKDD IYTRIIGLNH LIFADKIFLK GEDITDDFIK KLSAGKADNS LKNIPDIGFS
AKFTEALHMY PISYLKYFFL NREMVEIAKK DEAEKGTRGE QTKAIEHNLF ELYKDKNLDT
KPKELEKRGG AYYSETACSI ISSIYNNKKE IHVVNTLNNG TTSDLPDNVV IETNAVIDKD
GAHPVTYGKL PVKIRGLIQS VKAYEELTVE AAVTGSYDTA LLALSINPLV PSANVAESIL
NELLEVNKKY LPQYFK
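
The gene and protein sequences can be checked against each other by translating the nucleotide sequence with the listed translation table (11). The sketch below is illustrative only. It assumes the amino acid assignments of NCBI table 11, which match the standard code, and it does not handle alternative start codons. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
# Codon table built in NCBI base order (T, C, A, G); assumed to match
# the amino acid assignments of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGAAAGGAC TAAAAATTAC TACTATCGGC"))  # MKGLKITTIG
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared with the protein sequence and the GC content field in the record.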