Gene Athe_1884 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1884 
Symbol 
ID7408997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1991451 
End bp1993163 
Gene Length1713 bp 
Protein Length570 aa 
Translation table11 
GC content38% 
IMG OID643716256 
Producttype II secretion system protein E 
Protein accessionYP_002573745 
Protein GI222529863 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG2804] Type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway, ATPase PilB 
TIGRFAM ID[TIGR02533] general secretory pathway protein E 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000626903 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGCCAAGTG AAAGAAAGAG AATTGGAGAT GTTTTAGTAG AGGCAAAGAT AATTACACCT 
CAGCAACTGG AAGAAGCACT TAAAATTCAA AAGCAGACCA ATAAAAAATT AGGCGAGATA
CTGGTTGAGA AAGGCTATAT AACAGAAGAT GAGCTAATAG AGATTTTAGA ATTTCAGTTA
GGGATACCTC ACATAAAATT AGATGTATAT CCAATTGATC CTAAGGCTGT TGAAATGATT
TCTGAGTCAA TTGCACGAAG GCATACCGTT TTGCCAGTAA GTTTTGACGA AGATGGTAAC
TTAATTGTAG CAATGGCTGA TCCTCTCAAC ATATTTGCAA TGGAGGATAT TGAGATTTAT
TCAGGCAGAA GAGTGCGACC ACGAATTGCC AAGGCATCCG ATATTAAACG TGCAATTGAA
AGATTCTATG GTAAGCAAGA AGCTTTAAAA GCAGCTGAAG AGTTGCAAAA AGAAAGTAGC
GAAAAGGACA GCCAAGCCAA AAGAGCTACT ATTACTCCTC GATTCCAACT TGGTTTGGAA
GATGGAACTG AAGGACCTAT TGTAAGGCTT GTCAATTCGA TTTTTGAACA AGCTATTACA
TCGCGTGCAA GCGACATTCA TATTGAACCG TTTGAGAACG AGATAAAAGT GAGATACAGA
ATAGATGGTG TATTGTATGA TGTGTTGAAG TTAGATATTG GGATATTATC ATCATTGGTG
GCAAGAATTA AAATTATAGG TAATATGGAC ATTGCGGAAA AGAGAATACC CCAGGATGGG
CGAACGACTT ACATTTTTTC GGATAAAATA TATGACATGA GAATCTCCTC TTTACCATGT
GTTTATGGTG AAAAGATAGT TGTGCGTGTA ATAGACAAAA GTGCATTTGT GCGTTCTAAA
GCTGAGCTTG GCTTGACTGA AGAGGACGAA GAGAAATTTA ACAAGCTGAT TGCTGCGCCA
CATGGAATAA TCTTGGTATG TGGTCCTACC GGTAGTGGTA AATCTACTAC GCTTTATACT
ATTTTAAACG AACTTAATAC AGGAACACGC AACATAATAA CTGTTGAGGA TCCTGTCGAA
AGTACCATAG AAGGTATTAA TCAAGTAGAG GTCAATACCA AAGCAGGGCT AACATTTGCA
GCGGCGCTGA GATCAATCTT GCGACAGGAC CCAGATATAA TTATGATTGG TGAGATCCGA
GACAGAGAGA CAGCTGACAT GGCAATCAGG GCTGCTATTA CAGGACATTT AGTGTTGTCA
ACCATTCATA CAAATGATGC AGCAAGTGCA ATTACAAGGT TGGTTGACAT GGGGATAGAA
AACTTTTTGA TAAGCTCGTC GTTGGTAGGG GTAATATCAC AGAGATTGGT AAGAAAACTG
TGTTCATATT GTAAAGAGCC TTATGAGCCA TCAGAGGAAG AAAAAATTCT TCTTGACATA
AAGCAAGATG AGAATGTAAA ATTATACAGA AAGAGAGGAT GTCATATATG TGATAGAAAA
GGTTACTATG GTAGAACAGG TGTATATGAA ATTCTGATTG TGACAAAAGA GTTGAGAAAA
CTTATAAACA AAAAAGATGT CAGCAGTGAG GAAATAAAGG AACTTGCTGT CAAACAGGGG
ATGAAGACAC TGCGACAAGC TTGCAGAGAG AGAGTTTTGA ATGGAATTAC ATCAGTTGAA
GAATATCTGA AGATTACTTA CGCACTTGAA TAG
 
Protein sequence
MPSERKRIGD VLVEAKIITP QQLEEALKIQ KQTNKKLGEI LVEKGYITED ELIEILEFQL 
GIPHIKLDVY PIDPKAVEMI SESIARRHTV LPVSFDEDGN LIVAMADPLN IFAMEDIEIY
SGRRVRPRIA KASDIKRAIE RFYGKQEALK AAEELQKESS EKDSQAKRAT ITPRFQLGLE
DGTEGPIVRL VNSIFEQAIT SRASDIHIEP FENEIKVRYR IDGVLYDVLK LDIGILSSLV
ARIKIIGNMD IAEKRIPQDG RTTYIFSDKI YDMRISSLPC VYGEKIVVRV IDKSAFVRSK
AELGLTEEDE EKFNKLIAAP HGIILVCGPT GSGKSTTLYT ILNELNTGTR NIITVEDPVE
STIEGINQVE VNTKAGLTFA AALRSILRQD PDIIMIGEIR DRETADMAIR AAITGHLVLS
TIHTNDAASA ITRLVDMGIE NFLISSSLVG VISQRLVRKL CSYCKEPYEP SEEEKILLDI
KQDENVKLYR KRGCHICDRK GYYGRTGVYE ILIVTKELRK LINKKDVSSE EIKELAVKQG
MKTLRQACRE RVLNGITSVE EYLKITYALE