Gene Athe_1622 details

Gene Information

Locus tag: Athe_1622
Symbol:
ID: 7409452
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1722486
End bp: 1723568
Gene Length: 1083 bp
Protein Length: 360 aa
Translation table: 11
GC content: 36%
IMG OID: 643715991
Product: hypothetical protein
Protein accession: YP_002573489
Protein GI: 222529607
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1331] Highly conserved protein containing a thioredoxin domain
TIGRFAM ID:
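
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows the arithmetic. It assumes the coordinates are 1-based and inclusive, and that the last codon is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates copied from the record above.
length = gene_length_bp(1722486, 1723568)
print(length, protein_length_aa(length))  # prints "1083 360"
```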


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.681183
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAGAG GAACAAAGAG GCTTGCCCTT CCTTCTTTAA ACACAAAACA TCTTTTTAGA 
ATGACAGACT CAACAGGAAT TTTGCAGCAC GCTAAATTCT CAGTCCCAAA CTATAAAGAA
GGATATACAA CAGATGACAA TGCAAGAGCA CTGATTGTTG CCTTGAGGCT TTATGAAAAG
ACGGGAGACA AATTATACCT TGACCTTGTT TATAGATACA TGGCATTTTT ATACAATGCT
TACACTGAGG ATGGCTTTTT CAGAAACTTT ATGAATTATT CAAGGGTATT TCTGGACGAA
AAAGGCACGG AAGACTGTTT TGCAAGGTCT TTGATTGCTC TTTCGTATGT TTACAGCTCT
GAGATACTTG ATAGTTCTAT TAAAGAGCTT GCGTATGTGA TGTTAAAACG CTCGCTCAGA
AATGTTTTGC ACCTCAGCTA CCCGATTAGC ATTGCTTATT CAGTAGTTGC ACTTTCAATG
TTGCACGACA TAAAAGAGTT TTCAAGCGAA GCAAAAATGT ACTTAGAAGC TCTTTCTGAA
AAACTTTTAA ACTTTTACCA CAAACACTCA GATGAGAACT GGAAATGGTT TTCTGACAAG
CTCACATATG CCAATGCAAT AATTCCCTAT GCTTTGTTTA GAGCTTTTGC TGTTACAGAA
AAAGAAAAGT ATTTAAAGGT TGCAAAAGAA GCTCTTGATT TCTTGTCTGG TATTTTATTT
GAAAATGGAA TTCTAAGAGT TATAGGGAAC AGAGGATGGT ATGAAAAAGG AAAAGAACGT
CCTTATTTTG ATGAGCAACC AATTGATGCA TGTGTCTGTG TGATTGCCTA TACTGAGGCT
TATAAAATCA CTGAAGAAAA AGAATACAAA GAAAAAGCTC TCAAAGCTTT CAAGTGGTTT
TTAGGTGAAA ATATTCACAA AAAACCTCTG TATGATGAAA AAACAGGTGG ATGTAGAGAC
GGTATAGAAG AGGATGGCAT TAACCAGAAC CAGGGTGCAG AATCTACCAT TTGTTACCTC
TTAGCAAGGC TCTTCATCGA AGATCTTGTT AAAAGTGAAG AAAAGAATAA AGAGGTTGTG
TAA
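
A GC content figure like the one in the gene information can be recomputed from the gene sequence. The sketch below is one way to do that. It assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, and that GC content means the percentage of G and C bases over all bases. The function name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (the 10-base block separators and line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: pass the full gene sequence; only the first line is shown here.
first_line = "ATGAAAAGAG GAACAAAGAG GCTTGCCCTT CCTTCTTTAA ACACAAAACA TCTTTTTAGA"
print(round(gc_content(first_line), 1))
```

The value for a single line will differ from the whole-gene figure, so paste all sequence lines to compare with the record.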
 
Protein sequence
MKRGTKRLAL PSLNTKHLFR MTDSTGILQH AKFSVPNYKE GYTTDDNARA LIVALRLYEK 
TGDKLYLDLV YRYMAFLYNA YTEDGFFRNF MNYSRVFLDE KGTEDCFARS LIALSYVYSS
EILDSSIKEL AYVMLKRSLR NVLHLSYPIS IAYSVVALSM LHDIKEFSSE AKMYLEALSE
KLLNFYHKHS DENWKWFSDK LTYANAIIPY ALFRAFAVTE KEKYLKVAKE ALDFLSGILF
ENGILRVIGN RGWYEKGKER PYFDEQPIDA CVCVIAYTEA YKITEEKEYK EKALKAFKWF
LGENIHKKPL YDEKTGGCRD GIEEDGINQN QGAESTICYL LARLFIEDLV KSEEKNKEVV
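
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal, dependency-free version. Translation table 11 assigns the same amino acids to codons as the standard code (it differs only in which start codons are allowed), so the standard codon assignments are used here. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; identical for the standard code and table 11.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    bases = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGAAAAGAG GAACAAAGAG GCTTGCCCTT"))  # prints "MKRGTKRLAL"
```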