Gene Athe_1619 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1619 
Symbol 
ID7409449 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1720468 
End bp1721568 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content37% 
IMG OID643715988 
Productprotein of unknown function DUF34 
Protein accessionYP_002573486 
Protein GI222529604 
COG category[S] Function unknown 
COG ID[COG0327] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00486] dinuclear metal center protein, YbgI/SA1388 family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.15807 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTAAGTG CTCAGGAGAT AATTTCGTTT ATAGAAACCT ATTTTCCCAA AAAACTCTCA 
TATGAATGGG ACAACTGCGG TCTTCAGGTT GGAAGTTATT CAGACAAAGT AGATTCAGTT
TTGATATGTG TGGATGTGAC AGAGGAGGTC TTAAAAGAAG CTATCTTGCT TGGAGCAAAG
CTTATAATTT CCCATCATCC ACTTATTTTT CAGGGAATCA AAAGTATAAA AGATGACACA
CCAGAAGGAA GGATTATCAT AGATGCTATC AAAAACGGCA TAAATATATA TTCTGCTCAC
ACCAGTGCAG ATGTCTCGAA ACATGGTATA AACTACTGGC TTGCCAATCT CATAGGTCTT
GAAAACATTG AGGGTTTGAA CATCAAACAA AAAAGTGGGT ATTTTAAAGT TGTTGTGTAT
GTACCAGTAG ACTATGTACA AAATGTGTTA GAGGCAATGG CAAATGAAGG TGCGGGCTTT
GTTGGGAAAT ACAGCCATTG CTTTTTTGCA GTCGAAGGTG AAGGAAGTTT TAAACCTCAA
GAAGGTGCAA AACCTTTTTT AGGACAGGTG GGGAGGCTTG AAAAGGTTAA AGAGGTAAGA
CTTGAGAGCA TAGTGCCTGA AGATAAGCTC AAAAATGTAA TAAAATCGAT GTTAAAAGCT
CATCCTTATG AAGAAGTTGC ATATGACATA TACCGGCTTG AAAATGATAT ATCATATGAA
AGTTTAGGAG TTGTTGGAGA GAGAGAGGTT TTGGCAAAAG AACTTATCTT AGAGCTAAAA
CAAAAACTAA ACCTTGATTT TGTAAAAGCA AGCATTCAAA AAGATGCTTT TAAGAAGATA
GCCATTGTCA GTGGTTCTGG TAAAGACCTT ATAAAAGATG CATATTTCAA AGGTGCAGAC
TGTCTTATCA CAGGCGAAGT TGGTCACCAC GGGATTTTGC TGGCAAAGTC GCTATCGATG
AGTATAATAG AGCTTGGACA TTATGAGAGC GAGAAGGTGT TTGTGGATAT CGTTTACAGC
CTTTTTGAAG ACTTTAAGAA AAAAGATGAT CTGAAAATAT ATAAATCCAA AATCAATACC
AGCTTTACAA ACATTTACTA A
 
Protein sequence
MVSAQEIISF IETYFPKKLS YEWDNCGLQV GSYSDKVDSV LICVDVTEEV LKEAILLGAK 
LIISHHPLIF QGIKSIKDDT PEGRIIIDAI KNGINIYSAH TSADVSKHGI NYWLANLIGL
ENIEGLNIKQ KSGYFKVVVY VPVDYVQNVL EAMANEGAGF VGKYSHCFFA VEGEGSFKPQ
EGAKPFLGQV GRLEKVKEVR LESIVPEDKL KNVIKSMLKA HPYEEVAYDI YRLENDISYE
SLGVVGEREV LAKELILELK QKLNLDFVKA SIQKDAFKKI AIVSGSGKDL IKDAYFKGAD
CLITGEVGHH GILLAKSLSM SIIELGHYES EKVFVDIVYS LFEDFKKKDD LKIYKSKINT
SFTNIY