Gene Athe_1635 details


Gene Information

Locus tag: Athe_1635
Symbol:
ID: 7409465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1732726
End bp: 1733847
Gene Length: 1122 bp
Protein Length: 373 aa
Translation table: 11
GC content: 37%
IMG OID: 643716004
Product: protein of unknown function DUF111
Protein accession: YP_002573502
Protein GI: 222529620
COG category: [S] Function unknown
COG ID: [COG1641] Uncharacterized conserved protein
TIGRFAM ID: [TIGR00299] conserved hypothetical protein TIGR00299
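
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1732726, 1733847)
print(length, protein_length(length))  # 1122 373
```

Under these assumptions, the Start bp and End bp values reproduce the listed gene length of 1122 bp, and that length in turn gives the listed protein length of 373 aa.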


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.165553
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTCTCT ATTTTGATGC AATCTCAGGT CTTGCAGGTG ATATGCTTGT TGCATCACTG 
ATTGACTGTG GAGTTGATTT TGAGTATATA AAATCAGCTA TTTCCCTTTT AAAGCTTGAT
GTAGAACTTT CGGTTGAGGA GAAGTTTGTA AATGGTATTA AGGCAAAAAG GTTTATAGTA
AATACTCAAT CACATGAACA TGACCATCAC AGCGATAGCA CCCATCATCA CAGAACTTTT
AAAGACATAA AAAAGATGCT GATAGAAAGC TCTTTGCCAG AGGCTGTAAA AGAGATGTCG
ATAAAAATAT TTGAAAAGAT AGCTTTGGCA GAGGCTAAGG TGCACGGCAA AAACCCAGAA
GAAATTTCTT TCCACGAGGT TGGAGCAGAT GATTCAATAG TGGATATTGT TGCATTTTCT
CTTTGTATAG ACAATTTAAA GCCTGAAAAA ATAGTATTTT CGCCTCTTTG TGACGGCAGA
GGTTTTACAA AATCGATGCA CGGGATTATT CCCGTGCCAG CGCCAGCTGT TTTGGAAATA
GCAAGGCAAA ATGGTATTCC ACTTTCCACA AAGGATATTG AATCTGAACT TATAACCCCA
ACAGGCATTG GTATTGCAGC GGCAGTTGCA AGCTCGTTTT CCCAAATGCC AAATATGACC
ATCGAAAAGA TTGGGTATGG TGCAGGGACA AAAGAACTTC CTATTCCCAA TGTTGTGAGA
GCTATAGTTG GAAAAAAAAA ACTGAATTTT GATGGTGATT ATTTTGAGAT ATTTGCAAAC
GTTGATGATA TGACAGGCGA AGAGCTTTCG CTTGCGTTTG AAAAAATAAT GCAAAGCGGT
GCTTTGGATG TCTATTTTAC TCCAATTTTC ATGAAAAAAG GTAGGCCAGC TTACAAGATT
GGGGTCATAA CAAAAGCTCA AAACTTTGAG GATGTAGCCT CTGCAATATT TCGCTGGACA
TCTACAATTG GTATAAGATT TGTAAAAATG CAGAGAATAG AGATGGAAAG ACAGGAGAAA
AGAATTCAGG ATAATCCCGA GCTGAGATTA AAGATAAGTT CTTATGAGGA TATTAAAAGA
ATCAAACTTG AATTTGAAGA TATTAAAAAG TTAACTGAAT AA
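
The gene sequence is laid out in blocks of ten bases, so the whitespace must be stripped before any computation. The following is a minimal Python sketch of one way to do that and to compute a GC percentage for comparison with the GC content field. The helper names are hypothetical, and how the record rounds its value is not stated.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(seq.count(base) for base in "GC")
    return 100.0 * gc / len(seq)
```

Pasting the full gene sequence into `gc_percent` gives a value that can be set against the figure listed in the record.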
 
Protein sequence
MILYFDAISG LAGDMLVASL IDCGVDFEYI KSAISLLKLD VELSVEEKFV NGIKAKRFIV 
NTQSHEHDHH SDSTHHHRTF KDIKKMLIES SLPEAVKEMS IKIFEKIALA EAKVHGKNPE
EISFHEVGAD DSIVDIVAFS LCIDNLKPEK IVFSPLCDGR GFTKSMHGII PVPAPAVLEI
ARQNGIPLST KDIESELITP TGIGIAAAVA SSFSQMPNMT IEKIGYGAGT KELPIPNVVR
AIVGKKKLNF DGDYFEIFAN VDDMTGEELS LAFEKIMQSG ALDVYFTPIF MKKGRPAYKI
GVITKAQNFE DVASAIFRWT STIGIRFVKM QRIEMERQEK RIQDNPELRL KISSYEDIKR
IKLEFEDIKK LTE
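
The protein sequence can be derived from the gene sequence by codon translation. The following is a minimal Python sketch, assuming the forward-strand sequence as printed and reading frame 1. Translation table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons, so the standard codon table is used here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "ATGATTCTCT ATTTTGATGC AATCTCAGGT CTTGCAGGTG ATATGCTTGT TGCATCACTG"
print(translate(first_line))  # MILYFDAISGLAGDMLVASL
```

The output matches the first twenty residues of the protein sequence above. The gene sequence ends in GAA TAA, which this function reads as a final E followed by a stop codon.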