Gene Athe_0985 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0985 
Symbol 
ID7407886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1087349 
End bp1088584 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content40% 
IMG OID643715350 
Productcompetence/damage-inducible protein CinA 
Protein accessionYP_002572859 
Protein GI222528977 
COG category[R] General function prediction only 
COG ID[COG1058] Predicted nucleotide-utilizing enzyme related to molybdopterin-biosynthesis enzyme MoeA 
TIGRFAM ID[TIGR00177] molybdenum cofactor synthesis domain
[TIGR00199] competence/damage-inducible protein CinA C-terminal domain
[TIGR00200] competence/damage-inducible protein CinA N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000422375 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGGCTG AAATAATCTG TGTGGGTACA GAACTGTTGC TTGGTCAGAT TTTGAACACA 
AATAGTCAAT ATCTTGCTCA AAAATTGGCC GAGCTTGGTA TTGACCTTTA TTTTCAGACA
ACTGTTGGGG ATAATATGGA AAGGCTCAAA ATGGCAATTG ATATAGCCAC AAAAAGGTCT
GACGTATTGA TTTTTACAGG AGGGCTTGGT CCAACATCTG ATGACATTAC AAAAGAAGCA
GTAGCAGATT ATTTTGGTTT GACCCTTGTG CTGGATGAAG ATATATTAAG AAGAATTGAA
AGTTTTTTTG AACGCAGGCA GGTAAAGATG CCTCAGATTA ACAAAAAACA GGCATATGTT
CCCGAAGGTG CAAAAGTTCT TCACAACAAA AATGGTACAG CACCTGGACT TATCATTGAA
AAAGACGGCA AGATTGCAAT TTTGCTTCCT GGACCTCCTT TTGAGATGCA GCCGATGTTT
GAAGAAGAAG TCTTGCCTTA TTTAGAGAAG TTTTCAAAAC AAAAGATTTA CTCAAGAGTG
TTAAAGTTTA TTGGAATAGG TGAGTCTTCT ATTGAAGAGG CTCTGAAGGA TTTAATCCTC
TCTCAGACAG ACCCAACGAT GGCTCTTTAT GCAAAACCGT TTGAAGTTGA GCTGAGAATT
ACAACAAAAA AAGAGAGTGA AGAGCTTGCA AAATCACTTC TTCAATCGAT GGAATATAGA
ATAAGAGAGC GTTTAGGAGA GTATATTTAT GGTGTTGACA GACAGCTGCT GGAAGAAGTT
GTGATAGGCT TGCTTACAGA AAAGAAGTTA AAGGTTAGCG TTGCCGAGTC GTGCACGGGA
GGGCTTATCT GCAACAAGCT TACAAATGTG CCAGGCGCAT CCGAAGTATT TGACAGAGGG
TTTATAGTAT ATTCAAATGA GGCCAAGATG AAACTGCTTG GTGTTCCAGA GCAAGTGTTG
AAAGAGCACG GGGCAGTAAG TTCTCAGACA GCCAGGTTTA TGGCACAGGG AGCACTTTCA
AATTCGCTAT CAGATATTGC ACTGTCTGTG ACAGGAATTG CAGGGCCAGG CGGTGGGAGT
GAAACAAAAC CTGTAGGGCT TGTATATATT GGTATTGCAA CAAAAGATAA TGTTGAGAGT
TTTGAATTCA GGTTTTCGGG TGACAGATTA AGGATAAAAG AGATGACTTC AAAGGCTGCC
CTCAACATTT TGAGAAAAAA GATAATTGAT TATTGA
 
Protein sequence
MVAEIICVGT ELLLGQILNT NSQYLAQKLA ELGIDLYFQT TVGDNMERLK MAIDIATKRS 
DVLIFTGGLG PTSDDITKEA VADYFGLTLV LDEDILRRIE SFFERRQVKM PQINKKQAYV
PEGAKVLHNK NGTAPGLIIE KDGKIAILLP GPPFEMQPMF EEEVLPYLEK FSKQKIYSRV
LKFIGIGESS IEEALKDLIL SQTDPTMALY AKPFEVELRI TTKKESEELA KSLLQSMEYR
IRERLGEYIY GVDRQLLEEV VIGLLTEKKL KVSVAESCTG GLICNKLTNV PGASEVFDRG
FIVYSNEAKM KLLGVPEQVL KEHGAVSSQT ARFMAQGALS NSLSDIALSV TGIAGPGGGS
ETKPVGLVYI GIATKDNVES FEFRFSGDRL RIKEMTSKAA LNILRKKIID Y