Gene Athe_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1600 
Symbol 
ID7409430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1695063 
End bp1697957 
Gene Length2895 bp 
Protein Length964 aa 
Translation table11 
GC content39% 
IMG OID643715969 
Productprotein of unknown function DUF1156 
Protein accessionYP_002573467 
Protein GI222529585 
COG category[L] Replication, recombination and repair 
COG ID[COG1743] Adenine-specific DNA methylase containing a Zn-ribbon 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000970911 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAAAT CTTTAATAGA GGTTCAATTT CCAGTTTCAA AACTTTCAAA GGAGGCATTT 
AAAGAGCGAA AAGCAGGTGC AGGACAGACA TTAACAGGAC TTGGCAAATG GTGGGGCAGA
AAACCTCTTG TGCTTGTGCG TGCACTACTT TTGGGAGTTC TTTTGCCTGC AACAGATGAC
CCCAAAAAAG ATATGAAAAT TTTTCTTAAG CTTATGACTA TGGATGAAGA GGGCTTGAAG
CTGAGAAGAA AAAGTAGCAT ATCAGCAAAG GATGCTTATG AGTTTGCAAC AGAAGAAGAA
AAGGCAAAAT ATTTTGATGT TGCAGATGAG GGGAAAATAA GCTACAAAAA AGATTTAAAA
AAAGCTGAGA GGGAAGGATT TCAAGAAAGA ATATTTAAAA GGATGCCATA TGAACAGAAG
CTAAGATATT GCAAGCGTCC CGAAGAGATA GAAAATCTGC CTAAGAGCGC TTGGGATGAG
ATAAATGAGC ATCTTGGAAC AAATGCATAT TCATATCAAG CCCTTGTTGA AGAGCTTGGC
AAAAAAAGAT TTGGACAACT TCCAACAGTT GGAGATTGTT TTTGCGGTGG AGGGAGTATT
CCATTTGAAG CAGCAAGGCT TGGGTTTGGT GTTTTTGCGT CAGACCTAAA TCCAATTGCA
ATGCTTCTTA CGTGGGCAGC TTTGAATCTT TTGAGCCTGC CCGAAGATGC GATTGAAAAG
CTAAAAGATT TTCAAAAAAG AATATTTGAG CAGGCAGACA AGATTGTAAC TCAGTGGCAG
ATTGAGCACA ACTCAAAAGG GCATAGAGCA AATGCATATT TATACTGCGT TGAGACAATT
TGTCCTGAAT GTGGGTTCAA AGTTCCTCTA CTACCTTCTT TGGTAATTGG CAAAAATTCT
AAAACTATTG CTGTGTTGCA CGAAAACCCA GCTAAAAAAG GGTTTGATAT AGAAATCAAA
ACGAAAGTCA GCCAATCAGA GCTCGAACAA GCAGCTAAAA ACGGAACTGT AAAGGATGGT
TATTTAATTT GTCCACATTG TAAAATGGAA ACAAGTATTT CGTCAATTAG GGGAGATAAG
GTTGATGAGA GTAGCAAAAC AATTTGGGGG CTAAGACGCT GGGAAAAACA TGAATTTGTG
CCAAGAGAAG ATGATGTTTT TCAAGAAAGA TTGTATTGCA TAAGATATGA GGATGAAAAA
GGGCAAAGAT ACTACAAAGC TCCGGATGAT GAGGATTTTG AAAGGGAAAA GAAAGTTATA
GAACTTTTGA AAGAAAGATT TGAGGAGTGG CAGCAAAAAG GATATATTCC AAGTGATATG
ATTGAAGAGG GTGAAGAAAC AAGCAGATTG TACAGAGAAA GAGGTTGGGC ATATTGGCAT
CAGCTTTTCA ATCTACGACA GTTACTTTTA CATGGGCTTT TGATGGAACT GATTGACAAA
AAGGCAAAGA CGAAAGAGGA GAAGATTGTG GGGCTACTGG GGGTTAATAG GTGTGTTACG
TGGAATTCTA AGTTATGTCT ATGGGACAAT ACGCGTGAGG ATAATGGGAA GAACACATTT
TACAATCAGG CATTGAATAC GTTGTTCAAT TTTAATGTAA GAGGATTGAC TGTTTTACCA
TGGTTTTTAG ACAGTTTAAA GCCTTATTTG TTGTCAAATA ATCACAAAAT AGTTTATCCT
ACCGATGCCC GCGATGTAAA TCAAGCCTGC CACATTTGGA TAACAGACCC GCCTTATGCT
GATGCAATAA ATTACCATGA GCTCTCTGAG TTCTTCTTAG CATGGGATAA GAAGTTTTTG
AAAGAAGTTT TTCCAGATTG GTATACAGAC AGCAAAAGGG CATTGGCAGT TCGGGGTGAC
CGCGAACTGT TTAAAACTGC TTTTACGGAG ATTCTTAAAA ACATAGTTTC TAATATGCCT
GAAAATGGCT ATTTTGTCCT CATGTTTACA CACCAGGACT CACAGGTTTT TGCAGACCTA
ACAGAAATTT TGCTTAACTC AGGGCTTTTG TCTGTCAATG CATGGAGCAT TGCAACAGAG
ACAGAGGACA ATATGTCAGA GGGCAATTTT GTCCAGTCAA CTGTGTGTGT TGTTTTAAAG
AAGATTGATA GAACTCAACT TGAGCCTGTA TTTATTGAAG AGCTATATCC TTTTGGCAAA
GAAGAGGTAG AAAGACAGAT AAAGCTCATG TATGAGCTTG ACAAAGATGA AGCAGAGCCT
AATTTTAGTC CAACAGATTT GGAGCTGTCA GGCTACTATG CAGCATTGCG TGTTTTGACA
TCCTGCAATC TTAAAGCAAC AAACCAGAAG ATTAAAGAGT TTTTGGATTC CATGCGCGAA
TATGCAAGCA GCTACATAGT ACCAGAAGGT TTGAAATACC TTGGGTTTGA CCAAGATACC
ATTTACGAAA TTTGGCGCAA GATGGAAAGC TATGAGAAGT TTTATATAAG AGGCATCGAA
TTTGAAACAA GGGGTGAAAA AAGAATAGGT GCATATCAAG ATGCTGCAAG GAGTCTTGGT
GTTGCTGATT ATGATGAACT TTTTGCAATT AAAAAATCAA ATTCGGCAAG ACTGAAGACA
GCAAGCGAGC TTGGGCAGGG ACTTCTTGAT ACCAAACATG CGTTTAGTAC AACAATCTTG
CGCTTGTGTC TTTTAGCGAT AAATAGCGCA ATAAAAAAAG ACCAGGAAAT AAATGACACT
GCCGAGGCGG TTGCACTTTC ACATGAGATG TTAAAGACCA AACTTGGAAC AAAATACTGG
AACAACAAAA CCAAGATAGA GATAATATTC AGATACCTTG CAAGGCTTGA GAAAATTGAT
GGCATGGAAC ACTGGCAAAA CGACTCAAAA ATAGCTTCAT ATCTTGCTGA GCGTGTGGCA
AACGATAGAC TGTAA
 
Protein sequence
MEKSLIEVQF PVSKLSKEAF KERKAGAGQT LTGLGKWWGR KPLVLVRALL LGVLLPATDD 
PKKDMKIFLK LMTMDEEGLK LRRKSSISAK DAYEFATEEE KAKYFDVADE GKISYKKDLK
KAEREGFQER IFKRMPYEQK LRYCKRPEEI ENLPKSAWDE INEHLGTNAY SYQALVEELG
KKRFGQLPTV GDCFCGGGSI PFEAARLGFG VFASDLNPIA MLLTWAALNL LSLPEDAIEK
LKDFQKRIFE QADKIVTQWQ IEHNSKGHRA NAYLYCVETI CPECGFKVPL LPSLVIGKNS
KTIAVLHENP AKKGFDIEIK TKVSQSELEQ AAKNGTVKDG YLICPHCKME TSISSIRGDK
VDESSKTIWG LRRWEKHEFV PREDDVFQER LYCIRYEDEK GQRYYKAPDD EDFEREKKVI
ELLKERFEEW QQKGYIPSDM IEEGEETSRL YRERGWAYWH QLFNLRQLLL HGLLMELIDK
KAKTKEEKIV GLLGVNRCVT WNSKLCLWDN TREDNGKNTF YNQALNTLFN FNVRGLTVLP
WFLDSLKPYL LSNNHKIVYP TDARDVNQAC HIWITDPPYA DAINYHELSE FFLAWDKKFL
KEVFPDWYTD SKRALAVRGD RELFKTAFTE ILKNIVSNMP ENGYFVLMFT HQDSQVFADL
TEILLNSGLL SVNAWSIATE TEDNMSEGNF VQSTVCVVLK KIDRTQLEPV FIEELYPFGK
EEVERQIKLM YELDKDEAEP NFSPTDLELS GYYAALRVLT SCNLKATNQK IKEFLDSMRE
YASSYIVPEG LKYLGFDQDT IYEIWRKMES YEKFYIRGIE FETRGEKRIG AYQDAARSLG
VADYDELFAI KKSNSARLKT ASELGQGLLD TKHAFSTTIL RLCLLAINSA IKKDQEINDT
AEAVALSHEM LKTKLGTKYW NNKTKIEIIF RYLARLEKID GMEHWQNDSK IASYLAERVA
NDRL