Gene Athe_2443 details

Gene Information

Locus tag: Athe_2443
Symbol:
ID: 7408067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 2586114
End bp: 2587529
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 44%
IMG OID: 643716806
Product: HNH endonuclease
Protein accession: YP_002574284
Protein GI: 222530402
COG category:
COG ID:
TIGRFAM ID:
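
The coordinates and lengths in this record can be cross-checked against each other. The following is a minimal sketch of that arithmetic. It assumes the start and end positions are 1-based and inclusive, and that the final codon is an untranslated stop codon; the record does not state either convention, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2586114
end_bp = 2587529

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1416 471
```

Under these assumptions the result agrees with the listed gene length (1416 bp) and protein length (471 aa).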


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTAATCT TCACCGTTGA CAAACATGGA AGACCAGGAC ACCCAACAAG AAGATTTGAT 
ATGGTAAGAA AACTGGTAAA GCAGGGTAGA GCAAAAATCA TCGGTGGTGG AGCTTCTGGC
AAACCACCGG TTGTGATGTT CCTCGACAGG GAGTTTGACT ACTCCAAAAC GATAGAAAGA
CGTCTCTTTG TAGTACTTGA CCCGGGATAC CACCACATAG GCTTTGCAGT ATGCGAACTT
CGCTGGGGCG TATTGATTGT CTACTGTATA GGGGTTTTAG AAACAAGAAT CCCTGAAATT
AAGGACTTGA TGACTAAAAG AAGGGGATAC AGACGAAACC GCAGGTACCA CTCAAGGTGC
AGAAAAAAAC GAATGTCCAA AAGACATAGT AGGGTCCTGA CAAAATTCAA AGCACCAAGA
AATGTAAGGA CAAAGGATAG AACAAATGCA ACACTTAGAC ATGGCATAGA AACCCACCTC
AACCTTTACA AAAAACTCTT AAAGTTCTTT CCATTCCCAG CAGAGCAGGT TGTGTTCGTT
ATGGAAGACA ACATCTTTGA TGTCAGAACA ATGACATGGG GTAAAACATA TGGTACAGGG
TATCAAAAAT CACCCAGAGT TCCAGCAGAG AAGAAGTGTA TTATCTGCGG TACAGAAGAC
AATCTGCAGA AGCACCATTT GATACAGCGT AAATGTGGTG GTACAGACGT TCAGGAAAAC
CTGGTGTACC TGTGCAGGGA CTGTCATGAA GATGTCCATG CTGGAAGAGT GTATATACCG
GTGGAAGGTG TCAGGCAGTG GCGTGCACTG GGTACGATGA ATGCGATAAT AGGTCAACTG
CGTGAAATAC CATGGCTGAA GTTCGTACCT GCATCTGACG CGGCACAGAT GAGAAAAAAA
CTGGGTCTTA AGAAAGGACA TGCAAACGAC GCTCTGGCAA CAGCAGCGGT CTTTTGCAGC
TGTACAGAAG CTGACAGAAC ACACATGATT GAGCTAACCC TGGTAAAGTT CAGAAGACAC
AACAGGGCAA GAATACATGC TGTAAGAGAC AGACTGTACA AGGTTGATGG TAAGATTGTG
GCGAAGAACA GACGTAAGAG GACAGACCAG AAAGAACCGT CCTTTGCAGA TATATCACCA
TTGCCACCGG AAATTCAAAG AAAACTCAAG GTATATCCCG GTACAAAGAT TCTTAACCCG
CTGCGAAAAG AAATGCCGAC TATAGCGGGT GATGTATGGA TTCACGAACC AACAGGCAAG
AGGTTTGTAA CAACAGGTGT GGTATCCCAG AAGTATTTGT ATTCGCCACA GCTAAAAAAG
ATAGTGGGAA AAATGTACGT TCAACCAGAA GAATGCAGGC AGGTACTCCA TAACGAAGGA
ATGGTTGTTA TGTACAACAG TCTATACCAC AGTTAA
 
Protein sequence
MVIFTVDKHG RPGHPTRRFD MVRKLVKQGR AKIIGGGASG KPPVVMFLDR EFDYSKTIER 
RLFVVLDPGY HHIGFAVCEL RWGVLIVYCI GVLETRIPEI KDLMTKRRGY RRNRRYHSRC
RKKRMSKRHS RVLTKFKAPR NVRTKDRTNA TLRHGIETHL NLYKKLLKFF PFPAEQVVFV
MEDNIFDVRT MTWGKTYGTG YQKSPRVPAE KKCIICGTED NLQKHHLIQR KCGGTDVQEN
LVYLCRDCHE DVHAGRVYIP VEGVRQWRAL GTMNAIIGQL REIPWLKFVP ASDAAQMRKK
LGLKKGHAND ALATAAVFCS CTEADRTHMI ELTLVKFRRH NRARIHAVRD RLYKVDGKIV
AKNRRKRTDQ KEPSFADISP LPPEIQRKLK VYPGTKILNP LRKEMPTIAG DVWIHEPTGK
RFVTTGVVSQ KYLYSPQLKK IVGKMYVQPE ECRQVLHNEG MVVMYNSLYH S