Gene Athe_1441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1441 
Symbol 
ID7408099 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1523763 
End bp1526315 
Gene Length2553 bp 
Protein Length850 aa 
Translation table11 
GC content34% 
IMG OID643715804 
ProductDNA polymerase I 
Protein accessionYP_002573312 
Protein GI222529430 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID[TIGR00593] DNA polymerase I 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACTGG TTATATTCGA TGGAAACAGC ATTTTGTACA GAGCCTTTTT TGCTCTTCCT 
GAACTGACAA CCTCAAATAA TATTCCAACA AACGCTATAT ATGGATTTGT AAATGTGATA
TTGAAATATT TAGAACAAGA AAAACCTGAT TATGTTGCTG TAGCATTTGA TAAAAGAGGA
AGAGAGGCAC GAAAAAGCGA GTACGAAGAA TATAAAGCTA ACAGAAAACC TATGCCAGAT
AACCTTCAAG TACAAATCCC TTATGTTCGA GAGATTCTTT ATGCCTTTAA CATTCCAATA
ATTGAGTTTG AAGGATATGA AGCAGATGAT GTAATCGGTT CACTTGTTAA CCAGTTCAAA
AATACTGGTT TGGATATTGT TATTATTACG GGTGACAGGG ATACTCTTCA GTTGCTCGAC
AAAAATGTAG TTGTGAAGAT TGTTTCAACA AAATTTGATA AAACAGTAGA AGATTTGTAC
ACTGTGGAAA ATGTTAAAGA AAAATATGGG GTTTGGGCAA ATCAAGTGCC TGATTACAAA
GCGCTTGTTG GAGACCAATC AGATAACATT CCCGGGGTAA AGGGAATTGG CGAAAAGAGT
GCCCAGAAGC TCTTGGAAGA GTACTCATCC TTAGAAGAGA TATACCAAAA TTTAGATAAA
ATTAAAAGTT CCATTCGTGA AAAGTTAGAA GCAGGAAAAG ATATGGCGTT TTTATCCAAG
CGCTTAGCAA CAATTGTATG TGATTTACCA CTAAATGTTA AACTTGAAGA CCTAAGAACA
AAAGAGTGGA ACAAGGAAAG GCTCTATGAG ATTTTGGTGC AGTTAGAGTT CAAAAGCATA
ATAAAACGGT TAGGACTATC AGAAGTTGTT CAATTTGAAT TTGTTCAGCA GCGAACCGAT
ATACCTGACG TTGAACAAAA AGAGCTTGAA AGTATTTCAC AAATAAGATC AAAAGAGATT
CCATTAATGT TTGTACAGGG CGAAAAATGT TTTTATTTAT ATGATCAAGA AAGTAATACT
GTATTTATAA CAAGTAATAA ACTTTTGATA GAGGAGATTT TAAAAAGTGA TACTGTGAAA
ATTATGTATG ATTTGAAAAA TATATTTCAT CAACTCAACC TGGAAGACAC TAATAATATT
AAAAATTGCG AAGATGTAAT GATTGCTTCC TATGTTCTTG ACAGCACAAG AAGTTCATAT
GAGTTAGAAA CGTTGTTTGT ATCTTACTTG AACACTGACA TAGAAGCTGT AAAAAAAGAC
AAGAAGATAG TCTCTGTGGT ACTTCTAAAA CGGTTATGGG ACGAGCTTTT GAGATTAATA
GATTTAAATT CATGCCAGTT TTTATATGAG AATATAGAAA GACCTCTTAT CCCAGTTCTA
TATGAAATGG AAAAAACAGG ATTTAAGGTG GATAGAGATG CCCTCATCCA ATATACCAAA
GAGATTGAAA ACAAAATATT AAAACTTGAA ACGCAGATAT ACCAGATTGC AGGTGAGTGG
TTTAACATAA ATTCACCGAA ACAGCTTTCT TACATTTTGT TTGAAAAGCT AAAACTTCCT
GTAATAAAGA AGACAAAAAC AGGATATTCC ACTGATGCCG AGGTTTTAGA AGAGCTTTTT
GACAAACATG AAATAGTTCC TCTTATTTTG GATTACAGGA TGTATACAAA GATACTGACA
ACTTACTGTC AGGGATTACT ACAGGCAATA AATCCTTCTT CGGGTAGAGT TCATACAACC
TTTATCCAAA CAGGTACAGC CACAGGAAGA CTTGCAAGCA GCGATCCTAA TTTACAAAAT
ATACCTGTAA AATATGATGA GGGGAAATTG ATACGAAAGG TTTTTGTACC TGAGGGTGGA
CATGTACTGA TTGATGCAGA TTATTCCCAA ATTGAGCTGA GAATACTTGC CCATATTTCT
GAAGATGAAA GACTTATAAG TGCTTTCAAA AATAATGTTG ACATTCATTC GCAGACAGCA
GCTGAGGTTT TTGGTGTAGA CATAGCCGAT GTTACTCCAG AGATGAGAAG TCAAGCTAAA
GCAGTAAATT TTGGTATAGT TTATGGGATT TCTGATTATG GTCTTGCAAG GGATATTAAA
ATTTCCAGGA AAGAAGCTGC AGAGTTTATA AATAAGTATT TTGAGCGTTA TCCCAAAGTT
AAAGAGTATT TAGATAATAC TGTTAAGTTT GCTCGTGATA ATGGATTTGT TTTGACTTTA
TTTAATAGAA AGAGATATAT AAAAGACATA AAATCTACAA ACAGAAACTT AAGGGGTTAT
GCAGAAAGGA TTGCAATGAA TTCGCCAATT CAGGGCAGTG CTGCTGATAT CATGAAATTG
GCAATGATTA AGGTTTATCA GAAACTTAAA GAAAACAATC TCAAATCAAA AATAATTTTG
CAGGTACACG ATGAGCTTTT AATTGAAGCC CCATACGAAG AAAAGGATAT AGTAAAGGAA
ATAGTAAAAA GAGAAATGGA AAATGCGGTA GCTTTAAAAG TACCTTTGGT AGTTGAAGTG
AAAGAAGGAC TGAACTGGTA TGAGACAAAA TAG
 
Protein sequence
MKLVIFDGNS ILYRAFFALP ELTTSNNIPT NAIYGFVNVI LKYLEQEKPD YVAVAFDKRG 
REARKSEYEE YKANRKPMPD NLQVQIPYVR EILYAFNIPI IEFEGYEADD VIGSLVNQFK
NTGLDIVIIT GDRDTLQLLD KNVVVKIVST KFDKTVEDLY TVENVKEKYG VWANQVPDYK
ALVGDQSDNI PGVKGIGEKS AQKLLEEYSS LEEIYQNLDK IKSSIREKLE AGKDMAFLSK
RLATIVCDLP LNVKLEDLRT KEWNKERLYE ILVQLEFKSI IKRLGLSEVV QFEFVQQRTD
IPDVEQKELE SISQIRSKEI PLMFVQGEKC FYLYDQESNT VFITSNKLLI EEILKSDTVK
IMYDLKNIFH QLNLEDTNNI KNCEDVMIAS YVLDSTRSSY ELETLFVSYL NTDIEAVKKD
KKIVSVVLLK RLWDELLRLI DLNSCQFLYE NIERPLIPVL YEMEKTGFKV DRDALIQYTK
EIENKILKLE TQIYQIAGEW FNINSPKQLS YILFEKLKLP VIKKTKTGYS TDAEVLEELF
DKHEIVPLIL DYRMYTKILT TYCQGLLQAI NPSSGRVHTT FIQTGTATGR LASSDPNLQN
IPVKYDEGKL IRKVFVPEGG HVLIDADYSQ IELRILAHIS EDERLISAFK NNVDIHSQTA
AEVFGVDIAD VTPEMRSQAK AVNFGIVYGI SDYGLARDIK ISRKEAAEFI NKYFERYPKV
KEYLDNTVKF ARDNGFVLTL FNRKRYIKDI KSTNRNLRGY AERIAMNSPI QGSAADIMKL
AMIKVYQKLK ENNLKSKIIL QVHDELLIEA PYEEKDIVKE IVKREMENAV ALKVPLVVEV
KEGLNWYETK