Gene Athe_1038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1038 
Symbol 
ID7409595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1131614 
End bp1133788 
Gene Length2175 bp 
Protein Length724 aa 
Translation table11 
GC content36% 
IMG OID643715404 
Productprimosomal protein N' 
Protein accessionYP_002572912 
Protein GI222529030 
COG category[L] Replication, recombination and repair 
COG ID[COG1198] Primosomal protein N' (replication factor Y) - superfamily II helicase 
TIGRFAM ID[TIGR00595] primosomal protein N' 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAGCTC AGGTTTGTAT TAACTACCAG GACGCCAATG TTGATAAGGT ATTTGATTAT 
TTGGTACCAA CACATCTTGA AAATAGCATT GAGATAGGGA AAAGAGTGTA TGTAAGCTTT
GGAGTTTCAA ATAGAATTGT GGAAGGGCTT GTTGTTGGGA TAAAGCAAAC TACTGATATA
GAGGAAAATA AGATAAAGTG CGTGCTTGCT GTAATTGATA AGTTTTCAAT TGTTTCAAAG
GAACAGATAG AACTTGCTTT TTCAATGAAA AATTACTATG CGTTGAATTT GGGTGAGGCT
TTGTCACTTG TTATACCTCC TTTTGTGAGC AGCAAACAGA TATATAATAT CTGTGCAAAA
AAGTGTGAAG AAAATAAGAA CTTAGATGAT GATCTAAAAG AACTGTATGA AAGTATTTTG
AAAAAACCTG TAAGCATAAA TTCAAAGCTT GTGAAAGAGA ACAAAGAAAA AATAGTAAAA
CTCTTTTTGG AAGGGCTTTT GGAGTTTGAT TTAAAGAATT TTGATACAAG AGAAAATACT
GAGAAAAATC TGCCGCGGGT AGAACCAGAG TTTAATTTAA CTGAAGAACA GAACAAAGCC
CTAAATAATA TAATCTCTGC GTTTGATGAA GGAGGATACA GAAATATTCT CTTATTTGGA
GTCACAGGAA GTGGGAAAAC AGAGGTTTAT ATAAGAGCGA TACAATATGT AATTGAAAAA
GGCAAGAGTG TCATATTTAT GGTACCAGAA ATCTCACTCA CACCACAGAT GATAGAAAAT
GTTCAAAGTA GAATAGGCAA CAAGGTTTTA GTATATCACA GCAAAATGAA AAGTATAGAC
AGGCTAAATA GTTGGCTTGC TGCCAGAAAC AAAGAGGCGG TTGTGGTGAT TGGTCCGCGA
TCAGCAGTTT TTGCCCCTGT CAAGAACCTT GGTCTTATAA TTGTCGATGA AGAGCATGAA
CCAAGCTATA AATCTGAAAA ATCGCCGCGG ATAAATGCTG TTGAGGTTGC CCAGATGAGA
GCTAAAATTA ATAATATACC CATTATACTT GGCTCTGCAA CTCCATCTAT TGAGCATTAT
TATTATGCTA AAAAAGGAAA GTATTCTCTT TGTACATTGA AAAATAGAAT AAACAAGACC
CTGCCAGAAG TTTTGATTGT TGATATGAAA AAAGAAATCT TAGAAGGTAA CAAGTCCATT
TTTAGCAGGC TTTTACTTAG TGAAATAGAG AACAACTTGA AAAAAGGGGA GCAGGTTCTC
CTTTTTTTAA ACAGGAGAGG TTATTCTCCA ATTGTTATAT GCCGTGAGTG TGGCTACGTT
TATATGTGTA AAAACTGCAG TATTTCACTT ACATACCACA AAGAGGGGTA TTTGAAATGT
CACTACTGCG GGTATAAAGA GGAATATAAA GGTGTGTGTA CAAAATGTAA CAGCAGGTAT
GTCAGACAAT ATGGTAGTGG CACCCAGAAG ATAGAGGAAG AGATAAAAGC GTACTTTAAA
GATGCAAGGG TTTTGCGTAT GGACAGTGAT ACAACTTCAA AAAAAGATGC GACAGAACAG
ATTGTGAAAA AGTTCAGGGA AAAAGAGGCA GATATTCTTG TTGGTACGCA GATGATTGCA
AAAGGTTTGC ACTTTCCTGA CTTGACCTTG GTGGGTGTGA TAGATGCAGA TATTCTTTTG
AACATGCCAG ATTTTAGGAG CAGAGAAAGA ACATTTCAAC TGCTTACACA GGTTGCAGGA
AGGTCTGGCA GGGAAAAACC AGGAAAAGTT ATAATTCAAA CTTTTAACCC TGAAGATTAC
AGCATTGTGT TTGCTTCAAA GCACGACTAT GAAAGCTTTT ATGCCCAGGA AATGAAACTG
AGAAAAATGA TGGTATATCC ACCGTATTCT TATGTAGTTA ACTTTGTTAC AGTAGCAAGG
GAAGAGAATA TGGCAAAAAG AGGAATAGAG CATGTATATG CTTTGCTCAA GGAGAATGAA
ATGGAGAATG ACATGAAAAT TTATGGTCCA AGTGAAAATC CCATCTTTAA AATAGAAAAC
CAGTACAGGT ATCACATATT GGTAAAGTTC AAAAGAGCTG GGCAGATGAT TAGTATAGCA
AATCTAATCA AAGAAAGATA TAATTATAGT AACGCGTCGC TTATCATCGA CGTAAATCCT
TTGGATACAC TTTAA
 
Protein sequence
MIAQVCINYQ DANVDKVFDY LVPTHLENSI EIGKRVYVSF GVSNRIVEGL VVGIKQTTDI 
EENKIKCVLA VIDKFSIVSK EQIELAFSMK NYYALNLGEA LSLVIPPFVS SKQIYNICAK
KCEENKNLDD DLKELYESIL KKPVSINSKL VKENKEKIVK LFLEGLLEFD LKNFDTRENT
EKNLPRVEPE FNLTEEQNKA LNNIISAFDE GGYRNILLFG VTGSGKTEVY IRAIQYVIEK
GKSVIFMVPE ISLTPQMIEN VQSRIGNKVL VYHSKMKSID RLNSWLAARN KEAVVVIGPR
SAVFAPVKNL GLIIVDEEHE PSYKSEKSPR INAVEVAQMR AKINNIPIIL GSATPSIEHY
YYAKKGKYSL CTLKNRINKT LPEVLIVDMK KEILEGNKSI FSRLLLSEIE NNLKKGEQVL
LFLNRRGYSP IVICRECGYV YMCKNCSISL TYHKEGYLKC HYCGYKEEYK GVCTKCNSRY
VRQYGSGTQK IEEEIKAYFK DARVLRMDSD TTSKKDATEQ IVKKFREKEA DILVGTQMIA
KGLHFPDLTL VGVIDADILL NMPDFRSRER TFQLLTQVAG RSGREKPGKV IIQTFNPEDY
SIVFASKHDY ESFYAQEMKL RKMMVYPPYS YVVNFVTVAR EENMAKRGIE HVYALLKENE
MENDMKIYGP SENPIFKIEN QYRYHILVKF KRAGQMISIA NLIKERYNYS NASLIIDVNP
LDTL