Gene HY04AAS1_0542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHY04AAS1_0542 
Symbol 
ID6743338 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHydrogenobaculum sp. Y04AAS1 
KingdomBacteria 
Replicon accessionNC_011126 
Strand
Start bp476085 
End bp477842 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content33% 
IMG OID642750333 
ProductDNA-directed DNA polymerase 
Protein accessionYP_002121207 
Protein GI195952917 
COG category[L] Replication, recombination and repair 
COG ID[COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.00481022 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTTG TGTATGTAGA TAAAGAGCCT GTTTTAATAA AAGCTTTAGA CTATCTTTCC 
TCTGGTGATA TTTGGTTTAT AGACACAGAA ACCACGCCAA AAGATATAAG ACTTTTTCAA
GTAGGATTAG AGAGCGGTCC TATTTATGTG ATAGACTTTT TGTTTGTAAA AAGAGCTCCA
GAACTTATAA AAGATATAAT AGCTAAAAAG GGTGTAGCAG GACACAACTT AAAATATGAT
TTAAAGTATC TTATGAAATA CGATATACAT CCTTATACTA CGTTTGATAC TATGGTAGGA
GCACAGTTGA TAGGTTTAAA TAGAGTTTCT TTGGCGAGTG TTTACAATCA TTTTACAGGT
GAAAGTATCG ACAAAAAAGA GCAGTTTTCA AATTGGTCTT CAAAAGAGCT TACAGAAAGT
CAAATTTTTT ATGCTGCAAA GGATGTAGAG GTTTTGAGGC TTTTATACGA AAAGCTAAAA
AATGAATTAA ACAAAGAACC CACCATCATT GAGATATTAC AAAAATCAAG GGTGGCGAAG
GTTTTTGGAT TGGAAAGCAC ATACGCTATA ATAGAAATGG GGTTTGTACA GGAGCTTGCT
AAAATTGAAC ACACCGGAAT AGGAATAGAT ACAAAAGAAA TAGAGACTAT GAAAAAACAA
TTACAAAAGA AAACCCAAGA GCTTGCTATG AACTTTTATA TAAAGTATCG TATAGATATA
AGTAGTCCTA AAAAAGTAGG TGAGTTTTTA GAAAATCATT TAAATATTTC ACTTCCTCGT
ACCGATAAAG ACAATATAAT AACAGATGAT AGTGTGTTGA TAGAGCATCT TGACTATGAA
AACGAGAAAG CAAAAGATGT GATAAGTAGC GTATTGGAGT TTAGAAAGCT TCATAAGTTA
CAAGAAAAGC TATCAGAGAT TTTAGAGTAC AACGAAAACA ACCGTATACA TCCAGAGTTT
TGGCAAATAG GAGCCGTTAC TGGAAGGATG TCCTCTTCAA GACCCAATGT TCAAAACATA
CCAAGAGAAT TAAGAAGTAT TCTAAAAGCT AAAGACGGAT ACGTGTTTGT AATAGCTGAT
TTCTCACAAA TAGAACTAAG AATAGCAGCA GAGTATGTTA AAGATGAGGT AATGATAGAT
ATAATAAATA AAGGAGAAGA CCTTCACAAG TTTACAGCCT CATTAATTAC AGGTAAATCG
TTGGAAGATA TTACAAAAGA AGAAAGACAA AGGGCAAAAG CTGCCAATTT CGGTCTTATA
TACGGTATAT CAGAAAAATC TCTTTCTTTG TATGCAAGAA ACTCTTATGG GATTGATATG
TCTATAGAAG AAGCCAAAAG ATTTAGAGAG GTGTTTTTTT CTACATTCCA AGGGATAAAA
GCTTGGCACG AAAGGATAAA AAAAGAGCTA AAGGCAAAAG GTGAAATAAG GTTAAAAACT
ATCGGCGGAA AACCTATGAT AGCCTACACT TTTACCGATG CTGCCAATTA TCCAATACAA
GGTACTGGAG CAGAATTGTT GAAGCTTTCA GTTTTAATTT TTTCTCAAGA GCTTAAAAGA
GCTTTTCCAA GCATATTTCA CGAAGTAGCA AACGTTGTAA ACTTGGTACA CGATGAGATA
GTGGTGGAAG CAAAAGAAGA TTATAAAGAA GAAGTATCTA AGCTTTTAGA AAAATCTATG
AAAAAAGCTG GCTCTATACT TCTTAACAAT GTAAAAATAG AAACAGAAAT AGTTATCAAT
CACCGCTGGA CAAAGTAA
 
Protein sequence
MNFVYVDKEP VLIKALDYLS SGDIWFIDTE TTPKDIRLFQ VGLESGPIYV IDFLFVKRAP 
ELIKDIIAKK GVAGHNLKYD LKYLMKYDIH PYTTFDTMVG AQLIGLNRVS LASVYNHFTG
ESIDKKEQFS NWSSKELTES QIFYAAKDVE VLRLLYEKLK NELNKEPTII EILQKSRVAK
VFGLESTYAI IEMGFVQELA KIEHTGIGID TKEIETMKKQ LQKKTQELAM NFYIKYRIDI
SSPKKVGEFL ENHLNISLPR TDKDNIITDD SVLIEHLDYE NEKAKDVISS VLEFRKLHKL
QEKLSEILEY NENNRIHPEF WQIGAVTGRM SSSRPNVQNI PRELRSILKA KDGYVFVIAD
FSQIELRIAA EYVKDEVMID IINKGEDLHK FTASLITGKS LEDITKEERQ RAKAANFGLI
YGISEKSLSL YARNSYGIDM SIEEAKRFRE VFFSTFQGIK AWHERIKKEL KAKGEIRLKT
IGGKPMIAYT FTDAANYPIQ GTGAELLKLS VLIFSQELKR AFPSIFHEVA NVVNLVHDEI
VVEAKEDYKE EVSKLLEKSM KKAGSILLNN VKIETEIVIN HRWTK