Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5242 |
Symbol | |
ID | 5737200 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | - |
Start bp | 9353 |
End bp | 11116 |
Gene Length | 1764 bp |
Protein Length | 587 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641282406 |
Product | RNA-directed DNA polymerase |
Protein accession | YP_001547997 |
Protein GI | 159901752 |
COG category | [L] Replication, recombination and repair [V] Defense mechanisms |
COG ID | [COG1403] Restriction endonuclease [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.461209 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACGA CCCCACTTGA TGAGCGCGGA ACCGCGCTCA CCGCTTGGAT CAATCAGACG CAAGAAGTGT TAGCCCATCG AAGTCTCAAC CAACAACCCT TTCATCGGGT ATTCAATCTC ATGCGGACAC GGCGACTCGC CACGGTGGCA CTCAATCGGG TGCTTTCCAA CACGGGAGCA CGCACCGCAG GGATTGACGG AATGACCAAG AAGCATATCG CAACGGATAC AGAACAACAG GCATTGGTTC AGGAAATCTG GCACGACCTG ACAACCCATC AGTATCGGCC AGCTCCCGTG CGTCGGGTGT ATATCCCCAA AGCCAATGGA CAGCAACGAC CGCTCGGTAT TCCCACGATC AAAGATCGGG TGGTGCAAGA GATGGTACGG CTGATCCTCG ACCCGATCTA TGAAAGCACG TTTTATCGCC ATAGTTATGG ATTTCGCCCC TATCGGGCAA CCCATCACGC GGTGGTACGA CTCCGCGACC TGATCGGACG ACGAGGCTAC CAGATGGCCC TAGAAGGAGA CATCCGCGCG TGCTTTGACC GAATTCATCA CACCACCTTA ATCCGGATTC TACGCCGGAC AATCAAGGAT GAACGCCTGA TAACGGTCAT CCACCAGATG CTCAAGGCCG GAGTGATGGA CGATGGACAG TGGCGCGTAA CGGAGGACGG AACGCCACAG GGCGGAATTG TCTCGCCACT GCTGGCCAAC ATCTACCTGA ACGAGCTTGA CCAATGGGTA GCCAACCGAT GGGACACCTA CACCCCACTA GAGCGCTATT ACCATCGGAA AGCCGGAACG GGGTATCCCT GTCAGATAAC CCGCTACGCG GATGACTTTG TGGTATTGCT CCACGGCACA CACGCCGAGG CAACCACCTT GAAAACCGCG CTCGCGACGT TTCTCGCTGA CCACCTCCAC TTGGAATTAT CAGCGGAAAA GACGCTGATA ACGCCGGTGG AACAGGGCTT TGACTTTCTT GGATTTCACA TCCGGAAATA CCAAGACAGC ACACGGATAA CCCCATCACG GAAGGCGATT GCGACCTTCA AACGCGAGGC GGCAGACCGC ATCGGCAAAG GATTTCGGGA CAGTGACGAA GCGGGCATCG TAATGTTGAA CCACTACCTC ACCGGATGGG GCCACTATTA CCGACGAGTG AGCAGCTCAA CCACGTTTCG CAGTCTCGAC CACTACATTT GGTGGCGGGT GATGCGAACG ACGTTCCGGC TGCGACGCGG GCGGGGAGTC CGGCACTTTG GCACACACTG CCGAAGCCAT CGCAAACGAT ACCGTGACGG TCTCAACCGA AAACACGCAC ATCGACGAGG GGGCCATTAC GGCGTATGGG CAAACACCGC CCAAACGCGA GCCTACATTG TCACGAGCTT GGCGTTCCTG CCGATTGAAT ATGTCGCCTT ACACCCACAA CTGCATCCGT ACCGCAAAGC CGACCGGGCA AAACTCGACC AACGCAAACG CTTAGCGCTG CTGTTGGCGC GAAATAGCCA TCCGGAACGG CCCGCGAACC CTGCCTATGG GAAGGCGTGG GAACAGATAC GACAAGAGGT ACTCCAGATG AGCAACTACA CCTGCCAACA CTGCGGCACA CGCGTGCATC GTAGCACGGC AGAAATTGAC CACCGGATAC CGTTGAAACG CTTCACGCGG CGACAAACCG CGCACAAATT GGAAAATCTC CAATGCCTTT GCCGCGCATG CCATCTTCGG AAGCATGGCA AAGAACCACG ATGA
|
Protein sequence | MNTTPLDERG TALTAWINQT QEVLAHRSLN QQPFHRVFNL MRTRRLATVA LNRVLSNTGA RTAGIDGMTK KHIATDTEQQ ALVQEIWHDL TTHQYRPAPV RRVYIPKANG QQRPLGIPTI KDRVVQEMVR LILDPIYEST FYRHSYGFRP YRATHHAVVR LRDLIGRRGY QMALEGDIRA CFDRIHHTTL IRILRRTIKD ERLITVIHQM LKAGVMDDGQ WRVTEDGTPQ GGIVSPLLAN IYLNELDQWV ANRWDTYTPL ERYYHRKAGT GYPCQITRYA DDFVVLLHGT HAEATTLKTA LATFLADHLH LELSAEKTLI TPVEQGFDFL GFHIRKYQDS TRITPSRKAI ATFKREAADR IGKGFRDSDE AGIVMLNHYL TGWGHYYRRV SSSTTFRSLD HYIWWRVMRT TFRLRRGRGV RHFGTHCRSH RKRYRDGLNR KHAHRRGGHY GVWANTAQTR AYIVTSLAFL PIEYVALHPQ LHPYRKADRA KLDQRKRLAL LLARNSHPER PANPAYGKAW EQIRQEVLQM SNYTCQHCGT RVHRSTAEID HRIPLKRFTR RQTAHKLENL QCLCRACHLR KHGKEPR
|
| |