Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2385 |
Symbol | |
ID | 5734266 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3037877 |
End bp | 3039172 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641279526 |
Product | RNA-directed DNA polymerase (Reverse transcriptase) |
Protein accession | YP_001545153 |
Protein GI | 159898906 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG3344] Retron-type reverse transcriptase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACCAC GCTATGAATC GGCGATGTTG GCCGAAATTG CCAGCGTCGA TAATTTGACT CGCGCTTGGA GCCATGTGCG GCGTAATATT CGCATTTCGC AACGTGGGCG CTCACATGGC CCCGATGCAG TGACGATTTT GGATTTTGAG GCCGCTTGGG TTGATCACAT GCAACAGTTG GCAATGGAAT TGCAAAGCCA AATTTATCGG CCTTTGCCAC CACGGCGGCT TTTTTTGGAT AAACGTGATG GTGGCAAACG CAGTATTGCG ATTCTCGCCG TGCGCGACCG GATTGCCCAA CGCGCGGTGT TGCAAATTCT TGAGCCAGAA ATCGAGCCAA CTTTTTTGGA TTGTTCGTAT GGCTTTCGGC CTTACGTTGG CGTGCCCCAT GCCCTAACCC GAATCGAACG CTATCGCCAG CAGGGTTTGC AATGGGTTGC CCACGCTGAT ATTAGCGATT GTTTTGGCAC AATCGATCAT CAAATTTTGC TCAGTCAATT ACATCAACGA ATTAGCGATC GGGCAGTCGT CGAATTAATT GGCCAGTGGC TAAGCGTTGG CGTGATGGAA GATGCCGCAA CCACCGAAGC CAGTAATTGG TGGGATGATG GCGAAGATTT GCTTGAACGC TTGGCGAAAC ATGGCGAAGA TCTGCTCTGG CCAAATCAGG GCTACCCTCA GGCTGGCCCA TCCTATGCTC CGCAAATGCT TGATTTTGAG GCCAACCGCA CCGATAGTTT GCGCAAACGA GCCTTGCAAG GTCTCGCCAG CAACGCCGCC CTGTGGGGCA TAACCCATAG CAAACGGGTT ATTAGTGGTT TGCGCAGCTT GGCTCCGTTA TTCAAACAAG TGCCTGGTGG CAGCCTAACA TGGGGCGCTG CTGGGATTGC AACCCTAGCC TTAATTCCGC TGAGCCAGCG TTTGTTGCGC CAACATGAAC GTGGCACGCT CCAAGGTGGG GCAATTTCAC CGATGCTCGC CAATATCTAC CTCGATTCCT TTGATCGGGC GATGACTGAG CGGGGCCATA TTTTGGTGCG ATTTGCCGAT GATTTTGTGC TACTGGGCGC ACATCAAGCG GCAGTTGAGC AAGCGCTTGC CGATGCAACC AATGTGCTCA AGCGCTTACG CCTCGCCACC AAAGAGAGCA AAACTGGGGT GCAGCATTTT AATGATGGCC TGACCTTCCT TGGACATCGC TTTGCCGCTC AACCACAGGA TGCAGCCGAC CGCTGGCCGA CCTTCGAAGC GGCTGAGCGG GCGATTAAAG AGCGTTTGCG CAAACCTCGT ACATAA
|
Protein sequence | MQPRYESAML AEIASVDNLT RAWSHVRRNI RISQRGRSHG PDAVTILDFE AAWVDHMQQL AMELQSQIYR PLPPRRLFLD KRDGGKRSIA ILAVRDRIAQ RAVLQILEPE IEPTFLDCSY GFRPYVGVPH ALTRIERYRQ QGLQWVAHAD ISDCFGTIDH QILLSQLHQR ISDRAVVELI GQWLSVGVME DAATTEASNW WDDGEDLLER LAKHGEDLLW PNQGYPQAGP SYAPQMLDFE ANRTDSLRKR ALQGLASNAA LWGITHSKRV ISGLRSLAPL FKQVPGGSLT WGAAGIATLA LIPLSQRLLR QHERGTLQGG AISPMLANIY LDSFDRAMTE RGHILVRFAD DFVLLGAHQA AVEQALADAT NVLKRLRLAT KESKTGVQHF NDGLTFLGHR FAAQPQDAAD RWPTFEAAER AIKERLRKPR T
|
| |