Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1918 |
Symbol | |
ID | 5733807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 2312314 |
End bp | 2313855 |
Gene Length | 1542 bp |
Protein Length | 513 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641279062 |
Product | hypothetical protein |
Protein accession | YP_001544689 |
Protein GI | 159898442 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1337] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTATTA ATGCAACTCG TATCATTCGT AAACGGATTA TTTTCAAGGC CGAGCTAGTG CTTACAAGTG CGGCGGTCTT CAGTAATGGT GATAGCGACC CAATTATTGA TATGATGATT CTGCGCGATA GTTGTGAGCC GCAAAAGGCG CTTTTGCCAG GTAGTAGTTT GGCTGGGGCT TTACGCAGTT ATTTAAAAGA TATAACCAAA GATGACACAG CGATTGATAA ATTATTTGGC TGTATCGGTG ATGAAGAGGT TGGTGAATAT TTTGGGCAAA GTCGGCTCTT GATTAGTGAT GCAGTCAGTC GTGAGCCGAT TCAGGCTGAA TTACGCGATG GGGTGCGGAT TGACCATGCC ACCAGAACTG CTGCTGATCA GGCAAAATAT GATTTAGAGG TGTTGCCAGC CGGAACGTGC TTTAGCTTAG AGCTTGAATT TATTGTACTG GAAGAAGTAC CAACTATCAA TCTGTTGCCA TTAGTAGTTC AAGCTTTGCA TGGCCTTCAG ACGGGCCAAA TTAGCCTTGG CATGAAGAAA AATCGTGGAT TGGGGCAATG TAGCGTTAAA GGTTGGGATA TTTATGAAAT AGATATGACC AAGCCTACTG AAATTTTTGG CTGGCTTGAG CGTGATGATA AGAACCCTAT TCCTACCGCT GCGCAACATT CTAATCTCTA TGACTATTTT GGGTTCCAAC CTGCCGATGC CAGCCGCTAT CCGGTAACGC TTACAGCAAA TTTTACCTTT GCTGATGATG CGATGTTAAT TCGTTCGGCT CTCCAAACCA ATAATCTAGA AGATCTGATC AATAATCCGG ATGATCAGAC TGTATCCAAG AAGATTCCCG ATCCTGTGCA TTTGCAAACA CGGGTTGATG GCACGTTGCA GCCCGTTATT CCTGGTACAA GTTGGGCTGG GGTTTTGCGC CATCGGGCTT TACGCATTTT GAATACTTTG AAAGTGGCAA CTGCCGAGCA GCAACTTGAT GAGTTATTTG GCTTTGTTAT AGAGCAGCAA GCTAAAGCAC AGGCCAGCCG AATCATGATC AAAGATAGCA TTATTCAGCA CCCTGCGACT GAACCATTAG TGCAAAACCG CATTGCAATT GATCGCTTTA CGGGTGGTGC GTTTGATGGG GCGCTGTTCA GCGAAATGCC GGTGTGGAAA ACCGACCAAA CTTGCGTCAC GCTTGAAATT TCGATCAAAC CACCCCGACC AAAAAAAGAA GCCCAACAAG ATCAGCAAAA ACCAGAAGAT ACGCCACAGC CTGATCCTAA GCCAACGCCT AAATTTAATC AGGCTGAAGT TGGCTTGTTG CTGTTATTGC TCAAGGATTT GTGGACTGGC GATTTGGCGA TTGGCGGAAC CAGTAGCATC GGGCGCGGGC GACTTCAAGG GCTTGAGGCC ACGTTAACCG TCGATGGTGC TGAATTTTGC TTCAAGCAAG CCACTGATGG TATCGGTTCA TTGCATATAA CAGGAACTGG CAAGCGAGAT CAATTACAAA TGTATGTTGA AGCGATCGGA GCGTCCTCAT GA
|
Protein sequence | MSINATRIIR KRIIFKAELV LTSAAVFSNG DSDPIIDMMI LRDSCEPQKA LLPGSSLAGA LRSYLKDITK DDTAIDKLFG CIGDEEVGEY FGQSRLLISD AVSREPIQAE LRDGVRIDHA TRTAADQAKY DLEVLPAGTC FSLELEFIVL EEVPTINLLP LVVQALHGLQ TGQISLGMKK NRGLGQCSVK GWDIYEIDMT KPTEIFGWLE RDDKNPIPTA AQHSNLYDYF GFQPADASRY PVTLTANFTF ADDAMLIRSA LQTNNLEDLI NNPDDQTVSK KIPDPVHLQT RVDGTLQPVI PGTSWAGVLR HRALRILNTL KVATAEQQLD ELFGFVIEQQ AKAQASRIMI KDSIIQHPAT EPLVQNRIAI DRFTGGAFDG ALFSEMPVWK TDQTCVTLEI SIKPPRPKKE AQQDQQKPED TPQPDPKPTP KFNQAEVGLL LLLLKDLWTG DLAIGGTSSI GRGRLQGLEA TLTVDGAEFC FKQATDGIGS LHITGTGKRD QLQMYVEAIG ASS
|
| |