Gene Information |
Locus tag | Haur_4400 |
Symbol | |
ID | 5736250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5624162 |
End bp | 5625133 |
Gene Length | 972 bp |
Protein Length | 323 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281562 |
Product | HhH-GPD family protein |
Protein accession | YP_001547160 |
Protein GI | 159900913 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
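The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 5624162
END_BP = 5625133


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 972
print(protein_length(length_bp))  # 323
```

Under these assumptions the results agree with the Gene Length (972 bp) and Protein Length (323 aa) fields.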
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGCGAAT TAAGTACCTT ACAAATCGAT CTGCTAGCGT GGTTTCAAGC CAATGGCCGC GATTTACCAT GGCGACGCAC CCGTAATCCT TACTATATTT TGGTGTCGGA AACCATGCTG CAACAAACCC AAGTTGATCG GGTGATTCCC AAATATGAGG CATTTTTGGC CCTATTTCCG ACGGTCGAGG CCTTGGCTAG TGCTTCAACC GCCGATGTCA TTCGTTCGTG GCAAGGCTTG GGCTACAATC GTCGCGCGGT CAATCTGCAA CGGGCGGCCC AAGCAATTGT CGCAGCAGGC TATCCCGCCG ATCCTGCTGG TTTCCCCGCT ACACCCGAGG GTTTGCGCAA TTTGCCAGGG ATTGGAGCTT ACACCTCGGG GGCGGTCGCA TGTTTTGCCT TCGAGCGTGA TGTGGCCTTT CTTGATACTA ATATTCGGCG TGTGGTACGA CGTTTGTTGG TCGGCCCCGA AGATGCTCCG CCTGAAACCA ATGAACAAAC GTTGATCGAT TATGCCCAAC AATTAATTCC CCAAGGCCAA GGCTGGGCAT GGAACCAAGC AATTATGGAA TTGGGAGCGT TAATTTGTAG TGCCGCCAAA CCTCAATGCT GGCGTTGCCC AGTCAATCAA CATTGCCGAG CCTACGCGAT TTGGCGCGAA GCCAACACGC AGCTTGATAT GTGGCAACCA CCAGTGATCA AACCACGCAA AAAGGCTGCC GAACAACCAT TTCATACATC AAACCGCTAT TTTCGTGGGC GCATTATCGA TGCCCTGCGA GCACTCGAAA CCCAGCAAAG CCTTGATCTG GCTAGTTTGG GGCCACAAGT CAAGCCAGAT TGGGTCGGCA ATGAGCATGA TCTCACATGG CTAGCCAAGT TGGTCAATGG TTTGGCCCAA GATGGCTTGC TGATTTGGCA GCAGCAACCA GAGCAGCAAT TAGCCGAGTG GAGTGTGCGA CTACCAGCTT AA
Protein sequence | MSELSTLQID LLAWFQANGR DLPWRRTRNP YYILVSETML QQTQVDRVIP KYEAFLALFP TVEALASAST ADVIRSWQGL GYNRRAVNLQ RAAQAIVAAG YPADPAGFPA TPEGLRNLPG IGAYTSGAVA CFAFERDVAF LDTNIRRVVR RLLVGPEDAP PETNEQTLID YAQQLIPQGQ GWAWNQAIME LGALICSAAK PQCWRCPVNQ HCRAYAIWRE ANTQLDMWQP PVIKPRKKAA EQPFHTSNRY FRGRIIDALR ALETQQSLDL ASLGPQVKPD WVGNEHDLTW LAKLVNGLAQ DGLLIWQQQP EQQLAEWSVR LPA
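Both sequences above are shown in space-separated blocks of ten characters, so the spaces need to be removed before any analysis. The sketch below is one possible way to do that and to compute a GC fraction and a codon split. The helper names are hypothetical, and only the first two blocks of the gene sequence are used as input.

```python
def clean_sequence(text: str) -> str:
    """Remove the whitespace that groups the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def codons(seq: str) -> list[str]:
    """Split into complete codons, ignoring any trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


# First two blocks of the gene sequence, copied from the record.
prefix = clean_sequence("ATGAGCGAAT TAAGTACCTT")
print(len(prefix))          # 20
print(codons(prefix))       # ['ATG', 'AGC', 'GAA', 'TTA', 'AGT', 'ACC']
print(gc_fraction(prefix))  # 0.35
```

The six codons shown encode M, S, E, L, S, T, which matches the start of the protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field.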