Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5218 |
Symbol | |
ID | 5737176 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 317663 |
End bp | 318643 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 641282382 |
Product | integrase family protein |
Protein accession | YP_001547973 |
Protein GI | 159901727 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 52 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCACAAC CGGAATCTGA ACCCATTGAA TCACTGATCG ATCGATGGTG GACCAATCCT GGGGACGATC TGACCGCCGC AACCCGCACA CGCTATCGTA GTGCGCTGCG CCGTCTTTGT CGGTGGTTTG CCGCCGCCGA GCGGCGGTCC TTGCTGCTGG CCGATCTGCA TCCGATCAGC CTCGCAGGCT ATCGCGAGGC ACTCAAGCAG ACCGACGCGG CGAGCACGGT CAATACGCAC CTCAGTGCGA TCCGCACCTG GTGCGTCTGG CTCGTTGATC AGGGCTATCT CGCGACGAAC CCAGCCCAAC GGCTGAAGCT CGTTAAGCGC ACGACACCAT CGGCACCGAA GGCGCTGAGT CCAGCCCACG TCAATGCCCT GTTGCGGCAG GCGCAATTCA CCCGGTATCC CCTGCGGAAT ACCGCCATTC TCCAAGTCCT GATTCAAACC GGCATGCGGA TCAGTGAATG TGCGGCGTTG TGCTGGCACG ATATCCAGTA CGGCGAGCGC AGTGGCCATG CCCTCATTCG CGCAGGCAAG GGCAATACCG TGCGCACCGT CCCGCTCAAC GAATCAGCAC GGTGTGCACT CGCCAGCTAT GTGGCACCGC TGCTTGGGGT GCAACCGTCG CTGCACAAGG TTGCGCGGGC ATGGCCGCAG CGACAGGAGG GTGATCCCCG CTGTCCCCTC TGGACGAGCG AGCGGCAGCA TGCGCTTAGT CTGCGGGAGA TGAGCCACAT GATCCATCAA CTCGTGCGCG ATACGTCTGC GCGGAAGCTG CTGCCAGCAA GCACCACGCC GCATAGCCTG CGCCATACCT TTGCTACCCG CTACCTCGCC CGACACCCCC ATGATCTCGT TGGATTGGCC CGGCTGCTGG GCCATCGTTC CATCACAACC ACGCAAATCT ATATCCAACC GACCGCAGAA CAACTTGCGG CACGTGTTGA TCAGATTGAT CTCAATGCCT ACGGCAATTA A
|
Protein sequence | MPQPESEPIE SLIDRWWTNP GDDLTAATRT RYRSALRRLC RWFAAAERRS LLLADLHPIS LAGYREALKQ TDAASTVNTH LSAIRTWCVW LVDQGYLATN PAQRLKLVKR TTPSAPKALS PAHVNALLRQ AQFTRYPLRN TAILQVLIQT GMRISECAAL CWHDIQYGER SGHALIRAGK GNTVRTVPLN ESARCALASY VAPLLGVQPS LHKVARAWPQ RQEGDPRCPL WTSERQHALS LREMSHMIHQ LVRDTSARKL LPASTTPHSL RHTFATRYLA RHPHDLVGLA RLLGHRSITT TQIYIQPTAE QLAARVDQID LNAYGN
|
| |