Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0645 |
Symbol | |
ID | 5732545 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 743593 |
End bp | 744579 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641277774 |
Product | integrase family protein |
Protein accession | YP_001543421 |
Protein GI | 159897174 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000112195 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACGATG ATTTTTCACT CGCATTAACC ACCCTTGCCG CAACTACCCC AACCGCAATC CAATCAACGA CTAATCCCTT CATTGCCTAT ATCGCCAGTC TGCAAAGTAC CAATAGTCAA CGCACGATGG AGAAGCAGCT GCATCGTTTG GCAACAATGA TGGGCTTTGC CGATGCGCAT GCAGTTCCGT GGTCGCAGCT GCGGGTTGAA CATACCCAAG CCTTGTGGGC CAAGCTCGCC AGCAGTAAAT CAGCAGCCAC TGCCAACCTG ATACTCTCGG CCTTGCGCGG GGTGCTGAAG ATGGCGTGGC GCATGGGCTT AATGACTGGG GAAGATTACA CACGAGCGGT TGATCTCAAA ACGGTCAAAG GGAACGCACC TGATGCCGCA GCGGGCCGAT CGCTCACCGC TGCCGAACTC CGCGCCTTAT TTGCCGCGTG TGCCGCTGAT TCATCACCCA TTGGGCGGCG TGATGCAGCG ATTCTAGCAT TGGCCTATGC TGGTGGTGGT TTGCGTCGGG CCGAGATCGT CAATCTCGAT CTGTCAGACT TCAATCCTGA AACGGCGATG CTGACGATTC GCGGGAAGGG CAATAAAGTG CGGACGGCGT ATGTGCGTGG TGGCGCTCGT GAGGCGCTCG ACGAGTGGTT GGCGGTGCGT GGCGACGAAG CTGGGCCACT GTTTTGGCGG TTACAAGCTG GCGGTGTGGC AGGCCAAATG TACGAGCGTC TCAGTGATCA GGCGATTTAT ATTTTATGTC AGCGGCGTGG CAAAGAGGCC AATGTACGGC ATTTCAGCCC GCACGATATT CGACGAACCT TTATCAGTGA TCAACTGGAT GCGGGAACCG ATGTCTTGAC CGTGGCTCGG CTGGCTGGCC ATAGCAATGC CAACACCACC AGCCGCTACG ATCGTCGGGG TGAACGGGCC AAACAAGCAG CAGCTGATGC TCTGCATGTG CCGTTTATTT CGCAACAGGA GTTGTAA
|
Protein sequence | MHDDFSLALT TLAATTPTAI QSTTNPFIAY IASLQSTNSQ RTMEKQLHRL ATMMGFADAH AVPWSQLRVE HTQALWAKLA SSKSAATANL ILSALRGVLK MAWRMGLMTG EDYTRAVDLK TVKGNAPDAA AGRSLTAAEL RALFAACAAD SSPIGRRDAA ILALAYAGGG LRRAEIVNLD LSDFNPETAM LTIRGKGNKV RTAYVRGGAR EALDEWLAVR GDEAGPLFWR LQAGGVAGQM YERLSDQAIY ILCQRRGKEA NVRHFSPHDI RRTFISDQLD AGTDVLTVAR LAGHSNANTT SRYDRRGERA KQAAADALHV PFISQQEL
|
| |