Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0699 |
Symbol | |
ID | 5732600 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 802653 |
End bp | 804107 |
Gene Length | 1455 bp |
Protein Length | 484 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641277829 |
Product | hypothetical protein |
Protein accession | YP_001543475 |
Protein GI | 159897228 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1769] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01888] CRISPR-associated protein, Cmr3 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATACAC GCCGCGAACG TTCTAAGCAA AAGCAACAAG CCAAAAAGCA AAATAGTAGC AATACCCAAG TCATTGCGCA AAATTTAGTA GCCCCAATCA ATGATCAAGT AGATGAATTG ATAAGTGTGT CCAACAATTC AATTCAACCT GAGGCAAGCG CGGATCAACC AGCCAATGCT GAAATTACAA CTCAGCAAGC GTGGCTGATT GAGCCACGTG ACCCGTTGAT TGTGCGCGAT GGCCGACCAT TTAACAATAC ACCTGGAGCA CGCGCCTATA CTCAGTCGTT TCCACCACCA TCGGTTTTGG CTGGGGTTAT TCGCTCCCAA ACCGCCTATG CTTCAAAACT CCAATTTACT CGTAACTCCG ATGACAACCG ACATCAGGGA AATAAATTAC AGCCAATTCA AATTTATGGG CCAGTATTAG TTGAGTTGTT AGAGCCAAAT GAAGCCAATG CTGAAGCTGA ATTCTTATGC CCTGCACCCG CCGATGCACT GTTGTTTGAG CCTGAATCAA AAATCGAAGA TAAACCATCC GAGTCTGTGC CTTGGCTCCG CCGTTTAATC CCAATTCAGG CTCCTGCGAA GCTCGTCAGT GATTTTAACG AGCTTGGCTT GGTTGGTTTA GTTGAACCCG ATCCGCGTAA GCCCGCCAAA GAGCAACCGC ACTTTTGGTA TTGGAAGCGT TTTTTGCAAT GGTTGCAAGA TCCAGCAGGA TTGATTGATC AAACTCAAAC TGCCGCAGTA CAAAAAAGCA CTGACTTGGG AATTCAAGGC TTGCCAATTG ATCAACGGAC GCATGTTGAA ATTAAACCTG ATAGCCAACA AGCCGAAGAT GGTCGTCTAT TTCAAACCCG TGGCTTGAGC TTTACCCAAG CCAATGATCA ACGATTAGCA GAAGCACGAC GTTTGGCATT ATATGTGCAC GCCGATTACC GCGATACAAC TGGTTTAACT ATTCCACAGC CAAGCATTGC ACCTTTGGGC GGCGAACGCC GTTTGGCGCG TTGGGAGCCA ACTGAGCGTC AGTTGCCAGC ATGCGATCAA GCTATTCTCG ATGCAATCGT AAAAGCCAAC GCCTGCCGTG TAATTCTGCT CACCCCAGCG ATGTTTGCAA AGGGCTATCG CCCAACTTGG CTATTCGAAG ATCCCCGAGG TGTCCAGCCA AAGCTTGCAG CAATCGCGAT CAAGCGTGCT CAAGGTATTT CAGGCTGGGA TTTAGCAATT CATAAGCCGA AACCAACCCG CCGCCTAGCT CCCGCTGGCA CGGTCTTTTT CCTTACATTC CCTGAAAAGG AAAATTCGGC TGCAATTGAA GCGTGGGTTC GCAACTATTG GATGCATTGT ATTAGCGATG CTGAGCAAGA TCGCCGCGAT GGCTTTGGAT TAGCAGTGCT CGGCGCGTGG GATGGGAAGT TGGCTAAAAT CAAGGGAGTA CCAAATGAAT CGTAA
|
Protein sequence | MNTRRERSKQ KQQAKKQNSS NTQVIAQNLV APINDQVDEL ISVSNNSIQP EASADQPANA EITTQQAWLI EPRDPLIVRD GRPFNNTPGA RAYTQSFPPP SVLAGVIRSQ TAYASKLQFT RNSDDNRHQG NKLQPIQIYG PVLVELLEPN EANAEAEFLC PAPADALLFE PESKIEDKPS ESVPWLRRLI PIQAPAKLVS DFNELGLVGL VEPDPRKPAK EQPHFWYWKR FLQWLQDPAG LIDQTQTAAV QKSTDLGIQG LPIDQRTHVE IKPDSQQAED GRLFQTRGLS FTQANDQRLA EARRLALYVH ADYRDTTGLT IPQPSIAPLG GERRLARWEP TERQLPACDQ AILDAIVKAN ACRVILLTPA MFAKGYRPTW LFEDPRGVQP KLAAIAIKRA QGISGWDLAI HKPKPTRRLA PAGTVFFLTF PEKENSAAIE AWVRNYWMHC ISDAEQDRRD GFGLAVLGAW DGKLAKIKGV PNES
|
| |