Gene Information |
Locus tag | Hhal_0626 |
Symbol | |
ID | 4709380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 702746 |
End bp | 703816 |
Gene Length | 1071 bp |
Protein Length | 356 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639855090 |
Product | hypothetical protein |
Protein accession | YP_001002213 |
Protein GI | 121997426 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1367] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01894] CRISPR-associated RAMP protein, Cmr1 family |
|
|
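The coordinate and length fields above are mutually consistent, and the check can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Hhal_0626.
length = gene_length(702746, 703816)
print(length)                  # 1071
print(protein_length(length))  # 356
```

Both results agree with the Gene Length (1071 bp) and Protein Length (356 aa) fields in the record.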
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.485642 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCCGGC AAGCTTGCAG GCATGCTGAC CACGTTGATC GCCGGGCGAT ACTGCGGCCA CGCAAGAGAG CGAAGAGCAC CATGGAGCAA CGACACCTGA CCCTGGAGCT GCTGACCCCG ACCTTTCTCG GGGATGCTCA GCAGACCGCC GCCTGGCGGA CGCCGCCAAT CAAGGCGCAG CTGCGCCGCT GGTGGCGGGT CGCCGCGTTC GCGCAGGGGA TGCGCCTACC CGAGCTACGC GCCCTGGAAG GACGGCTTTT CGGCGACGCC GCCGGGCAGC AGGGCCGCAA GAGCCAGGTT CGGCTGCGCC TACAGCCGCA TAAGGGACGG CCGGGCACGC AGCACAACAA CGCGTGGCGC CAGCAAGCCA ATCAACAGCC GTTACGGATG AATAACGGCT TGCCGGCCGA TGGCTACCTC GGTTTCGGCC GGGTCAAGAC CAAAGGTGCT AACGAGACCG CCCTGCCACC CGAGGAGAAG GCCGAGCTGC GCCTGGCCTG GCCGGCGGAA GCCGAGGGCG CCGAGGCCCT CGACGAGGCC CTCGGGCTCT TCCATCGGCT CGGCAGCCTC GGAGGACGCA GCCGCAACGG CTGGGGCGCC TGCCACCTGC ACGGCGCCGA GGCCATTGGC CTGGACGCCT ACAGCCGTCC CTGGGAAGAG GCCTTGGAAG AGGCCTGGAT CCACGCCCTC GGCAGGGATG GAAACGGCCT GCTGCTGTGG TGGACCCCGC CGTGCAAGAG CTGGGAGGCC GCCATGCGAC GGCTCGGCGA CCTGCGCAAG GGCCTGTGCG GCGAGGCGGG GGCGCTGCGG CCGCTGCTCT CCTGGCCGGT CACAGGCCAG CAGCAGCAGG GTCTCGACAA CCAGAACCGG ATCCCCAACA CCCTGCAGCT TAAGGTTGTC CGGGACGAGC AGGGGCAGCT GCGGGGACAA CTCGCGCACC TCCCCTGCCG ACCCGAGCCG AAGATCTGGG ATGCGCTACC CAAGGAGGTG CGCGACGACC ATCCTGCGCT CTGGCGCAAC GCCCACGCCT TCCTCGACCG GCACAGCGAT CTCGAGCGGG TGCACGCATG A
|
Protein sequence | MSRQACRHAD HVDRRAILRP RKRAKSTMEQ RHLTLELLTP TFLGDAQQTA AWRTPPIKAQ LRRWWRVAAF AQGMRLPELR ALEGRLFGDA AGQQGRKSQV RLRLQPHKGR PGTQHNNAWR QQANQQPLRM NNGLPADGYL GFGRVKTKGA NETALPPEEK AELRLAWPAE AEGAEALDEA LGLFHRLGSL GGRSRNGWGA CHLHGAEAIG LDAYSRPWEE ALEEAWIHAL GRDGNGLLLW WTPPCKSWEA AMRRLGDLRK GLCGEAGALR PLLSWPVTGQ QQQGLDNQNR IPNTLQLKVV RDEQGQLRGQ LAHLPCRPEP KIWDALPKEV RDDHPALWRN AHAFLDRHSD LERVHA
|
| |
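The relationship between the gene sequence, the translation table (11) and the protein sequence can be illustrated with a short sketch. It uses only the Python standard library. The function names are hypothetical. The codon table is built from the standard amino acid assignment, which translation table 11 shares, and the first codon is read as M when it is one of the table 11 initiation codons (this record starts with GTG). Whitespace in the sequence, as in the 10-base groups shown above, is ignored.

```python
from itertools import product

# Codon order T, C, A, G at each position, third position varying fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons of translation table 11 (bacterial, archaeal, plant plastid).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(seq):
    """Remove whitespace and upper-case a nucleotide sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean_sequence(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiation codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide sequence."""
    dna = clean_sequence(seq)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAGCCGGC AAGCTTGCAG GCATGCTGAC"))  # MSRQACRHAD
```

Passing the full gene sequence to translate_cds should reproduce the protein sequence listed in the record, and gc_percent on the same input gives the value to compare against the GC content field.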