Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2479 |
Symbol | |
ID | 5734360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3170109 |
End bp | 3171197 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641279619 |
Product | CRISPR-associated RAMP Csm5 family protein |
Protein accession | YP_001545245 |
Protein GI | 159898998 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1332] Uncharacterized protein predicted to be involved in DNA repair (RAMP superfamily) |
TIGRFAM ID | [TIGR01899] CRISPR-associated RAMP protein, Csm5 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCAACT ATGGATTAAC GATTGAAACG CTCTCGCCAG TGCATATCGG CGCTGGCGGG CCAGATTTGC GCCGCAACAT TGATTTTGCG ATCTTCAACA ATGTTCTCTA TCTGCTCAAT GTCGATGCGG TGCTTGAACA GATTTTGCCC GAAAATCCCA ACGACCGTTT GTATCAACAA ATTTTGAATA CACCCGATTT GGGCAGTTTT TTAACGGCTG ACCTGCTGAG CAAGCATCCA GAGCTGTATT ACTACAAACT TGAAGGCGTT TCCAAGCTAG AAACTATGCG GCCTGTGATT AAACACTGGA CGCATGCGCC CTACATTCCA GGTAGCAGCC TCAAAGGTGC TTTGCGCAGC GCATGGGTTC GCCAACACTA TCAGCAGCGC AACCTAATCC TAGATTTGCA GCAATTAAGC GATCGGCGCG AATGGGCTTT TCAGGCTCAG GAAGCACGTT TGCTTAGTCC TAAAGCATCA CGCCCAAGCC AAAATCCCAA CTATGACCTG TTTCGGGCAA TTCAGGTCAG CGATAGCCAG CCCGCGCCTG CTGATGCGCT ACGGCTTTAC AACGCGGTGG TGTTTCCAGC TGCCAATCAA GGTATTCCGC TTGATTTGGA AGCGATTAAG CCGCGTGTAG CCCTGCAAGC CCGAATCAAA TTCGATGATT ACATCTTGGA TAAGCAGGCC AAACAATTTG GGGTGCGCGA GCAAGGATTT AGCGTCGAGG CCTTGAAACA GGCTTGGCGC GAACAAGGCT TAGCGCGAAT TCAGCAAGAA TTAACCTTTT GGACTGGCCG CCGCGAAGGC GAGCATCTCC AGCAATTCTT TAGCAACTTG GCCCAACAGG CCAATACTGC GCCCGACAAC TGCTTCTATA TCGACATTGG CTGGGGCACA GGCTGGAAGA GCAAAACCCT TGGCGATATT ATCAAAACGC CGCAACTTGC CCAGTTGATG CGCCGCTATC GGCTGAGTCG CAAGGAATAT CGTGAAGGCG ATCGCTTTCC AAAAACCCGC CGCGCCGCTC GTGATACCAA AGGTGCGTTA CGCGTCCCAT TTGGCTGGGT CAAAGTAACG CTTAATTAG
|
Protein sequence | MTNYGLTIET LSPVHIGAGG PDLRRNIDFA IFNNVLYLLN VDAVLEQILP ENPNDRLYQQ ILNTPDLGSF LTADLLSKHP ELYYYKLEGV SKLETMRPVI KHWTHAPYIP GSSLKGALRS AWVRQHYQQR NLILDLQQLS DRREWAFQAQ EARLLSPKAS RPSQNPNYDL FRAIQVSDSQ PAPADALRLY NAVVFPAANQ GIPLDLEAIK PRVALQARIK FDDYILDKQA KQFGVREQGF SVEALKQAWR EQGLARIQQE LTFWTGRREG EHLQQFFSNL AQQANTAPDN CFYIDIGWGT GWKSKTLGDI IKTPQLAQLM RRYRLSRKEY REGDRFPKTR RAARDTKGAL RVPFGWVKVT LN
|
| |