Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2483 |
Symbol | |
ID | 5734364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3175004 |
End bp | 3176023 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641279623 |
Product | CRISPR-associated Cas1 family protein |
Protein accession | YP_001545249 |
Protein GI | 159899002 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0190242 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAACGC TTTATGTTCT CGAACAAGGT GCAGAAATTC GCTGTGACGG TGAACGGCTA GCAATTTGGC AAACTGATCA AGAGCTGGGC AATGTGCCAA TGGCCAAACT TGAAGATATT GTAGTGATGG GCAATATTGG GTTTAGCACG CCTGCGATCA AGCGCCTGTT GGATCAACAG ATCGAAGTAA CATTTTTGAC GATTCACGGG CGCTACCATG GGCGCTTGAT TGGCGAAGCG ACCGCCCATG TGGCCTTGCG TCGCAACCAA TATCGCCGAG CAGATGATGA GGTTTGGGCT TTGGCGATGG CTCAAGCCTG TGTTAGCGGC AAATTACGCA ATTGCCGTGC TGTATTGCAA CGCTTCGCCC GCAACCGCCA ACAGGTCGAA AAAGAGGTTT TGGAATCGAT TGAAGCCTTA GACCATTTTA TTGATCGGGT TGATCGCACC ACCAAAATCA GCTCATTGGT TGGGGTTGAA GGTAGTGGCT CGGCAGCCTA TTTTGGTGGT TTACGCGGCC TCTTTGATAG CGAATGGATG TTCAATAATC GTAATCGCCG CCCACCAACC GACCCAGTTA ATGTATTATT ATCGTTGGGC TATACCCTTT TGGTGCATAA AACCCTTGGC GCAGTTCAGG CGGTAGGGTT CGATCCTTAT CAAGGATTTT TACATCAACT CGATTACAAT CGACCATCGT TAGTGCTTGA TTTGATCGAA GAATTTCGGC CAATTTTGGT TGATGCTTTG GTGATTCGCT GCTGTAATGA TGGCAGGCTG ACGGCCAATG ATTTCAGCCC GAGCGACGAT CCCAAGCACC CGATTTTACT CAGCAATGAG GGCAAAAAAC GCTTTGTAGT GGCCTTTGAA GAACGCATGC GCACCGAAGT AACCCATCCC GATGGCGCAG ACGGACGGCC TGGCAAAGTC AGTTATTGGC GTTGTATCGA GCTTCAAGCC CGCTTATTGG CCCGCGCAAT CCAGACTGGC ACAAGCTATC AAGCTTGGAC AACCCGTTAG
|
Protein sequence | MATLYVLEQG AEIRCDGERL AIWQTDQELG NVPMAKLEDI VVMGNIGFST PAIKRLLDQQ IEVTFLTIHG RYHGRLIGEA TAHVALRRNQ YRRADDEVWA LAMAQACVSG KLRNCRAVLQ RFARNRQQVE KEVLESIEAL DHFIDRVDRT TKISSLVGVE GSGSAAYFGG LRGLFDSEWM FNNRNRRPPT DPVNVLLSLG YTLLVHKTLG AVQAVGFDPY QGFLHQLDYN RPSLVLDLIE EFRPILVDAL VIRCCNDGRL TANDFSPSDD PKHPILLSNE GKKRFVVAFE ERMRTEVTHP DGADGRPGKV SYWRCIELQA RLLARAIQTG TSYQAWTTR
|
| |