Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0819 |
Symbol | |
ID | 8533960 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 891222 |
End bp | 892256 |
Gene Length | 1035 bp |
Protein Length | 344 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 646383207 |
Product | CRISPR-associated protein Cas1 |
Protein accession | YP_003262712 |
Protein GI | 261855429 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1518] Uncharacterized protein predicted to be involved in DNA repair |
TIGRFAM ID | [TIGR00287] CRISPR-associated endonuclease Cas1 [TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.606535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACACAG TCCAAAACAC ACTCTACATT ATGACCCCCA ATGCTTATGT GCATCTGGAG AATGCCACAG TGCGCATTGA TGTTGAGCGC GAAAAAAAGC TTCAGGTGCC ACTGCACCAT CTAAATGGGC TCGTTTGCTT TGGCAATATC ATGATCTCGC CTGCACTGAT GCATCGCTTG GCAGATGATG GCAAATCATT GGTATTAATG GATAGTTCAG GGCGATTCAA GGCGCGTCTT GAAGGCCCGG TTTCAGGGAA TATCTTGCTG CGGCAAGCGC ATCACCGCCA AGCATCCGAT GCTGCCTTTG CGCTGGAAAT AGCGCGCACC ATCGTGTCAG GCAAACTCAA AAACAGTCGC AGCGTCGTGC AGCGTGGCGC ACGCGAAACC AGTGATACCA TCGAAACAAC CCAGCTCACG CGCAGCGCCG ATAATCTTGC TGCTTCATTG CGTGCTGCTG CGGCAGCCAC CTCAATGGAC GAACTCCGAG GCATTGAAGG CGAAGCGGCT CGCGGCTACT TCTCCGCCAT TAATCTGATC GTCAAGACGG CCATGCGCGC CAATTTTCAG CTCAATGGTC GCACTCGCCG CCCGCCACTG GATCGTTTTA ACGCCCTAAT TTCCTTTCTC TACGCCATGC TTATGAATGA TTGCCGTTCC GCAATCGAAG CGACCGGACT CGATGCCCAG TTAGGCTTTC TGCACGCCGT TCGTCCAGGG CGTGCCGCAC TGGCACTTGA TCTGATGGAA GAATTTCGCG CTATTGCCGC TGACCGTCTT GCTCTCACGC TCATCAACCG GGGCCAGATC AATGCCAGCG ATTTTGACGA ACGAGAAGGG GGCGCCGTGA TGCTTAATGA CAGGGGCCGT CGTGCCGTTG TCACCGCTTG GCAAGAACGC AAGCAAGAAA TAGTCACCCA CCCGCTCACA GAAACAAAGA TCCCCATTGG CCTGCTCCCT TTCATTCAGG CGCGGTTCTT GGCGCGCACC ATCCGTGGTG AAATGGATGG CTATCTACCC TATCAGTCAA AGTGA
|
Protein sequence | MHTVQNTLYI MTPNAYVHLE NATVRIDVER EKKLQVPLHH LNGLVCFGNI MISPALMHRL ADDGKSLVLM DSSGRFKARL EGPVSGNILL RQAHHRQASD AAFALEIART IVSGKLKNSR SVVQRGARET SDTIETTQLT RSADNLAASL RAAAAATSMD ELRGIEGEAA RGYFSAINLI VKTAMRANFQ LNGRTRRPPL DRFNALISFL YAMLMNDCRS AIEATGLDAQ LGFLHAVRPG RAALALDLME EFRAIAADRL ALTLINRGQI NASDFDEREG GAVMLNDRGR RAVVTAWQER KQEIVTHPLT ETKIPIGLLP FIQARFLART IRGEMDGYLP YQSK
|
| |