Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_0614 |
Symbol | |
ID | 8533749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 658845 |
End bp | 660050 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 646383002 |
Product | restriction modification system DNA specificity domain protein |
Protein accession | YP_003262514 |
Protein GI | 261855231 |
COG category | [V] Defense mechanisms |
COG ID | [COG0732] Restriction endonuclease S subunits |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGGGA AGGTCGTACC ACTTAAGGAT CTTTTTCAAA TTGGATCTAG CAAGCGTGTT TTGAAGTCAC AGTGGAAAGC AGAAGGAGTG CCTTTCTATC GCGGACGCGA GGTCACGCGA TTAGCAATGG ATGGCTTCGT AGACAACGAG CTGTTTATTT CTGAGGCTCA TTATGCAGAG CTTGCAAATC AGTATGGAGC TCCGAGAACT GACGATATAG TCATCACAGC GATCGGAACT ATTGGCAATT CGTACATCGT CCAGGATGGC GACAGGTTCT ATTTCAAGGA TGCCAGCATC CTTTGGATGA AGAGAATCAG CGATGTCAGT AGCAAGTTCG TCAATTTTTG GTTGAAATCC ACCATGTTTC TCGATCAACT GGATCATGGG AATGGGGCAA CCGTAGACAC GCTGACGATT CAAAAACTCC AGAGCGTCCA GATATGGGTT CCCCCCATTG CCGAACAACA CCGCATTGTC TCCATTCTCG ACGAAGCCTT TGAAGGCATC GCCAAAGCCC GAGCCCATGC CGAACAGAAC CGCCAGAACG CCCGCGCCTT GTTTGAAAGC CACCTGCAAT CCGTGTTCAC GCAGCGGGGT GAGGGGTGGG CGGAAAAGTC GCTTGAGGAA GTGGTAGATG CGCAATGCAC ACTTTCATAT GGCATCGTTC AGCCGGGTCA CGAATACGCT AAAGGAATGC CGATTGTTCG TCCTACGGAC TTGACGGCAA AATTGATTAC GCTTAACGGA TTGAAACGTA TCGACCCAAA GCTGGCCGAT GGCTATCGCA GAACTACGCT GCGTGGCGGC GAACTTCTGC TCTGTGTTCG AGGAAGTACC GGAGTGTTGG CGGTCACATC CTCAGAACTT GCTGGCGCTA ACGTAACGCG CGGCATAGTT CCGATCATGT TTGATCCATC GTTACTTAGC CAAGATTTTG GCTATTTCCT GATGACTTCA GAGGCAGTGC AGAGCCAAAT CCGCATCAAA ACTTATGGAA CAGCGCTAAT GCAAATAAAC ATTGGGGATT TGAGAAAAAT TGCTGTCTCA TTTCCTCCGC TAAAGGAACA GGAAAGGATG ACGGCACAAC TCGAAGAGTT GTCTGCCGAA ACCCAACGCC TGGAATCAAT CTACCAACAA AAACTCGCTG CCCTCGATGA ACTGAAAAAA TCCCTGCTGC ATCAAGCCTT CTCCGGCTCA CTTTAG
|
Protein sequence | MKGKVVPLKD LFQIGSSKRV LKSQWKAEGV PFYRGREVTR LAMDGFVDNE LFISEAHYAE LANQYGAPRT DDIVITAIGT IGNSYIVQDG DRFYFKDASI LWMKRISDVS SKFVNFWLKS TMFLDQLDHG NGATVDTLTI QKLQSVQIWV PPIAEQHRIV SILDEAFEGI AKARAHAEQN RQNARALFES HLQSVFTQRG EGWAEKSLEE VVDAQCTLSY GIVQPGHEYA KGMPIVRPTD LTAKLITLNG LKRIDPKLAD GYRRTTLRGG ELLLCVRGST GVLAVTSSEL AGANVTRGIV PIMFDPSLLS QDFGYFLMTS EAVQSQIRIK TYGTALMQIN IGDLRKIAVS FPPLKEQERM TAQLEELSAE TQRLESIYQQ KLAALDELKK SLLHQAFSGS L
|
| |