Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_1851 |
Symbol | |
ID | 8535009 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | - |
Start bp | 1986171 |
End bp | 1987316 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 646384232 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_003263720 |
Protein GI | 261856437 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.641505 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGCAA GCGATTTTCA CAGCCGCCTG CTCGACTGGT TCGATCGGCA CGGCCGGCAC GATTTGCCCT GGCAGCATCC CAGGACACCC TACCGCGTCT GGATCTCCGA GATCATGTTG CAGCAAACGC AAGTGGCGAC GGTGATCGGG TATTTCAATC GATTCATGCA GCGCTTCCCA TCGTTGGACG TATTGGCTGC CGCACCGGTG GATGAGGTGT TGGCGTTGTG GTCTGGACTT GGCTATTACG CCCGCGCCCG CAATCTGCAC GCCGCCGCGC AAATCATGGC GCAGCAAGGC GTTCCAGAAA CGCGCGCCGG TTGGCAGGCA TTGCCTAGTG TAGGGCCTTC CACAGCCGCT GCGATCATGG CGCAGGCATT CGATGTGCCG GAAACGATTC TGGATGGAAA CGTTAAACGC GTGCTGGCAC GCCATGCCGG TATAGATCGT CCCATCGAGC AGGCTTCGAC TATTCAGGCG CTGTACGAAG TTGCCAAATT ACATACGCCG CAGACCCGCG TGGCCGATTA CACGCAGGCT ATTATGGATC TCGGTGCAAC CCTGTGCACA CGCCATTCGC CGGGATGCTC GGCTTGCCCC GTTTCTGCGG ACTGCGTGGC GTTCGCGTCC AATCGGGTTG AATCCTTGCC CGTTCGGCGC AGGCGTCAAC CCGTGCCGAC CCGTCGTGCT GTCTTCATGG CGATCGAGGA TGAAGCGGAA CGCCTGATGC TGATTCGCCG TCCTCCGACG GGGATCTGGG GTGGGTTGTG GTGTCTGCCC GAGTATATTC CCGCCGATGA CTGTGTGTCG CAGGAACACA CACTCGATTC GGCCGTTTCA GGTAAACAGG CGAAAGATAA TTCAGGCCAG CAGTCGATGC TTTCGATACA TCGGCTGAAA CAGCGCGGCC CAACTTCGCC CTTGACAACA TTCGAACATC GATTCACGCA TTATTTGCTC GATGCCCGCA TTGATCACAT GGTGGTTTCT CGCACATCGA GCGTCGAGGA CAATCCCGAT GTGCTCTGGT TGCCATTGAT TGAGTTGGCA GCGCGCGCGC CGCTTTTGGG GTTGCCCAAA CCAATGAGCC GTTTTTTGTC GGACTATCCG GCGCTGAAGT CATCCCAGAG TTCATGCACG CCATAA
|
Protein sequence | MSASDFHSRL LDWFDRHGRH DLPWQHPRTP YRVWISEIML QQTQVATVIG YFNRFMQRFP SLDVLAAAPV DEVLALWSGL GYYARARNLH AAAQIMAQQG VPETRAGWQA LPSVGPSTAA AIMAQAFDVP ETILDGNVKR VLARHAGIDR PIEQASTIQA LYEVAKLHTP QTRVADYTQA IMDLGATLCT RHSPGCSACP VSADCVAFAS NRVESLPVRR RRQPVPTRRA VFMAIEDEAE RLMLIRRPPT GIWGGLWCLP EYIPADDCVS QEHTLDSAVS GKQAKDNSGQ QSMLSIHRLK QRGPTSPLTT FEHRFTHYLL DARIDHMVVS RTSSVEDNPD VLWLPLIELA ARAPLLGLPK PMSRFLSDYP ALKSSQSSCT P
|
| |