Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hneap_2023 |
Symbol | |
ID | 8535182 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothiobacillus neapolitanus c2 |
Kingdom | Bacteria |
Replicon accession | NC_013422 |
Strand | + |
Start bp | 2167684 |
End bp | 2168529 |
Gene Length | 846 bp |
Protein Length | 281 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 646384405 |
Product | formamidopyrimidine-DNA glycosylase |
Protein accession | YP_003263892 |
Protein GI | 261856609 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0266] Formamidopyrimidine-DNA glycosylase |
TIGRFAM ID | [TIGR00577] formamidopyrimidine-DNA glycosylase (fpg) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000058867 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGAAT TACCCGAAGT CGAAACCACC CGCCGAGGGC TCGAACCCCA TCTGCTCGGG CAGCGGATTA CGAGTGCAAC CGTATTCGAC TCCCGACTGC GCTGGCGCGT GCGTGATGAT CTTGCAGCAT GGCTCGAAGG TCGCCTAATC ATCGCCGTCT CACGGCGATC GAAATACCTA TTACTGCACT TTGAAAACGG TGAGCGCCTA CTGATTCATC TGGGTATGTC CGGCAGTCTG CGTATCGTGA CGCCCGATAT ACCGCGCAGA AAACATGACC ACGTCGAGAT CTGTATCAAT AGCAGTAAGA ATCTGCGTTT CCACGATCCG CGACGATTCG GCGCGCTGTT GACCGATCAT GAACAAGCCC CCCACATCCG ACTGCAAAAT CTAGGCCCCG AGCCACTCTC TGACGCATTC GACACCCATT ATCTCGGCAC TCAGCTACAC AAGCGCAAAC AAGCCATCAA ACCCTGCCTG ATGAATGCCG CGATTGTCGT TGGCGTCGGG AACATCTACG CGAACGAGGT GCTCTTTTTG TCCGGCATCC ACCCCGCAAC ACCGGCCCAC ACGCTCGATC ACAACCAAAT CAATCTTCTC GTTACGGCCA TCAAGAATGT ACTGGCCCGA GCCATTGAAC AGGGCGGCAC CACGCTCAGA GATTTTGTCC GCGAAGACGG GCAACCGGGC TATTTCAAAC AAACTCTGAA CGTTTATGAC CGGGCGGATC AACCCTGTCG GGTTTGCAGC ACCCCAATCG TTAAAACCGT GCAGGCGCAG CGCGCCACTT ATTACTGCCC TGTGTGCCAG CCGCCATTGG CAGATCGCTC CGGCCAGCGC ACTTGA
|
Protein sequence | MPELPEVETT RRGLEPHLLG QRITSATVFD SRLRWRVRDD LAAWLEGRLI IAVSRRSKYL LLHFENGERL LIHLGMSGSL RIVTPDIPRR KHDHVEICIN SSKNLRFHDP RRFGALLTDH EQAPHIRLQN LGPEPLSDAF DTHYLGTQLH KRKQAIKPCL MNAAIVVGVG NIYANEVLFL SGIHPATPAH TLDHNQINLL VTAIKNVLAR AIEQGGTTLR DFVREDGQPG YFKQTLNVYD RADQPCRVCS TPIVKTVQAQ RATYYCPVCQ PPLADRSGQR T
|
| |