Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnec_1621 |
Symbol | |
ID | 6183736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polynucleobacter necessarius subsp. necessarius STIR1 |
Kingdom | Bacteria |
Replicon accession | NC_010531 |
Strand | - |
Start bp | 1423167 |
End bp | 1424312 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 641672138 |
Product | A/G-specific adenine glycosylase |
Protein accession | YP_001798309 |
Protein GI | 171464196 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 0.865832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAAAAA CGTTGATCAA CAACTTTGCG CCAAAGTTAA TTGCCTGGCA TGGGGTCAGT GGGCGCTCGA GCTTGCCTTG GCAAGGCAAT CGAGATCCTT ACGCAGTCTG GGTCTCCGAA ATTATGTTGC AACAAACACA AGTAACTACC GTACTTGAGC GCTACCCACG TTTTATGAAA CGTTTTCCTA CGGTTAAAAA ATTAGCCGCA GCTGATATTG ATGAAGTATT GGCGGAGTGG GCTGGCTTGG GTTATTACTC TCGCGCTAGA AATCTGCATG CCTGTGCAAA ACAAGTGGTG ACAGAATTTG GTGGCAAGTT TCCGAGTGAC CCGGTTTTGC TTGAGCAATT AAAAGGTATT GGGCGCTCTA CTGCTGGTGC AATTGCCGCC TTTGCATTTC ATGAGCGAGC ACCAATTTTG GATGTCAATG TTAAGCGTAT CTTGGCGCGT TTATTTGTAA TTGAAGGCGC CATTCAAGAT AAAGTAGTTA ATGATCAACT TTGGGGGTTG GCAGCAGATT TATTGCCAAG TAATTCTGCT GATATGTCGG TCTATACTCA GGCTCTGATG GATTTTGGAG CAACCTGGTG CACTTCACGT AAGCCGGTGT GTTTGGGTTC TGAAAAAAAG TGTCCATTCG AGAAAGACTG TCAGGCAAAT TTGAGCGATC AAGTCCTTCT ACTTCCGCAA AAAACGAAAA AGACCAAGTC GCCTGAATTC AACTGCAATA TGTTGTTGAT GCGTAGCGGC AATTCTGTTC TCTTGCAAAA GCGCCCCAAT AAGGCGATTT GGGGAGGTTT GTGGTCCTTG CCAGAATCAG TTTGGGTGCC CAAGGCGCGC GGTCCAGAAG TCGCTGATCT CAGTCCGGAA GACTTGTTTA CAGCCACTTT GCCAGAAGAA AAGATTGCAT CACTGATTAA GGCATGTAAG TCGACAATAA GGGCCAATCA AATTAAGCAC ATCTTTACCC ATAGACGTTT ATGGATGCAA ATTTGGCAGA CAAATTCCGT AAAAGAGTTG TCATTTTTAA ATCCAAATTT AAAGTGGGTG CCCTTAAGTC AGGTGGGGCG CTACGGCCTG CCGCAGCCGA TTAAGATTTT GCTACAGGAA TTGAGTCTAG TTCGCGATGA CGATCTAAAA AATTAA
|
Protein sequence | MSKTLINNFA PKLIAWHGVS GRSSLPWQGN RDPYAVWVSE IMLQQTQVTT VLERYPRFMK RFPTVKKLAA ADIDEVLAEW AGLGYYSRAR NLHACAKQVV TEFGGKFPSD PVLLEQLKGI GRSTAGAIAA FAFHERAPIL DVNVKRILAR LFVIEGAIQD KVVNDQLWGL AADLLPSNSA DMSVYTQALM DFGATWCTSR KPVCLGSEKK CPFEKDCQAN LSDQVLLLPQ KTKKTKSPEF NCNMLLMRSG NSVLLQKRPN KAIWGGLWSL PESVWVPKAR GPEVADLSPE DLFTATLPEE KIASLIKACK STIRANQIKH IFTHRRLWMQ IWQTNSVKEL SFLNPNLKWV PLSQVGRYGL PQPIKILLQE LSLVRDDDLK N
|
| |