Gene Pnec_1621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPnec_1621 
Symbol 
ID6183736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePolynucleobacter necessarius subsp. necessarius STIR1 
KingdomBacteria 
Replicon accessionNC_010531 
Strand
Start bp1423167 
End bp1424312 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content45% 
IMG OID641672138 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_001798309 
Protein GI171464196 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.865832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAAAAA CGTTGATCAA CAACTTTGCG CCAAAGTTAA TTGCCTGGCA TGGGGTCAGT 
GGGCGCTCGA GCTTGCCTTG GCAAGGCAAT CGAGATCCTT ACGCAGTCTG GGTCTCCGAA
ATTATGTTGC AACAAACACA AGTAACTACC GTACTTGAGC GCTACCCACG TTTTATGAAA
CGTTTTCCTA CGGTTAAAAA ATTAGCCGCA GCTGATATTG ATGAAGTATT GGCGGAGTGG
GCTGGCTTGG GTTATTACTC TCGCGCTAGA AATCTGCATG CCTGTGCAAA ACAAGTGGTG
ACAGAATTTG GTGGCAAGTT TCCGAGTGAC CCGGTTTTGC TTGAGCAATT AAAAGGTATT
GGGCGCTCTA CTGCTGGTGC AATTGCCGCC TTTGCATTTC ATGAGCGAGC ACCAATTTTG
GATGTCAATG TTAAGCGTAT CTTGGCGCGT TTATTTGTAA TTGAAGGCGC CATTCAAGAT
AAAGTAGTTA ATGATCAACT TTGGGGGTTG GCAGCAGATT TATTGCCAAG TAATTCTGCT
GATATGTCGG TCTATACTCA GGCTCTGATG GATTTTGGAG CAACCTGGTG CACTTCACGT
AAGCCGGTGT GTTTGGGTTC TGAAAAAAAG TGTCCATTCG AGAAAGACTG TCAGGCAAAT
TTGAGCGATC AAGTCCTTCT ACTTCCGCAA AAAACGAAAA AGACCAAGTC GCCTGAATTC
AACTGCAATA TGTTGTTGAT GCGTAGCGGC AATTCTGTTC TCTTGCAAAA GCGCCCCAAT
AAGGCGATTT GGGGAGGTTT GTGGTCCTTG CCAGAATCAG TTTGGGTGCC CAAGGCGCGC
GGTCCAGAAG TCGCTGATCT CAGTCCGGAA GACTTGTTTA CAGCCACTTT GCCAGAAGAA
AAGATTGCAT CACTGATTAA GGCATGTAAG TCGACAATAA GGGCCAATCA AATTAAGCAC
ATCTTTACCC ATAGACGTTT ATGGATGCAA ATTTGGCAGA CAAATTCCGT AAAAGAGTTG
TCATTTTTAA ATCCAAATTT AAAGTGGGTG CCCTTAAGTC AGGTGGGGCG CTACGGCCTG
CCGCAGCCGA TTAAGATTTT GCTACAGGAA TTGAGTCTAG TTCGCGATGA CGATCTAAAA
AATTAA
 
Protein sequence
MSKTLINNFA PKLIAWHGVS GRSSLPWQGN RDPYAVWVSE IMLQQTQVTT VLERYPRFMK 
RFPTVKKLAA ADIDEVLAEW AGLGYYSRAR NLHACAKQVV TEFGGKFPSD PVLLEQLKGI
GRSTAGAIAA FAFHERAPIL DVNVKRILAR LFVIEGAIQD KVVNDQLWGL AADLLPSNSA
DMSVYTQALM DFGATWCTSR KPVCLGSEKK CPFEKDCQAN LSDQVLLLPQ KTKKTKSPEF
NCNMLLMRSG NSVLLQKRPN KAIWGGLWSL PESVWVPKAR GPEVADLSPE DLFTATLPEE
KIASLIKACK STIRANQIKH IFTHRRLWMQ IWQTNSVKEL SFLNPNLKWV PLSQVGRYGL
PQPIKILLQE LSLVRDDDLK N