Gene Plim_3349 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlim_3349 
Symbol 
ID9140065 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePlanctomyces limnophilus DSM 3776 
KingdomBacteria 
Replicon accessionNC_014148 
Strand
Start bp4335465 
End bp4336610 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content53% 
IMG OID 
ProductA/G-specific adenine glycosylase 
Protein accessionYP_003631361 
Protein GI296123583 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.574146 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAGA CAAAGTTTCA AAAGCAACTT CTCGCCTGGT ATGCAAAGCA TGGTCGCCCG 
TTGCCGTGGC GAGCCTCACA TGATCCTTAT TCGATCTGGA TCAGTGAGAT CATGCTGCAA
CAGACCACCG TGACCGCTGT GATCCCTTAC TTCGAACGAT TCATGGCGAA ATTCCCCAGT
GTGCAGGCGC TGGCCAGTGC TCCGGAAGAA GAGGTGCTCA AACTGTGGGA GGGGTTAGGT
TACTACTCAA GAGCCCGCAA TCTGCATCAG TCTGCCCGAG TGTTGATGGA AAGATACCAA
GGGGTTTTTC CGCAAAGTGT CGAGCAATTG CTCGAGTTGC CCGGGATTGG TCGATATACG
GCTGGCGCCA TTTCGAGCTT TGCTTTTCGC CTGCCGGCTC CCATTGTCGA AGCCAATACC
CAGCGGTTGT ATGCCCGCAT TCTGGGATAT GATGGCGACT TGAAAAATGC AGCAGGACAA
AAAGCCTTAT GGGGATTCGC AGAATCGATT GTTTCAGGGA AAGAACCCGA TCTGATCAAT
CAGGCCCTCA TGGAACTGGG CTCACTTGTT TGTAAACCCA TCGACCCCTT GTGCGATCAA
TGCCCGGTCC AGCAGCATTG CCGCGCATTT CAGGAAGCAA GGCAAGCCGA GATTCCCCGA
GCACAGGCCA GACCAGTCAT TACACCGCTG GTTGATGCCA CATTGCTGAT CGAGTATCAG
GGGGAGCTAT TTCTCCGGCA ACGCGAGAAG CCTGAGCGAT GGGCCGGATT ATGGGATTTT
CCACGCTATA CGCTTTTTGA TCCCGAGAAT ACCAGCGAAG AGTTTCAGAA AGAAAAAGAC
GTCTCGACAT CAGCACTGGC CTTGTCACTC AAGGCTCGTG TGCAGGAACA ATTGGCGGTA
CATCCAGGCG AAGTCACTGA ATTTTCACGG CTGACTCATG GAGTGACTCG CTATCGCATC
ACTCTGCATG CCTTTGGCTG CGATCTCTCG GATGGAGTGG CCAGCAGACA AAGCAAAGCA
CTCTATGAAC AGCTCAAGTC GCATGGCGGG TGGTTTGGGT GTGAATCGCT CGATTCGCTG
GCCGTGCCCG TGACCACTCG AAAGCTGGTG AAGCAGTGGC AGAAGCTCAA GAACATGATG
CGATGA
 
Protein sequence
MQKTKFQKQL LAWYAKHGRP LPWRASHDPY SIWISEIMLQ QTTVTAVIPY FERFMAKFPS 
VQALASAPEE EVLKLWEGLG YYSRARNLHQ SARVLMERYQ GVFPQSVEQL LELPGIGRYT
AGAISSFAFR LPAPIVEANT QRLYARILGY DGDLKNAAGQ KALWGFAESI VSGKEPDLIN
QALMELGSLV CKPIDPLCDQ CPVQQHCRAF QEARQAEIPR AQARPVITPL VDATLLIEYQ
GELFLRQREK PERWAGLWDF PRYTLFDPEN TSEEFQKEKD VSTSALALSL KARVQEQLAV
HPGEVTEFSR LTHGVTRYRI TLHAFGCDLS DGVASRQSKA LYEQLKSHGG WFGCESLDSL
AVPVTTRKLV KQWQKLKNMM R