Gene Noca_3301 details


Gene Information

Locus tag: Noca_3301
Symbol:
ID: 4598173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 3506586
End bp: 3508070
Gene Length: 1485 bp
Protein Length: 494 aa
Translation table: 11
GC content: 76%
IMG OID: 639777907
Product: DNA-3-methyladenine glycosylase II / transcriptional regulator Ada / DNA-O6-methylguanine--protein-cysteine S-methyltransferase
Protein accession: YP_924490
Protein GI: 119717525
COG category: [F] Nucleotide transport and metabolism
              [L] Replication, recombination and repair
COG ID: [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase
        [COG2169] Adenosine deaminase
TIGRFAM ID:
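
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a coding sequence that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the terminal stop codon


# Values taken from the record above.
length_bp = gene_length(3506586, 3508070)
print(length_bp, "bp")
print(protein_length(length_bp), "aa")
```

With the coordinates listed for Noca_3301, this reproduces the stated 1485 bp gene length and 494 aa protein length.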


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0977453
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGGCA CCGAGACGCG CGACGATCGG CTGGACCGCG AGTCCTGCTA TCGCGCCGTC 
AAGTCGCGCG ACCGCCGGTT CGACGGCGTC TTCTACACCG CGGTCCGCAC CACGGGGATC
TACTGCCGGC CCTCGTGCCC GGCCCGGACC CCGGCGTACC AGAACGTCAC GTTCCACCCG
AGCGCGGCCT CCGCGCAGGC CGCGGGCTTC CGCGCCTGCA AGCGCTGCCT GCCGGACGCC
ACGCCCGGCA GCCCGGACTG GGACGTCGCG GCCACCGCCG CCGGCCGGGC GATGCGGCTG
ATCGCCGACG GTGTCGTGGA CCGGGAGGGC GTCGACGGGC TGGCCCGCCG GGTCGGCTAC
ACCCCCCGCC ACCTCACCCG GATCCTCACC GCCGAGCTCG GCGCCGGCCC ACTGGCGTTG
GCGCGGGCCA AGCGGGCGCA GACCGCACGC GTGCTCATCG AGACGACCGA GCTGACCTAC
GCAGATGCCG CGTTCGCCTC CGGCTTCTCC AGCGTCCGGC AGTTCAACGA CACGATCCGC
GAGGTGTACG ACGCGTCCCC GACCGACCTG CGCGGGCGGC GCGGCGGCCG CGCCGCCACC
GGCACCGTCA CGATGCGGTT GGCCGTGCGG ACGCCGTACC ACGGGTCGGC GCTGCTCGGC
TTCCTCGCCA CCCGCGCCGT GCCGGGCGTC GAAGCGGCGG GCGCGGACTG GTACGCGCGG
ACGCTCGCCC TGCCGCATGG CACCGGCACC GTCCGGCTGC AGGTGCCCGA CGTCGTCCAG
TCCGGGCTGA CGGCGTTCGC GACCGCCACG TTCGTGCTCG ACGACCTCCG CGACACCGCC
GCCGCCACCG AGCGGGTGCG CCGGCTGCTG GACGCCGACT GCGACCCGGT GGCCGTCGCC
GACGCCTTCA CGGGCGATCC CGTCATCGGG CCGCTCGTGC GGGCCCGGCC GGGCCTGCGG
GTGCCCGGCC ACGTCGACGG CCACGAGATC GCGGTCCGCG CTGTGCTCGG CCAGCAGGTC
AGCGTGGCCC GGGCCCGCAC CCTCGCGGCG CGGCTCGTCG CGCAGCACGG ACGCCCGGTG
ACGCGACCCG ACGGCACGCT CACCCACCTG TTCCCGGACC CCGACGTGTT GGCCGGGCTG
GCGCCCGAGG AGCTGCCGAT GCCGCGCTCC CGCGGCCGGG CGCTGATCGC ACTGTGCCAC
GCCGTCGCCT CCGGGGACAT CGCGCTGGAC CGCGGTCCCG ACCGCGGGGA CGTACGCCGG
GCGCTGCTGG CCATCCCCGG GATCGGGCCG TGGACGGCCG ACTACATCGC GCTCCGGGCG
CTCGGCGATC CGGACGTCTT CCTGCCGACC GACGTCGGCA TCCGCAACGC GCTCACCGGC
CTCGGGCGGG ACCCGTCGAC GGCGGCGGAC CTCGCTCGGC GTTGGTCGCC CTGGCGCTCC
TACGCGCAGG TGTACCTCTG GCAGACCCTC AGCAAGGAGA ACTGA
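
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be flattened before any analysis. The sketch below strips the whitespace and recomputes GC content; it assumes GC content means the fraction of G and C among all bases, which may differ from how the reported 76% figure was rounded or derived. The helper names are hypothetical.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in seq (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Usage with the first displayed line of the gene sequence.
first_line = "ATGACCGGCA CCGAGACGCG CGACGATCGG CTGGACCGCG AGTCCTGCTA TCGCGCCGTC"
seq = clean_sequence(first_line)
print(len(seq), "bases")
print(f"GC fraction of this line: {gc_content(seq):.2f}")
```

Pasting the full gene sequence into `clean_sequence` should yield a string of 1485 bases if the record is internally consistent.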
 
Protein sequence
MTGTETRDDR LDRESCYRAV KSRDRRFDGV FYTAVRTTGI YCRPSCPART PAYQNVTFHP
SAASAQAAGF RACKRCLPDA TPGSPDWDVA ATAAGRAMRL IADGVVDREG VDGLARRVGY
TPRHLTRILT AELGAGPLAL ARAKRAQTAR VLIETTELTY ADAAFASGFS SVRQFNDTIR
EVYDASPTDL RGRRGGRAAT GTVTMRLAVR TPYHGSALLG FLATRAVPGV EAAGADWYAR
TLALPHGTGT VRLQVPDVVQ SGLTAFATAT FVLDDLRDTA AATERVRRLL DADCDPVAVA
DAFTGDPVIG PLVRARPGLR VPGHVDGHEI AVRAVLGQQV SVARARTLAA RLVAQHGRPV
TRPDGTLTHL FPDPDVLAGL APEELPMPRS RGRALIALCH AVASGDIALD RGPDRGDVRR
ALLAIPGIGP WTADYIALRA LGDPDVFLPT DVGIRNALTG LGRDPSTAAD LARRWSPWRS
YAQVYLWQTL SKEN
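
The protein sequence can be checked against the gene sequence by translating it. The sketch below hand-builds the codon table rather than relying on a library; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in its permitted start codons. The function name and the choice to stop at the first stop codon are assumptions made for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); identical for tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence from its first base, stopping at the first stop codon.

    Assumes the sequence starts with ATG; alternative table-11 start codons
    (for example GTG or TTG) would need to be forced to M separately.
    """
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the gene sequence shown above.
print(translate("ATGACCGGCA CCGAGACG"))  # MTGTET
print(translate("AGCAAGGAGA ACTGA"))     # SKEN
```

The two fragments reproduce the first six and last four residues of the listed protein sequence.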