Gene Saro_2039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2039 
Symbol 
ID3917686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2175675 
End bp2177105 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content65% 
IMG OID640444791 
Producturacil-DNA glycosylase superfamily protein 
Protein accessionYP_497312 
Protein GI87200055 
COG category[L] Replication, recombination and repair 
COG ID[COG1573] Uracil-DNA glycosylase 
TIGRFAM ID[TIGR00758] uracil-DNA glycosylase, family 4 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCCGG ACAGGCGCCG GTTTGAGGCC TTCCGCGTAC AATTCGCCGC GTCGGACGAT 
TTCGAAGGCT GGCGTGACGC CGCCCGCCGC ATGATTCGCG CGAAGATTGC CCCAGACCAG
GTGATGTGGG AATCTCCCGC CGACCAGTCT GCCGATATCT TCGCCCGAGG CGGTGTCGCC
CTGCCATCGC CGCCGACGGA CGCTCCCCAA CCGCGCGCGT CGAAGGACTT CCTTCAACTT
GCGCAAAGCG TCATCCTTCA TTCTGGCAGT AAACGGTTTT CTACTCTTTA TCGTACGCTT
TGGCGGCTCC AGTCTCGGCC CCGGCTGATG GACGACAAGG CCGATGCCGA CGTGCGGGCG
ATGGAGGACC TTGCCCGGCA GGTGCGGCGC GACATCCACA AGATGCGCGC CTTCGTCCGC
TTCCGCAGCG TCGAGGGCGA GGCGGGAGAG CGATATGTTG CCTGGTTCGA GCCCGAGCAT
CACATCCTGC GCGCGAATGC GGGCTTCTTC GTCCGTCGCT TCACCACCAT GCAGTGGTCG
ATCCTAACCC CGCGCGGTAG CCTGCACTGG GATGGCGAGA CGCTGCACGA GGGGCCGCCG
GCCACCCGCG CCGATGCGCC TTCCGGCGAT CCGGTCGAAG GGCTGTGGCG TACCTACTAT
GCATCGATCT TCAATCCCGC GCGTTTGAAG GTCGGGGCCA TGCTCAAGGA AATGCCCCGC
AAATACTGGA AGAACATGCC GGAGGCGGCG CTCATTCCCG AATTGATCGC AGGGGCACAA
TCACGCGAGG CACGAATGGT ACAGGCTGGC GAGCAGGATC TCGGTGAGAC GCCAGTGAGC
ATCGATGCCA TCGGCGCGGC CATCCTGGCC TGCCGTCGGT GCGACATCGG CTGCAATGGC
ACCCGGGCGG TCATGGGCGA GGGACCGCAC GACGCGGCCC TGATGATCAT CGGCGAGCAG
CCCGGCGAAC AGGAAGAGGC GCAGGGCCGG CCTTTTGTCG GTCCTGCCGG TCAACTGCTG
CGCACCCATC TCGAACATGC CGGCATTCCG GCGGAGCGCG CTTACGTCAC CAATGCGGTC
AAGCACTTCA AGTTCATGCC GCAGGGAAAG CGTCGCCTGC ACCAGAACCC GTCAGCCAGG
GAAATCGACG TGTGCCGCTG GTGGCTCGAG GGTGAACGCG GGCTGGTTCG CCCGCGCCTG
ATCCTCGCGC TGGGGGCAAG TGCGGCGCGC AGCCTGCTGG GCAGGACTGT CAGCGTCCAG
AAGGTGCGGG GTGCACCGCA TGTGCTGGAC GATGGCAGCG AACTGTGGAT CACCACCCAC
CCCAGCTACC TCCTGCGCCT GGACGACGGC GGGCGTTCGG AAGAAGAAGC CAGATTTTCA
AATGACTTGC AGAAGGTAGC TGCGCGGCTT TCGCAGATTT CGTCCGGCTA G
 
Protein sequence
MMPDRRRFEA FRVQFAASDD FEGWRDAARR MIRAKIAPDQ VMWESPADQS ADIFARGGVA 
LPSPPTDAPQ PRASKDFLQL AQSVILHSGS KRFSTLYRTL WRLQSRPRLM DDKADADVRA
MEDLARQVRR DIHKMRAFVR FRSVEGEAGE RYVAWFEPEH HILRANAGFF VRRFTTMQWS
ILTPRGSLHW DGETLHEGPP ATRADAPSGD PVEGLWRTYY ASIFNPARLK VGAMLKEMPR
KYWKNMPEAA LIPELIAGAQ SREARMVQAG EQDLGETPVS IDAIGAAILA CRRCDIGCNG
TRAVMGEGPH DAALMIIGEQ PGEQEEAQGR PFVGPAGQLL RTHLEHAGIP AERAYVTNAV
KHFKFMPQGK RRLHQNPSAR EIDVCRWWLE GERGLVRPRL ILALGASAAR SLLGRTVSVQ
KVRGAPHVLD DGSELWITTH PSYLLRLDDG GRSEEEARFS NDLQKVAARL SQISSG