Gene Saro_1759 details

Gene Information

Locus tag: Saro_1759
Symbol:
ID: 3916334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1854600
End bp: 1855964
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 65%
IMG OID: 640444500
Product: deoxyribodipyrimidine photo-lyase type I
Protein accession: YP_497033
Protein GI: 87199776
COG category: [L] Replication, recombination and repair
COG ID: [COG0415] Deoxyribodipyrimidine photolyase
TIGRFAM ID:
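
The start and end coordinates, gene length, and protein length in this record agree with one another, and a short sketch can make that arithmetic explicit. The function names below are hypothetical, and the check assumes 1-based inclusive coordinates and a CDS that ends in a stop codon; this page does not state either convention.

```python
# Hypothetical helpers for sanity-checking the record's length fields.
# Assumption: coordinates are 1-based and inclusive on both ends.


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon if present."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length(1854600, 1855964)
print(length, protein_length(length))  # prints: 1365 454
```

Both values match the Gene Length and Protein Length fields of this record.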


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0257219
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGACC CCGTAATCGT CTGGTTCCGA CGCGACCTCC GCCTTGCAGA TCAAGCCGCC 
CTCCTGGCTG CCGCAGCCGA GGGGCCGGTC ATTCCCGTCT ACATCCTCGA CGACGATACC
CCGCGCCACT GCCGCATGGG TGGAGCGTCG CGCTGGTGGC TCCACCACAG CTTGGCCGAG
CTTGATGCCT CGCTGCGCAA GCTTGGATCG CGCCTGATCC TGCGTCGGGG CAAGTGCCAC
GAGGAGCTTG CCGCCGTTCG CCGCGAGGCC GGGGCGAGGG TCGTTCACGC ATTGCAGCAC
TACGAGCCTT GGTGGCGCAA TGCCGAAAAG GCGGTGGCCA AGACAATCCG CCTGGTCCTT
CACGAAGGCA ACTATCTGGC CCCCGCCGGT TCCGTCCGCA CCGGGATGGG CCGGCCCTAC
AAGATCTACA CGCCCTTCTG GCATGCCCTG CTCGAGCGCA TGCCCCCTGC GCCGCCCTTG
CCGGCACCCG AACGGCTCGA AGCGCCCGCC GTCTGGCCCA CGAGCGATGC GCTGGACGAC
TGGAAGCTGC TTCCGACGGC ACCCGACTGG GCAGGTGGGT TCCGCGAGAC ATGGACTCCG
GGCGAGAACG GTGCGGAAGA GCGGCTTCAG GACTTTGCCG ACCGTGTGGC CCGCTACGGA
GAAACGCGGA ACCTGCCTTC GATCGAAGGC ACCTCGCGCC TTTCGCCCCA CCTTCATTTC
GGCGAGATCT CTCCAGCCAC GATCTGGCAT CGGGTGGTGA ATGCGGGCGG TTCGGTCGAC
GTGTTTCTGG GTGAACTGGG TTGGCGCGAC TATGCCCAGA ACGTAATCGT GCAATTCCCG
GACTACGGTT CGAGGAATGC GCGCGAGGCG TACGACCGGC TCGAATGGCG CGACGATCCC
GAGGCATTGC GCGCCTGGCA ACAGGGCCGG ACCGGCTATC CCATCGTCGA TGCCGGCATG
CGCGAGCTCT GGCACACCGG GTGGATGCAC AACCGGGTGC GGATGATCGC CGCCAGCTTC
CTTGTGAAAC ACCTTCTGAT CGACTGGCGC GAGGGCGAGC GGTGGTTCTG GGACACGCTG
GTCGATGCCG ACTATGCATC CAACGCGGTC AACTGGCAGT GGACAGCCGG TACGGGCGTC
GATTCCAACA TGTTCGTCAG GATCATGGCG CCGCTGACCC AGTCACCGAA GTTCGACGCG
GCAGGCTATA TCCGGCAATG GGTGCCCGAA CTGGCTCATC TTTCCGACCG CGATATTCAC
GATCCCGCCG TTCCGCCGCG CGGCTATCCG GCAAAGATCG TCAGCCATTC CGAGGCGCGG
GCACGTGCCC TTGCCGCGCA TGACAGGATG AAGCAGCCGG CGTGA
 
Protein sequence
MSDPVIVWFR RDLRLADQAA LLAAAAEGPV IPVYILDDDT PRHCRMGGAS RWWLHHSLAE 
LDASLRKLGS RLILRRGKCH EELAAVRREA GARVVHALQH YEPWWRNAEK AVAKTIRLVL
HEGNYLAPAG SVRTGMGRPY KIYTPFWHAL LERMPPAPPL PAPERLEAPA VWPTSDALDD
WKLLPTAPDW AGGFRETWTP GENGAEERLQ DFADRVARYG ETRNLPSIEG TSRLSPHLHF
GEISPATIWH RVVNAGGSVD VFLGELGWRD YAQNVIVQFP DYGSRNAREA YDRLEWRDDP
EALRAWQQGR TGYPIVDAGM RELWHTGWMH NRVRMIAASF LVKHLLIDWR EGERWFWDTL
VDADYASNAV NWQWTAGTGV DSNMFVRIMA PLTQSPKFDA AGYIRQWVPE LAHLSDRDIH
DPAVPPRGYP AKIVSHSEAR ARALAAHDRM KQPA
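
The gene and protein sequences above are printed in groups of 10 characters, so any check has to strip that whitespace first. The following is a minimal sketch, not the method used to build this record, of how the protein sequence and GC content could be recomputed from the gene sequence. The helper names are hypothetical. The codon table is written out in the NCBI ordering (bases T, C, A, G); translation table 11 assigns the same amino acids as the standard code and differs only in which codons may act as starts. The sketch does not handle alternative start codons, which is acceptable here because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI ordering (T, C, A, G at each position).
# Assumption: amino-acid assignments of table 11 equal those of the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the grouping spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence as listed in this record.
first_line = "ATGAGCGACC CCGTAATCGT CTGGTTCCGA CGCGACCTCC GCCTTGCAGA TCAAGCCGCC"
print(translate(first_line))  # prints: MSDPVIVWFRRDLRLADQAA
```

The printed residues match the first 20 residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.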