Gene Saro_1229 details


Gene Information

Locus tag: Saro_1229
Symbol:
ID: 3917860
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 1281400
End bp: 1282374
Gene Length: 975 bp
Protein Length: 324 aa
Translation table: 11
GC content: 69%
IMG OID: 640443966
Product: putative deoxyribodipyrimidine photolyase
Protein accession: YP_496508
Protein GI: 87199251
COG category: [R] General function prediction only
COG ID: [COG3380] Predicted NAD/FAD-dependent oxidoreductase
TIGRFAM ID:
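
The gene and protein lengths above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 1281400
end_bp = 1282374

# An inclusive range contains end - start + 1 positions.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop
# codon, so it does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 975 324
```

Both values agree with the Gene Length and Protein Length fields.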


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0596771
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGATGC ATGTTGCAAT CGTTGGCGCG GGCATGGCCG GTCTTTCCTG CGCAAGCCAT 
CTCGTGCGTG CAGGTCACAG GGTCTCGCTC TTCGACAAGG GGCGCGGACC GGGCGGGCGC
ATGTCGACGC GCCGCATGGA AACGCCGCTG GGCGATGCCC ATTTCGACCA TGGCGCGCAG
TACTTCACAG TCCGCGACCC GGCATTCATG GCGCAAGTCG CGCGCTGGTC GGCAAGCGGC
GTGGCCGCGC CATGGCCGGC GGCGGGGACC GGCGCCTGGG TCGGTGTTCC GGGAATGAAC
GCGGTGATCC GCGAAATGGC GGAGCGACAC GATGTCACAT TCGGCTGGCA CGTGCGCGGG
CTGGTCAACA GGAACGGAGG CTGGCTCCTG ACCGGCGACG CATCCGGCGG ACAGCGAGTG
CAGGACGGAC CATTCGACGC GGTCGTGGTC TCGATCCCGC CCGAGCAGGC CGCGGCGATC
GTCGCGCTGC ACGACCTGTC GCTGGCATCG ACGGCACTGG CGGCACGGTC GCAGCCGTGC
TGGACGGGCA TGTACGCCTT TGCCGAACGC TTGCCGACGC GGCGCGATGC GGTGCGGGAA
GCAGGCCTCG TCAGTTGGGC GGCCCGCAAT GGCGCCAAGC CGGGGCGCAC CGGACCGGAA
ACCTGGGTCG TGCAGGCAAC GCCGCAGTGG TCGGCCGACC ATATCGAAGA TTGCGCCGAC
GCGGTGGCTG GCACGCTCCT CTCATCGCTG GGCGAAGCGC TGGGGGTGGA CATTGCTGTC
CCGGTGGTGG CTTCGGCGCA CCGCTGGCGT TATGCCATGT CGACAGGAAG CGACCTCGGG
GCACTGTGGA GCGCGACGTC ACGGATCGGC ATCTGTGGCG ACTGGCTGCT TGGACCGCGC
GTCGAGAACG CATGGCTTTC GGGACGTACG CTGGCCGAGC GAATGCTGGC GAGTGTGCCG
CAGGCAGCAG CCTGA
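
The GC content field can be recomputed from a sequence block laid out like the one above. A minimal sketch, assuming the block is plain text in groups of 10 bases separated by spaces and newlines; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool:

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and newlines used to group bases in tens."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used here only as sample input.
first_line = "ATGCAGATGC ATGTTGCAAT CGTTGGCGCG GGCATGGCCG GTCTTTCCTG CGCAAGCCAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the whole gene sequence block instead of one line gives the figure to compare with the GC content field.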
 
Protein sequence
MQMHVAIVGA GMAGLSCASH LVRAGHRVSL FDKGRGPGGR MSTRRMETPL GDAHFDHGAQ 
YFTVRDPAFM AQVARWSASG VAAPWPAAGT GAWVGVPGMN AVIREMAERH DVTFGWHVRG
LVNRNGGWLL TGDASGGQRV QDGPFDAVVV SIPPEQAAAI VALHDLSLAS TALAARSQPC
WTGMYAFAER LPTRRDAVRE AGLVSWAARN GAKPGRTGPE TWVVQATPQW SADHIEDCAD
AVAGTLLSSL GEALGVDIAV PVVASAHRWR YAMSTGSDLG ALWSATSRIG ICGDWLLGPR
VENAWLSGRT LAERMLASVP QAAA
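
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration, not the database's own pipeline: it builds the codon map from the standard compact amino acid string, relying on the fact that translation table 11 assigns the same amino acids to sense codons as the standard table (the tables differ in permitted start codons). `translate` is a hypothetical helper name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 15 bases of the gene sequence above.
print(translate("ATGCAGATGC ATGTTGCAAT C"))  # MQMHVAI
print(translate("CAGGCAGCAG CCTGA"))         # QAAA
```

The two fragments reproduce the first seven and last four residues of the protein sequence.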