Gene Saro_2394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2394 
Symbol 
ID3916713 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2560701 
End bp2562269 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content67% 
IMG OID640445149 
Productdeoxyribodipyrimidine photolyase-related protein 
Protein accessionYP_497664 
Protein GI87200407 
COG category[R] General function prediction only 
COG ID[COG3046] Uncharacterized protein related to deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCCC CCGCTCCCTC CACCCCCGTC ATCGTCCCCA TCCTCGGCGA CCAGCTTTCA 
CCCCACATCT CGAGCCTTGC CGACCGCAGC CCCGACGACA CCGTGATCCT GATGATGGAA
GTGACGGAGG AGACGACCTA CGTCCGCCAC CACAAGGCCA AGATCGCTAT GATCCTCTCG
GCCATGCGCC ACTTCGCCGA GGAACTGCGC GGGGCCGGGT GGACGGTCGA TTACGTACGG
CTCGACGATC CCGCCAACAC CGGCACCTTC ACTGGCGAGG TAGCCCGCGC GGTGGAGCGC
CACGGCGCGC GCGGGGTGCA GGCCACCGAG CCCGGCGAAT GGCGCGTCAG GCAGGCGATG
GAGCACTGGC GCACCGACCT TCCCGTCCGC GTGCGCATAC TCCCCGACAC CCGTTTCGTC
TGCCCCCTCC CCGACTTCTA CGAATGGGCC GCCGGTCGCA AGGAACTGCG CATGGAGTGG
TTCTACCGCG AGATGCGGCG GAAGACCGGC CTGCTGATGG ACGGCGACAA GCCCGCCGGC
GGCCGCTGGA ACTTCGACGC CGAGAATCGC GGCGGACCGG AAGCCGGCCT CAAGCCTCCC
GCCCCGCCCC GCTTCACGCC TGACAGCATC ACCGGTGAAG TCCTCGATCT CGTCTCGACG
CGCTTTGCCA GACATTTCGG GTCGCTCGAA AACTTCGGCT GGCCCGTCAC CCGCACCGAG
GCCGAAGCCG CGCGCGATGC GTTCCTCGCC GACCGTCTTC CCCGCTTCGG CAAATATCAG
GACGCGATGG TCGCGGGCCA GGACTTCCTG TTCCACGCAG TCCTCTCGCC TGCCATCAAC
ATCGGCCTGT TAGACCCGCT CGACCTCTGC CGCCGCGCAG AGACCGAATG GCGCGAGGGC
CGCGCGCCGC TCGAGGCGGT GGAAGGCTTC ACCCGCCAGA TCATCGGCTG GCGCGAATAC
GTGCGCGGCA TGTACTGGCT CGAGATGCCA GCACTCGCGG ATGCCAACGG CCTGGACGCG
CACCGACCCC TGCCCGACTT CTACTGGACC GGCGATACGC CGATGCGCTG CCTCGCCGAT
TGCGTGCGGA CCACGCGCGA CAATGCCTAT GCCCACCACA TCCAGCGCCT GATGGTGCTG
GGCAACTTCG CGCTACTGGC AGGCCTCAGG CCGCAGGACG TCGCGGACTG GTATCTCGTC
GTCTATGCCG ACGCCTTCGA ATGGGTCGAA CTGCCCAACG TCGCGGGGAT GGTGCTCCAT
GCCGACAAGG GCCGCCTCGC CTCCAAGCCC TACGCCGCGA GCGGGGCCTA CATCGACAGG
ATGGGCGACT ACTGCGGCAA ATGCGCGTTC GATGTGAAGC GGAAGACCGG CGAAGGCGCC
TGCCCGTTCA ACGCGCTCTA CTGGCACTTC CTCGCCCGCA ACGAGAAGAA GCTTGCAGGC
TACCACCGCC TCGCCCAACC TTACGCCACC TGGCGGCGAA TGAGCGACGA AAAGCGCGCG
GAATATCTCC TCAGCGCCGA GGCCTTCCTC CGGACGCTCG ATCCCGCAAA GCCCGGATGG
GCGCGCTAG
 
Protein sequence
MSAPAPSTPV IVPILGDQLS PHISSLADRS PDDTVILMME VTEETTYVRH HKAKIAMILS 
AMRHFAEELR GAGWTVDYVR LDDPANTGTF TGEVARAVER HGARGVQATE PGEWRVRQAM
EHWRTDLPVR VRILPDTRFV CPLPDFYEWA AGRKELRMEW FYREMRRKTG LLMDGDKPAG
GRWNFDAENR GGPEAGLKPP APPRFTPDSI TGEVLDLVST RFARHFGSLE NFGWPVTRTE
AEAARDAFLA DRLPRFGKYQ DAMVAGQDFL FHAVLSPAIN IGLLDPLDLC RRAETEWREG
RAPLEAVEGF TRQIIGWREY VRGMYWLEMP ALADANGLDA HRPLPDFYWT GDTPMRCLAD
CVRTTRDNAY AHHIQRLMVL GNFALLAGLR PQDVADWYLV VYADAFEWVE LPNVAGMVLH
ADKGRLASKP YAASGAYIDR MGDYCGKCAF DVKRKTGEGA CPFNALYWHF LARNEKKLAG
YHRLAQPYAT WRRMSDEKRA EYLLSAEAFL RTLDPAKPGW AR