Gene GSU2829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU2829 
Symbol 
ID2686850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3112696 
End bp3114081 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content59% 
IMG OID637127518 
Productdeoxyribodipyrimidine photolyase, putative 
Protein accessionNP_953872 
Protein GI39997921 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID[TIGR00591] photolyase PhrII 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.242896 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTGCG GACGGATTCG TTCCCTGCTT CAGGGGGGGG AGGCCACGGC CGGGCCGGTG 
ATATACTGGA TGAGCAGGGA CCAGCGCGTT GCCGACAACT GGGCTTTAAT CCACGCCCAG
AAGCTCGCAC TGGCCCGTAG CGCACCGCTT GGGGTGCTCT TCTGTCTTGC CCCACGTTTT
CTCGGCGCGA CCGCACGCCA GTATCGGTTC ATGCTCAAAG GGCTGGAGCA GGTTCGGGCC
GCGCTGAATC GGCTTGATAT TCCCTTCTTT CTCGTGACCG GTGATCCCAA GGGGGCGGTC
GCGGCCTTCA CGAGGCGGCA CAGGGTTTCG TATCTGGTTA CCGATTTTGA TCCGCTTCGT
GTCAAACGGG AGTGGAAACG GCAGGTGGCA GGGGAGATAT CAATCCCGTT CGACGAGGTG
GATGCCCATA ATATAGTCCC CTGCTGGATC ACATCACAGC GTCAGGAGTG GGGGGCATAC
ACCATCCGCC CAAAGATACA CCGGCTGCTT CCCGATTTCA TGGAGCCGTT TCCGCCTCTG
CAACGTCACC CGTTTCCGTG GCAGGGAGCG CTGCCTTCAG ACGCCGAGTG GCGTGAGACT
TTTACGGGGA TGACCTTGGA CGAATCGGTG CCCGAGGTCA GCTGGCTCGC GTCGGGAGAA
GAGGCAGCGC AGGCCGCTTT GGCCAGATTT CTTGAAGACG GTCTGGCGGG CTACGCAACC
CGGCGCAATA ATCCTGCAGT AATGGGACAG TCGGGATTAT CCCCCTGGCT CCATTTCGGC
CAGCTTTCCG CCCAGAGGGT CGCGCAGGCA GCGTTTGCTG CCGCCGCGCC GATAGAATCG
CGTGATGCCT TTCTTGAAGA ATTGATCGTA CGTCGGGAGC TTGCCGACAA TTTTTGCTAT
TACAACGATG CCTACGACCG CTTCGACGGT TTTCCCGAGT GGGCGCAAAG AACCCTCAAC
CGGCATCGGC ACGATCCTCG CCCCCAGTGC TATGAGCATG ACGTGCTGGA GCAGGGACAG
ACCCACGATT CTCTCTGGAA TGCAGCACAA CTGGAAATGG TACGCTGGGG CAGGATGCAC
GGCTACCTGA GAATGTACTG GGCAAAGAAA CTGCTCGAGT GGACCTCTTC GCCCGAAGAT
GCCCTCATGA TTGCCATTCA ACTCAACGAC CGCTATCAGC TCGACGGCAG GGACCCCAAC
GGATACGCTG GCATTGCCTG GAGCATCGGC GGTGTCCATG ATCGTCCCTG GGCAGAGAGA
CCCGTCTTTG GCACGATTCG CTTCATGAGC CGCGACGGCT GCCGGAGAAA GTTCGATACA
GATGCCTACG AACGCCGGGT GATTATTAGT CCTGCCACAT GTGCGGGAAT AGCTCTGTGT
AAATAA
 
Protein sequence
MNCGRIRSLL QGGEATAGPV IYWMSRDQRV ADNWALIHAQ KLALARSAPL GVLFCLAPRF 
LGATARQYRF MLKGLEQVRA ALNRLDIPFF LVTGDPKGAV AAFTRRHRVS YLVTDFDPLR
VKREWKRQVA GEISIPFDEV DAHNIVPCWI TSQRQEWGAY TIRPKIHRLL PDFMEPFPPL
QRHPFPWQGA LPSDAEWRET FTGMTLDESV PEVSWLASGE EAAQAALARF LEDGLAGYAT
RRNNPAVMGQ SGLSPWLHFG QLSAQRVAQA AFAAAAPIES RDAFLEELIV RRELADNFCY
YNDAYDRFDG FPEWAQRTLN RHRHDPRPQC YEHDVLEQGQ THDSLWNAAQ LEMVRWGRMH
GYLRMYWAKK LLEWTSSPED ALMIAIQLND RYQLDGRDPN GYAGIAWSIG GVHDRPWAER
PVFGTIRFMS RDGCRRKFDT DAYERRVIIS PATCAGIALC K