Gene Information |
Locus tag | Avin_41430 |
Symbol | phr |
ID | 7763025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 4176789 |
End bp | 4178195 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 643806999 |
Product | Deoxyribodipyrimidine photolyase |
Protein accession | YP_002801250 |
Protein GI | 226946177 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0415] Deoxyribodipyrimidine photolyase |
TIGRFAM ID | |
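
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4176789
END_BP = 4178195


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Protein length in aa for a complete, unspliced CDS.

    Assumes the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the results agree with the Gene Length (1407 bp) and Protein Length (468 aa) fields listed in the record.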
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGCGCCAGT TGATCTGGCT GCGCAGCGAC CTGCGCGTCC GCGACAACCG CGCCCTGGGC GCGGCCATGG GCGCCGGGCC GACCCTCGCC CTCTACCTGC TCAGCCCCGC TCAGTGGCGC ACGCACGACG ACGCCCCATG CAAGGTCGAC TTCCGGCTGC GCAACCTCGC CGAGCTGTCG CACGGCCTGG CAAGCCTGGG TGTGCCGCTG CTGATCCGCC GGGTCGATAC CTGGGACGAG GCGCCCGCCC TCCTCACCCG CCTGTGCCAT GAGCAGAATA TCGTCCGGGT GCACGTCAAC GACGAATACG GGGTCCACGA AAGCCGCCGC GACGCGGCGG TGGAGCGCGC ACTGCAGGCG CAGGGCGTCG GCCTGCAGCG ACACCTGGAC CAGCTACTGC TCGCCCCGGA CAGCCTGCGG ACCCGCTCCG GCGGCAGCTT CCGGCTGTTC GCTCAGTTCC GCCGGGCCTG CCACTGGCGC CTGGCCGTCA GCCTGCCGGC AGTCGGCGGC CTGCCCATGG CACAGCCGCC GCTGAACGTC GTCGGCAATG CCGTTCCCGC TGCGCTGGAA GGTTTCGCCA CGCCACCGGA ATCCCTGCGC CGGCTCTGGC CGGCCGGCGA GATAGCCGCG CAGCGGCGCC TGCACGAATT CGTCGAGAAC CGCCTCGCCG CCTACGCCGA AGCCCGCGAC TTTCCCGCCG AGCCCGGCAC CAGCCGGCTG TCGCCCTACC TCGCCGCCGG CGTGCTGTCG CCGCGCCAGT GCCTGCATGC GGTGCTGCGC ACCGGCGGAT TCGACCGTCC GGAGGCCTCC GCCTGGTTCG ACGAACTGCT CTGGCGCGAG TTCTACAAAC ACATACTGGC AGGCCATCCG CGGGTTTCGA TGGGCCGTGC CTTGCGCACG GAAACCGAGG CTCTGCCCTG GCGCGACGCT CCGCAAGAGC TGGCGGCCTG GCAGCAGGGG CGTACCGGCT TTCCGCTGAT CGATGCGGCG ATGCGCCAGT TGCTCGCCAC CGGCTGGATG CACAACCGCC TGCGCATGGT GGTCGCCATG TTCCTGAGCA AGAACCTGCT GATCGACTGG CGGCACGGCG AGCGCTGGTT CATGCACCAC CTGATCGACG GCGACCTCGC CGCCAACAAC GGCGGCTGGC AGTGGTGCGC CTCCACCGGC ACCGACGCGG TGCCCTACTT CCGTCTGTTC AACCCGCTCG CCCAGTCGCG CCGGTTCGAC CCGGAGGGGC GCTTCATCCG CCAGTGGCTG CCGGAACTGG CGAGTCTGGA CAACCGCGAC ATCCACGCAC CGGCCGGAGC GCTCCGCCCC GCCGGCTACC CGCCGCCGAT CGTCGACCTG CCGTCCAGCC GGGAACGCGC CCTGGCCGCC TTCAAGGCCC TGCGCCGCCG CGGCTGA
Protein sequence | MRQLIWLRSD LRVRDNRALG AAMGAGPTLA LYLLSPAQWR THDDAPCKVD FRLRNLAELS HGLASLGVPL LIRRVDTWDE APALLTRLCH EQNIVRVHVN DEYGVHESRR DAAVERALQA QGVGLQRHLD QLLLAPDSLR TRSGGSFRLF AQFRRACHWR LAVSLPAVGG LPMAQPPLNV VGNAVPAALE GFATPPESLR RLWPAGEIAA QRRLHEFVEN RLAAYAEARD FPAEPGTSRL SPYLAAGVLS PRQCLHAVLR TGGFDRPEAS AWFDELLWRE FYKHILAGHP RVSMGRALRT ETEALPWRDA PQELAAWQQG RTGFPLIDAA MRQLLATGWM HNRLRMVVAM FLSKNLLIDW RHGERWFMHH LIDGDLAANN GGWQWCASTG TDAVPYFRLF NPLAQSRRFD PEGRFIRQWL PELASLDNRD IHAPAGALRP AGYPPPIVDL PSSRERALAA FKALRRRG
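
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a record could be checked, the code below strips the spacing, computes the GC fraction, and translates the DNA codon by codon. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon-to-amino-acid mapping used here is the standard one, which translation table 11 shares for elongation codons; alternative start codons are not handled.

```python
from itertools import product

# Standard codon-to-amino-acid mapping, in TCAG order.
# Table 11 uses the same mapping for elongation codons
# (it differs only in which codons may act as starts).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
first_blocks = "ATGCGCCAGT TGATCTGGCT GCGCAGCGAC"
print(translate(first_blocks))  # MRQLIWLRSD
```

The ten residues produced from the first 30 bases match the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein sequence and GC content fields to be compared in the same way.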