Gene Avin_41430 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_41430 
Symbolphr 
ID7763025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4176789 
End bp4178195 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content71% 
IMG OID643806999 
ProductDeoxyribodipyrimidine photolyase 
Protein accessionYP_002801250 
Protein GI226946177 
COG category[L] Replication, recombination and repair 
COG ID[COG0415] Deoxyribodipyrimidine photolyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCCAGT TGATCTGGCT GCGCAGCGAC CTGCGCGTCC GCGACAACCG CGCCCTGGGC 
GCGGCCATGG GCGCCGGGCC GACCCTCGCC CTCTACCTGC TCAGCCCCGC TCAGTGGCGC
ACGCACGACG ACGCCCCATG CAAGGTCGAC TTCCGGCTGC GCAACCTCGC CGAGCTGTCG
CACGGCCTGG CAAGCCTGGG TGTGCCGCTG CTGATCCGCC GGGTCGATAC CTGGGACGAG
GCGCCCGCCC TCCTCACCCG CCTGTGCCAT GAGCAGAATA TCGTCCGGGT GCACGTCAAC
GACGAATACG GGGTCCACGA AAGCCGCCGC GACGCGGCGG TGGAGCGCGC ACTGCAGGCG
CAGGGCGTCG GCCTGCAGCG ACACCTGGAC CAGCTACTGC TCGCCCCGGA CAGCCTGCGG
ACCCGCTCCG GCGGCAGCTT CCGGCTGTTC GCTCAGTTCC GCCGGGCCTG CCACTGGCGC
CTGGCCGTCA GCCTGCCGGC AGTCGGCGGC CTGCCCATGG CACAGCCGCC GCTGAACGTC
GTCGGCAATG CCGTTCCCGC TGCGCTGGAA GGTTTCGCCA CGCCACCGGA ATCCCTGCGC
CGGCTCTGGC CGGCCGGCGA GATAGCCGCG CAGCGGCGCC TGCACGAATT CGTCGAGAAC
CGCCTCGCCG CCTACGCCGA AGCCCGCGAC TTTCCCGCCG AGCCCGGCAC CAGCCGGCTG
TCGCCCTACC TCGCCGCCGG CGTGCTGTCG CCGCGCCAGT GCCTGCATGC GGTGCTGCGC
ACCGGCGGAT TCGACCGTCC GGAGGCCTCC GCCTGGTTCG ACGAACTGCT CTGGCGCGAG
TTCTACAAAC ACATACTGGC AGGCCATCCG CGGGTTTCGA TGGGCCGTGC CTTGCGCACG
GAAACCGAGG CTCTGCCCTG GCGCGACGCT CCGCAAGAGC TGGCGGCCTG GCAGCAGGGG
CGTACCGGCT TTCCGCTGAT CGATGCGGCG ATGCGCCAGT TGCTCGCCAC CGGCTGGATG
CACAACCGCC TGCGCATGGT GGTCGCCATG TTCCTGAGCA AGAACCTGCT GATCGACTGG
CGGCACGGCG AGCGCTGGTT CATGCACCAC CTGATCGACG GCGACCTCGC CGCCAACAAC
GGCGGCTGGC AGTGGTGCGC CTCCACCGGC ACCGACGCGG TGCCCTACTT CCGTCTGTTC
AACCCGCTCG CCCAGTCGCG CCGGTTCGAC CCGGAGGGGC GCTTCATCCG CCAGTGGCTG
CCGGAACTGG CGAGTCTGGA CAACCGCGAC ATCCACGCAC CGGCCGGAGC GCTCCGCCCC
GCCGGCTACC CGCCGCCGAT CGTCGACCTG CCGTCCAGCC GGGAACGCGC CCTGGCCGCC
TTCAAGGCCC TGCGCCGCCG CGGCTGA
 
Protein sequence
MRQLIWLRSD LRVRDNRALG AAMGAGPTLA LYLLSPAQWR THDDAPCKVD FRLRNLAELS 
HGLASLGVPL LIRRVDTWDE APALLTRLCH EQNIVRVHVN DEYGVHESRR DAAVERALQA
QGVGLQRHLD QLLLAPDSLR TRSGGSFRLF AQFRRACHWR LAVSLPAVGG LPMAQPPLNV
VGNAVPAALE GFATPPESLR RLWPAGEIAA QRRLHEFVEN RLAAYAEARD FPAEPGTSRL
SPYLAAGVLS PRQCLHAVLR TGGFDRPEAS AWFDELLWRE FYKHILAGHP RVSMGRALRT
ETEALPWRDA PQELAAWQQG RTGFPLIDAA MRQLLATGWM HNRLRMVVAM FLSKNLLIDW
RHGERWFMHH LIDGDLAANN GGWQWCASTG TDAVPYFRLF NPLAQSRRFD PEGRFIRQWL
PELASLDNRD IHAPAGALRP AGYPPPIVDL PSSRERALAA FKALRRRG