Gene Avin_31250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_31250 
Symbol 
ID7762025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3230214 
End bp3231476 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content66% 
IMG OID643806000 
Productprotocatechuate 4,5-dioxygenase 
Protein accessionYP_002800264 
Protein GI226945191 
COG category[S] Function unknown 
COG ID[COG3384] Uncharacterized conserved protein 
TIGRFAM ID[TIGR02792] protocatechuate 4,5-dioxygenase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACGAA TCATTGGCGG CCTCGCCGTC TCCCACACAC CGACCATCGG CTTCGCCGTC 
GACAACGACA AGCAACACGA CGAAGCCTGG GCGCCGATCT TCAAGAGTTT CGAGCCGGTC
TCGGCATGGC TGCGGGAAAA GCGGCCGGAC GTCCTGTTCT ACATCTTCAA CGACCATGTG
ACCTCGTTCT TCTTCAATCA CTACGCGGCC TTCAACCTGG GCGTGGACGA GCGCTACGAA
CCCGCCGACG AAGGGGGCGG CCCGCGCGCC CTGCCGGCCG TCGAAGGCCA TGCCGAGCTG
GCCCGGCACA TCGGCGCAAG CCTGATGGCC GACGAGTTCG ACATGGCGTT CTTCCGCGAC
AAGCCCCTGG ACCACGGGCT GTTCTCGCCG ATGTCGGCGA TCCTGCCGCC CGATGCCCGC
TCCGGATGGC CGGTGAAGAT CGTTCCGCTG CAGGTCGGCG TGCTGCAGTT CCCGATTCCC
AGCGCCGCCC GCTGCTACAA GCTGGGCCAG GCACTGCGCC GGGCCATCGA GAGCTACCCC
GAGGATCTGA AGGTGGCGAT CGTGTCGACC GGCGGCCTCT CGCATCAGGT CCACGGCGAG
CGGTGCGGCT TCAACGACCC GCAATGGGAC GCCCAGTTCG TCGATCTGCT GGTCAACGAT
CCGGTGCGCC TGACCGAGCT GACCGTCGCC GAATACGCAG CCCTCGGCGG CGTGGAAGGT
GCCGAGGTGA TCATGTGGCT GATCATGCGC GGCGCCCTGT CCGCCACGGT GAAGAAGGTG
CACCAGGATT ACTACCTGCC GTCGATGACC GGGATCGCCA CCCTGATCCT GGAGAATCGG
GACCGCGAAG TGCCGGTGGA CCTGCATGAG CGCCACCGTC GGCACATGGA CCATCAACTG
GCCGGAGCGG ACCGGCTCGA AGGCACCTAC CCGTTCGACC TGGCGCGCAG CGCCAAGGGC
TACCGGCTGA ACAAGTTCCT GCACGGGCTG ATCTCGCCCG CCTTCCGCGA GCGCTTCAAG
GAAGAGCCGG AAACCCTGTT CGAAGAACAC CGGCTCAGCG AGCAGGAGCG CGACATGCTC
CGCCGCCTCG ACTGGCGCGC CCTGATCCAG TACGGGGCGA GCTTCTTCGT GCTGGAAAAG
CTCGGCGCGG TCGTCGGCGT CTCCAACCTG CACATCTATT CGGCCATGCG CGGCCAGTCG
CTCGAGGAGT TCCAGAAGAC CCGCAACCGG CAGGTCCTCT ACTCGGTGGC CGGCAAACGC
TGA
 
Protein sequence
MARIIGGLAV SHTPTIGFAV DNDKQHDEAW APIFKSFEPV SAWLREKRPD VLFYIFNDHV 
TSFFFNHYAA FNLGVDERYE PADEGGGPRA LPAVEGHAEL ARHIGASLMA DEFDMAFFRD
KPLDHGLFSP MSAILPPDAR SGWPVKIVPL QVGVLQFPIP SAARCYKLGQ ALRRAIESYP
EDLKVAIVST GGLSHQVHGE RCGFNDPQWD AQFVDLLVND PVRLTELTVA EYAALGGVEG
AEVIMWLIMR GALSATVKKV HQDYYLPSMT GIATLILENR DREVPVDLHE RHRRHMDHQL
AGADRLEGTY PFDLARSAKG YRLNKFLHGL ISPAFRERFK EEPETLFEEH RLSEQERDML
RRLDWRALIQ YGASFFVLEK LGAVVGVSNL HIYSAMRGQS LEEFQKTRNR QVLYSVAGKR