Gene Avin_21740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_21740 
Symbol 
ID7761094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp2172716 
End bp2174044 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content67% 
IMG OID643805062 
Productxenobiotic compound monooxygenase, DszA family, A subunit 
Protein accessionYP_002799343 
Protein GI226944270 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.77772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAAT CCTCTCGTCA ACTCAGTCTC GGCGCCTTCC TCATGGCCAC CGGGCATCAT 
GTGGCCGCCT GGCGGCATCC GGATGTGCCG GCCGATCCCC TGGATTTTTC CGTTTACCGG
CGTGCCGCGC AGGTCGCCGA AGCGGCCTGC TTCGATGCGC TGTTCGTCGC CGACAGCGTC
GCCGTCTTCG ACGACAAGGT TGCCAGCCAT ACCTCGCGCT CGACCTATTT CGAGCCCCTC
ACCCTGCTCT CGGCGCTCGC TGCGATCACC GAGCGGATCG GTCTGGTCGG CACGGCGACC
ACCACCTATA ACGAGCCCTA CCACGTGGCG CGCAAGTTCG CCTCGCTGGA TCACCTGTCG
GGCGGGCGCT GCGGCTGGAA CCTGGTGACC TCCGACGCCG CCGCCGAAGC ACAGAACTTC
GGCCGCGACG AACATCTCGG GCATGCCGAA CGCTATGTCC GGGCACACGA GTTCCACCGC
GTGGTCACCG GCCTGTGGGA CAGTTGGGCG GATGACGCCT TCCTGCGCGA CAAGGCCAGC
GGGCGCTTCT ACGACCCGGA CAAGCTGCAT GTGCTTGATC ACCGGGGCGA GCATTTCCAG
GTGCGCGGTC CGCTCAATGT GGCGCGTTCG CCTCAAGGAC GCCCGGTGGT CGTGCAGGCC
GGCTCCTCCG AAGCCGGCCG CGAGCTGGCC GCCGAGACCG CCGAGATGGT GTTCACCGCA
CAGGCTTCGC TGGAATCGGC GAAAACCTTT TACGCCGACC TCAAGGGACG GCTTGGCAGG
TTCGGCCGCG ACGAGAACGC GCTGCGCATC ATGCCTGGCG TCTTCGTGGT CGCCGGGCAG
AGCGAGAGCG AGGCCAAAGA GAAGTTCGAG GCCTTCCAGG AGTTGGTCGA GCCGCAGGTG
GGCGTCGCCC TGCTCGGGCG CATGCTGGGT AACTTCGACC TGTCCGAATA CCCGCTGGAC
GGCCCGCTGC CGGAACTGCC GCTCACCGAC AGCGGCCAGC GCAGCCGCCA GCAGCTTTTG
ACCGAACTGG CGGGCGCAGA AAACCTGACT CTGGCCCAGT TGGGCCGGCG CATCGCCGGC
GGGCGCGGCC ACTACAGCCT GATCGGCACG CCTCGGCAGA TCGCCGACGA GCTACAGGCC
TGGTTCGAGG AGCGCGCTGC CGACGGTTTC AATGTGCTGG TGCCGCACCT GCCGGGCGGT
CTGGAAGACT TCGCCGCACT GGTGGTACCG GAACTGCAGC GGCGCGGCCT GTTCCGGCGC
GAATACCGGG GCCGCACTCT GCGCGAACAC CTCGGCTTGG CGCGCCCGAA AAGCCCCTTC
TTCGCCTGA
 
Protein sequence
MSQSSRQLSL GAFLMATGHH VAAWRHPDVP ADPLDFSVYR RAAQVAEAAC FDALFVADSV 
AVFDDKVASH TSRSTYFEPL TLLSALAAIT ERIGLVGTAT TTYNEPYHVA RKFASLDHLS
GGRCGWNLVT SDAAAEAQNF GRDEHLGHAE RYVRAHEFHR VVTGLWDSWA DDAFLRDKAS
GRFYDPDKLH VLDHRGEHFQ VRGPLNVARS PQGRPVVVQA GSSEAGRELA AETAEMVFTA
QASLESAKTF YADLKGRLGR FGRDENALRI MPGVFVVAGQ SESEAKEKFE AFQELVEPQV
GVALLGRMLG NFDLSEYPLD GPLPELPLTD SGQRSRQQLL TELAGAENLT LAQLGRRIAG
GRGHYSLIGT PRQIADELQA WFEERAADGF NVLVPHLPGG LEDFAALVVP ELQRRGLFRR
EYRGRTLREH LGLARPKSPF FA