Gene Avin_21340 details

Gene Information

Locus tag: Avin_21340
Symbol:
ID: 7761059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2135356
End bp: 2136708
Gene Length: 1353 bp
Protein Length: 450 aa
Translation table: 11
GC content: 69%
IMG OID: 643805029
Product: xenobiotic compound monooxygenase, DszA family, A subunit
Protein accession: YP_002799310
Protein GI: 226944237
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID:
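
The coordinate and length fields in this record are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `feature_lengths` is illustrative and not part of any database API.

```python
# Values copied from the record; the helper name is hypothetical.
START_BP = 2135356
END_BP = 2136708


def feature_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa).

    Assumes inclusive 1-based coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1  # inclusive range
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


print(feature_lengths(START_BP, END_BP))  # (1353, 450)
```

Under these assumptions the result agrees with the listed Gene Length (1353 bp) and Protein Length (450 aa).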


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.341615
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGCTGG GCGCCTTTCT CTATCCCACC GGACACCATG CGGCCGCCTG GCGCCATCCC 
CAGGCACAGG CGGACGCCGG CATCAATTTC GAGCACTACG TCGGCCTGGC GCAGACCGCC
GAGCGGGCCG GCTTCGACCT GCTGTTCCTC GCCGACAGCG CCGCCGCGCG CGGCAAGGAC
TGGGTCGCCC TGTCGCGTTT CGCCACCCAT TACGTGGCGC AGTTCGAACC CCTGACGCTG
ATCTCGGCGC TCGCCGCGGT GACCCGGCGG ATCGGCCTGG TGGCCACCGC CTCGACCACC
TACAACGATC CCTACAGCCT GGCGCGCAAG CTCGCCTCCA TCGACCACAT CAGCCACGGT
CGGGTGGGCT GGAACCTGGT CACCTCGGGC AACGAGGAGG AAGCCTTCAA CTTCGGCGCC
ACGGCCTATC CGCCGCATGC CGAGCGTTAC CGCCGGGCCC GCGAGTTCGC CCAGGTGCTC
AAGGGCCTGT GGGACAGTTG GGACGACGAT GCCTTCCCGC GCGACAAGGC CAGCGGTCTG
TTCCTGGACG TCGACAAGAT GCATGTGCTC GGTCACCAGG GCGAGCATTT CAGCGTGCGC
GGCCCGCTGA ACATCCCGCG TTCGCCCCAG GGCTGGCCGG TGCTGGTGCA GGCCGGTTCG
TCCGAGGCCG GCAAGGAGCT GGCCGCGGAA ACCGCCGAGG TGGTGTTCAC CGCCCAGCGC
ACGCTGGCCG ACGCGCAGGC TTTCTATGCG GACGTGAAAG GACGCATGGC GCGTTACGGG
CGCCAGCCCT CGGCGCTGAA GATCATGCCG GGGATCTTCC CGGTGGTGGC CGCCACGGAG
GCCGAGGCCC AGGCCAAGTT CGAGGCGCTA CAGGACTTGA TCCATCCCTC GGTGGCGCTG
GCCATCCTCG AACACCGCCT GGGCGTGCCG CTGGCGCACC TGCCCCAGGA CGGGCCGCTG
CCGGAAATCG CCGAGGTCGA ATCGACCCGC AGTCGCCGCG CCTTGCTGCT GGAGGTGGCC
AGGCGCGAGG GCCTGAGCAT TCGCCAACTG GCGCTGCGGG TGGCCGGCGC GCGCGGCCAC
TGGCAGGTGG TCGGCTCCGG CGAACAGGTC GCCGATGCCA TGCAGGAACT GTTCGAGAAC
GGCGGCGCCG ACGGCTTCAA TGTCATGTCC CCTTATTTGC CGGGAGGCCT GGACGATTTC
GTCGAGCATG TCCTGCCGGT ATTGCGCGAG CGCGGCCTGC TGCGCGGCGA ATACCAGGGC
ACGACCCTGC GCGAGCATCT CGGCCTGCAA CGTCCGCCGT CGCCGCACCT GCGCGCGACC
GCCGGCCTTT CCAGCCACGA ATTGACTGCC TGA
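
The gene sequence above is printed in space-separated groups of 10 bases, so it has to be cleaned before any calculation. The sketch below shows one way to strip the grouping and compute a GC fraction of the kind reported in the GC content field. The helper names are illustrative, and only the first printed line is used as sample input; pasting the full sequence would give the value for the whole gene.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a sequence printed in 10-base groups."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First printed line of the gene sequence, used only as sample input.
first_line = "ATGGCGCTGG GCGCCTTTCT CTATCCCACC GGACACCATG CGGCCGCCTG GCGCCATCCC"
seq = clean_sequence(first_line)
print(len(seq), f"{100 * gc_fraction(seq):.0f}% GC")
```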
 
Protein sequence
MALGAFLYPT GHHAAAWRHP QAQADAGINF EHYVGLAQTA ERAGFDLLFL ADSAAARGKD 
WVALSRFATH YVAQFEPLTL ISALAAVTRR IGLVATASTT YNDPYSLARK LASIDHISHG
RVGWNLVTSG NEEEAFNFGA TAYPPHAERY RRAREFAQVL KGLWDSWDDD AFPRDKASGL
FLDVDKMHVL GHQGEHFSVR GPLNIPRSPQ GWPVLVQAGS SEAGKELAAE TAEVVFTAQR
TLADAQAFYA DVKGRMARYG RQPSALKIMP GIFPVVAATE AEAQAKFEAL QDLIHPSVAL
AILEHRLGVP LAHLPQDGPL PEIAEVESTR SRRALLLEVA RREGLSIRQL ALRVAGARGH
WQVVGSGEQV ADAMQELFEN GGADGFNVMS PYLPGGLDDF VEHVLPVLRE RGLLRGEYQG
TTLREHLGLQ RPPSPHLRAT AGLSSHELTA
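
The protein sequence can be related back to the gene sequence by translating codon by codon. The following is a minimal sketch rather than a reference implementation: it builds the codon table from the compact amino-acid string used for NCBI translation table 11, whose codon-to-amino-acid assignments are, to my understanding, the same as the standard code (the table differs in which start codons it permits). Alternative start codons are not handled here, and the name `translate` is illustrative.

```python
from itertools import product

# Codon order follows the base order T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCGCTGG GCGCCTTTCT CTATCCCACC"))  # MALGAFLYPT
```

The output matches the first group of the protein sequence, MALGAFLYPT.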