Gene Avin_22400 details


Gene Information

Locus tag: Avin_22400
Symbol: dszA
ID: 7761158
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 2239883
End bp: 2241244
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 70%
IMG OID: 643805125
Product: xenobiotic compound monooxygenase, DszA family, A subunit protein
Protein accession: YP_002799406
Protein GI: 226944333
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID:
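
The coordinates and lengths listed in this record are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch rather than anything the database itself provides: the function names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon, neither of which the record states explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(2239883, 2241244)
print(length_bp)                  # 1362
print(protein_length(length_bp))  # 453
```

Under those assumptions, the start and end positions reproduce the 1362 bp gene length, and 1362 bp corresponds to 454 codons, that is, 453 residues plus a stop.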


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAAAC GCCAGCTTGC GCTGAACGCC TTCATCCTGC CGACCGGCCA CCACATCGCC 
GCCTGGCGCC GCGCGGACGT GCCGGAGCAC GCCAACCACG ATTTCGGCGA ATACGTCCGC
GTGGCGCGCC TGGCCGAGGC GGCCAAGTTC GACGCGGTGT TCGTCTCCGA CTCGGTGGCG
GTGAACGCGG GCAACGGTGG CAGCGGCCTG GAGGAACTGG CGCTGACCGC CCGCGGTACC
CGGCTGGAAC CCTTTACCCT GCTCTCGGGG CTGGCCGCGG TGACCGAGCG CATCGGTCTG
ATCGCCACCG TCAGCACCAC CTACAACGAG CCCTATCACC TGGCCCGCAA GTTCGCCTCG
CTGGACTGGA TCTCCAGGGG ACGCGCCGGC TGGAACCTGG TGACCTCTTC GGCGCCGGCC
GAGGCGCTGA ACTTCAACCA GAGCGAGCTG CCCGAACACG GCGAGCGCTA CCGGCGCGCC
GAGGAGTTCC TCGAGGTGGT GCAGGGCCTG TGGGATAGCT GGGAAGACGA CGCCTTCGTC
CGCGACAAGG CCAGCGGCCA GTTCTTCGAT CCGGCCAGGC TGCACGTGCT GGACCACCAG
GGCGAGCACT ACCGGGTGCG CGGTCCGCTG AACGTGGCGC GCCCGCCGCA GGGCTATCCG
GTGCTGGTGC AGGCCGGCTC CTCCGAGCCG GGCAAGGAAA TCGCCGCGCG CAGCGCCGAG
GCGATCTTCA CCGCACACCA GACCCTGGAA AGCGCCCAGG CCTTCTACGC CGATGTGAAA
GGCCGTCTCG CCAAGTACGG CCGCGCGCCC GAGGAGCTGA AGATCCTCCC GGGCATTTTC
CCGGTGGTCG GCCGCAGCGA GGCCGAAGCC CAGGCCAAGT ACCGCGAACT GCAGGAACTG
ATCGACCCGC GGGTCGGCGT CAACCTGCTC AGCGCCCTGA TCGGCAGCGA CCTGTCCGGC
TTCGATGTCG ACGGCCCACT GCCCGAGCTG GCGACCACCA ACCAGAACAA GAGCCGCCAG
GCCCTGCTGC TCGAACTGGC GCACCGCGAG AACCTGACCA TCCGCGAACT CTACCTGCGC
ATCGCCGGCG CCCGCGGCCA CTGGACGGTG GTCGGCACCC CGACGCAGAT CGCCGACCAG
ATCCAGACCT GGTTCGAGCA GGGCGCGGCC GACGGCTTCA ACGTGCTGGC GCCCTGGCTG
CCCGGCGGCC TGGAGGAGTT CATCGACCAG GTGGTGCCGG AACTGCGCCG CCGCGGCCTG
TTCCGCGAGG AGTACAGCGG CGCCACCCTG CGCGAGCACC TGGGCCTGGC CCGTCCGGAA
AACCAGCATG CCGCCGGGCG CCAGCGGCGC GCGGCGAGCT GA
 
Protein sequence
MSKRQLALNA FILPTGHHIA AWRRADVPEH ANHDFGEYVR VARLAEAAKF DAVFVSDSVA 
VNAGNGGSGL EELALTARGT RLEPFTLLSG LAAVTERIGL IATVSTTYNE PYHLARKFAS
LDWISRGRAG WNLVTSSAPA EALNFNQSEL PEHGERYRRA EEFLEVVQGL WDSWEDDAFV
RDKASGQFFD PARLHVLDHQ GEHYRVRGPL NVARPPQGYP VLVQAGSSEP GKEIAARSAE
AIFTAHQTLE SAQAFYADVK GRLAKYGRAP EELKILPGIF PVVGRSEAEA QAKYRELQEL
IDPRVGVNLL SALIGSDLSG FDVDGPLPEL ATTNQNKSRQ ALLLELAHRE NLTIRELYLR
IAGARGHWTV VGTPTQIADQ IQTWFEQGAA DGFNVLAPWL PGGLEEFIDQ VVPELRRRGL
FREEYSGATL REHLGLARPE NQHAAGRQRR AAS
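
The gene and protein sequences above can be checked against each other by translating the DNA codon by codon. The sketch below is illustrative only: the helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, and only the first 60 bases and first 20 residues are copied in to keep the example short. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are allowed), so a standard codon table is used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (last base varies fastest).
# Table 11 shares these assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used for display grouping."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 60 bases and first 20 residues, copied from the record.
gene_prefix = "ATGAGTAAAC GCCAGCTTGC GCTGAACGCC TTCATCCTGC CGACCGGCCA CCACATCGCC"
protein_prefix = "MSKRQLALNA FILPTGHHIA"

assert translate(gene_prefix) == clean(protein_prefix)
print(translate(gene_prefix))  # MSKRQLALNAFILPTGHHIA
```

Pasting the full gene sequence into `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content; the prefix alone is not expected to match the whole-gene GC figure.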