Gene Avin_31340 details


Gene Information

Locus tag: Avin_31340
Symbol: dszA
ID: 7762033
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 3240053
End bp: 3241378
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 72%
IMG OID: 643806008
Product: dibenzothiophene sulfone monooxygenase
Protein accession: YP_002800272
Protein GI: 226945199
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID:
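
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the record above.
START_BP = 3240053
END_BP = 3241378


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete, unspliced CDS that ends in a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no residue.
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1326 441
```

Under those assumptions, the computed values agree with the Gene Length and Protein Length fields listed above.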


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.885637
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCGCC AGATCCACCT GGCCGGCTTC CTGCTCGCCG GCCCCGTCGT CCACAGCCAT 
GCCGCCTGGC GGCACCCGGA AACCCAGGGC AACTTCCTCG AACCCGAGTA CTACGCGCGC
AGCGCGAAGG TACTGGAAGA AGGTCTGTTC GACCTGCTGT TCTTCGCCGA CCGCTTCGCC
ATCGGCGACC AGTTGGGCGG CTCCCGAGAG CTGGCGCTGC GCCACGGCGC CCAGGACGCC
ACCCGCCTCG ACCCGCTGCC GATCCTCTCC TACCTGGTCG GCCAGACCAG CCGGCTCGGC
CTCGGCGTCA CCCGCTCCAC CACCTATTAC GAGCCGGTCC ACGTGGCCCG CTCGCTGGCC
ACCCTCGACC ATCTGTCGCG CGGGCGCGCC GCCTGGAACA TCGTCACCTC GATGAACGAC
AGCGAAGGCC GGCTGTTCGG CAAGGACCGC CACCTGGAGC ACGACCTGCG CTACGACCGC
GCCGACGAGT TCGTCGAGGT CGCCCTGAAG CTGTGGAGCA GTTGGGCGCC GGGCGCGCTC
AAGCTCGACC GCGAGCAAGG CATCCTCGCC GACCCGGCCG GGGTGCGCGA CGTCGCCCAC
CGGGGCGAGT GGTTCCGCGT CGAGGGGCCG CTGAACATCC CGCGCACCCC GCAGGGCCGC
CCGGTGCTGA TCCAGGCCGG CTCCAGCGGT CGCGGCCGGC GCTTCGGCGC GCGCTGGGCC
GAGGCCATCT TCAGCATCCA CGGCAACCTG CCGGCGATGC GCGCCTTCCG CGACGACGTG
CGCCAGCAGG TGGTCGCCCA GGGCCGCGCG GCGGAGCAGT GCAAGATCCT CACCGCGGTG
ATGCCCTTCA TCGGCGGCAG CGAGGCCGAG GCGCGCGCCA AGCGCGACCG CCACAACGCC
CTGGCGCACC CGCAGATCGG CCTGGCGACG CTGTCGGCAC AGCTCAACTT CGACTTCTCC
GCCTTCGCCC CGGACAGCCG CCTGGAAAGC ATCGCCGCGC ACCCCGAAAC GCCGCCGGCG
GTCGCCGAGA AACTGCGCTC GCTGGGCGCC TCGCTGAGCC TCGGCGAACT GGCGCAGACC
TTCGCCAGCA GCGTGCGCGT GCCGCAACTG GTGGGCACCG GCGTGCAGAT CGCCGAGCAA
CTGGCGGAGA TCTTCAACGC CGGCGGCTGC GACGGCTTCG TGATTTCCCC CGGCTACCTG
CCGGGCTCCT TCGCCGAATT CGCCGAGAGC GTGGTGCCGC ACCTGCAGCG CCTGGGCCTC
TACCGCCGCG CCTACGAGGG CGCGACCCTG CGCGAACACC TGGGCCTCGG CCCCCTGGAG
GCCTGA
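
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any analysis. As a minimal sketch, the hypothetical helpers `clean_sequence` and `gc_fraction` below show one way to do that and to compute a GC fraction; the example uses only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence above.
first_blocks = "ATGAGCCGCC AGATCCACCT"
print(clean_sequence(first_blocks))  # ATGAGCCGCCAGATCCACCT
print(gc_fraction(first_blocks))     # 0.6
```

Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field of the record.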
 
Protein sequence
MSRQIHLAGF LLAGPVVHSH AAWRHPETQG NFLEPEYYAR SAKVLEEGLF DLLFFADRFA 
IGDQLGGSRE LALRHGAQDA TRLDPLPILS YLVGQTSRLG LGVTRSTTYY EPVHVARSLA
TLDHLSRGRA AWNIVTSMND SEGRLFGKDR HLEHDLRYDR ADEFVEVALK LWSSWAPGAL
KLDREQGILA DPAGVRDVAH RGEWFRVEGP LNIPRTPQGR PVLIQAGSSG RGRRFGARWA
EAIFSIHGNL PAMRAFRDDV RQQVVAQGRA AEQCKILTAV MPFIGGSEAE ARAKRDRHNA
LAHPQIGLAT LSAQLNFDFS AFAPDSRLES IAAHPETPPA VAEKLRSLGA SLSLGELAQT
FASSVRVPQL VGTGVQIAEQ LAEIFNAGGC DGFVISPGYL PGSFAEFAES VVPHLQRLGL
YRRAYEGATL REHLGLGPLE A