Gene Avin_31330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_31330 
Symbol 
ID7762032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3238826 
End bp3239917 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content69% 
IMG OID643806007 
Productalkanesulfonate monooxygenase 
Protein accessionYP_002800271 
Protein GI226945198 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGTTC AGATTCTCGG CATGATCGGC CATCGACTTT CCTCGGAAAC CATAGCCCCG 
GTAGGGCCGG TATTCGACAA GAACTACATC CGCAACTTCG CCCAGGCGCA CGAGAACGCC
GGTTTCGACC GCATCCTGGT CGGCTACTGG TCCGATCAGC CCGACGGCTT CCTGGTCACC
GCCCTGGCCG GCCTGTCCAC CAGCCGCATC GGCCTGCTGC TGGCGCACCG GCCGGGCTTC
GTCGCGCCGA CCCTGGCCGC GCGCAAGCTG GCGACCCTCG ACCAGTTGCT CGACGGCCGC
CTGGCGCTGA ACGTGATCAG CGGCGGCAGC GACAGCGAGC AGCGCAAGGA CGGCGACTTC
CTCGACCACG ACCAGCGCTA TGCGCGCACC GACGAGTTCC TCGAGGTGCT GAGGAAGACC
TGGACCTCGG AACAGCCGTT CGACCACAAG GGCGAGTTCT ATCGGGTCGA GCAGGCCTTC
TCGGCGGTCA AGAGCGAGCA GAAGCCGCAC CTGCCGGTGT ATTTCAGCGG CGCCTCGGAC
GCCGCCATCC GCGTCGCCGG CAAGCACGCC GACGTCTACA TGCTGTGGGG CGAATCCCTG
CAGCAGACCC GCGAGCTGGT CGAGCGCGTG CGCGCCGAGG CGGCCGGGCA CGGCCGCGAC
ATCGAGTTCA GCGTGTCCTT CCGGCCGATC GTCGCCGCCA CCGAGGACGC CGCCTGGGCC
AAGGCCGAGG TCATCCTGAG CCGCGCCCGC GCGCGTCACG AAGTGGCCCG ACCGGAACTC
TCCCTCAAGC CGGAAAGCAT CGGCGCCCAG CGCCTGCGCG CCACCGTGGC CCAGGGCGAG
CGGGTCGACA AGCGCCTGTG GACCGGTATC GCCGGGCTGG TCGGCGGCGG CCACAACTCC
ACCGCGCTGG TCGGCACCCC GGAACAGGTC GCCGACGCCC TGATCGACTA CTACGACCTG
GGCATCCGCA ACATCCTGAT CCGCGGCTTC GACCCGCTCA ACGACGCCGT CGACTACGGC
CGCGAGCTGA TCCCGCTGAT CCGCGCCAAG GCGGCCGAAC GCGACCTGCG AAACAGCGCC
CGCCGCGCCT GA
 
Protein sequence
MSVQILGMIG HRLSSETIAP VGPVFDKNYI RNFAQAHENA GFDRILVGYW SDQPDGFLVT 
ALAGLSTSRI GLLLAHRPGF VAPTLAARKL ATLDQLLDGR LALNVISGGS DSEQRKDGDF
LDHDQRYART DEFLEVLRKT WTSEQPFDHK GEFYRVEQAF SAVKSEQKPH LPVYFSGASD
AAIRVAGKHA DVYMLWGESL QQTRELVERV RAEAAGHGRD IEFSVSFRPI VAATEDAAWA
KAEVILSRAR ARHEVARPEL SLKPESIGAQ RLRATVAQGE RVDKRLWTGI AGLVGGGHNS
TALVGTPEQV ADALIDYYDL GIRNILIRGF DPLNDAVDYG RELIPLIRAK AAERDLRNSA
RRA