Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Avin_31330 |
Symbol | |
ID | 7762032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Azotobacter vinelandii DJ |
Kingdom | Bacteria |
Replicon accession | NC_012560 |
Strand | + |
Start bp | 3238826 |
End bp | 3239917 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 643806007 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_002800271 |
Protein GI | 226945198 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGTTC AGATTCTCGG CATGATCGGC CATCGACTTT CCTCGGAAAC CATAGCCCCG GTAGGGCCGG TATTCGACAA GAACTACATC CGCAACTTCG CCCAGGCGCA CGAGAACGCC GGTTTCGACC GCATCCTGGT CGGCTACTGG TCCGATCAGC CCGACGGCTT CCTGGTCACC GCCCTGGCCG GCCTGTCCAC CAGCCGCATC GGCCTGCTGC TGGCGCACCG GCCGGGCTTC GTCGCGCCGA CCCTGGCCGC GCGCAAGCTG GCGACCCTCG ACCAGTTGCT CGACGGCCGC CTGGCGCTGA ACGTGATCAG CGGCGGCAGC GACAGCGAGC AGCGCAAGGA CGGCGACTTC CTCGACCACG ACCAGCGCTA TGCGCGCACC GACGAGTTCC TCGAGGTGCT GAGGAAGACC TGGACCTCGG AACAGCCGTT CGACCACAAG GGCGAGTTCT ATCGGGTCGA GCAGGCCTTC TCGGCGGTCA AGAGCGAGCA GAAGCCGCAC CTGCCGGTGT ATTTCAGCGG CGCCTCGGAC GCCGCCATCC GCGTCGCCGG CAAGCACGCC GACGTCTACA TGCTGTGGGG CGAATCCCTG CAGCAGACCC GCGAGCTGGT CGAGCGCGTG CGCGCCGAGG CGGCCGGGCA CGGCCGCGAC ATCGAGTTCA GCGTGTCCTT CCGGCCGATC GTCGCCGCCA CCGAGGACGC CGCCTGGGCC AAGGCCGAGG TCATCCTGAG CCGCGCCCGC GCGCGTCACG AAGTGGCCCG ACCGGAACTC TCCCTCAAGC CGGAAAGCAT CGGCGCCCAG CGCCTGCGCG CCACCGTGGC CCAGGGCGAG CGGGTCGACA AGCGCCTGTG GACCGGTATC GCCGGGCTGG TCGGCGGCGG CCACAACTCC ACCGCGCTGG TCGGCACCCC GGAACAGGTC GCCGACGCCC TGATCGACTA CTACGACCTG GGCATCCGCA ACATCCTGAT CCGCGGCTTC GACCCGCTCA ACGACGCCGT CGACTACGGC CGCGAGCTGA TCCCGCTGAT CCGCGCCAAG GCGGCCGAAC GCGACCTGCG AAACAGCGCC CGCCGCGCCT GA
|
Protein sequence | MSVQILGMIG HRLSSETIAP VGPVFDKNYI RNFAQAHENA GFDRILVGYW SDQPDGFLVT ALAGLSTSRI GLLLAHRPGF VAPTLAARKL ATLDQLLDGR LALNVISGGS DSEQRKDGDF LDHDQRYART DEFLEVLRKT WTSEQPFDHK GEFYRVEQAF SAVKSEQKPH LPVYFSGASD AAIRVAGKHA DVYMLWGESL QQTRELVERV RAEAAGHGRD IEFSVSFRPI VAATEDAAWA KAEVILSRAR ARHEVARPEL SLKPESIGAQ RLRATVAQGE RVDKRLWTGI AGLVGGGHNS TALVGTPEQV ADALIDYYDL GIRNILIRGF DPLNDAVDYG RELIPLIRAK AAERDLRNSA RRA
|
| |