Gene Avin_43870 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_43870 
SymbolmsuD 
ID7763260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4434287 
End bp4435420 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content70% 
IMG OID643807242 
ProductAlkanesulfonate monooxygenase 
Protein accessionYP_002801483 
Protein GI226946410 
COG category[C] Energy production and conversion 
COG ID[COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases 
TIGRFAM ID[TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCTGG ACATCTTCTG GTTCCTTCCC ACTTCCGGCG ATACCCGCTA TCTCGGCAAG 
TCTTCCTCCG GCCGCCCGGC GACCAACGAA TACATGCGGC AGATCGCCGT GACCGCCGAA
AGCCTCGGCT ACGACGGCCT GCTGATCCCC ACCGGCAGCA GTTGCCTGGA CCCCTGGGTG
ACGGCGGCCA GCCTGGTGCC AGTGACCAGC CGCATCAAGC TGCTGGTGGC GTTGCGCACC
TCGGTCAGCG GCCCGACCGC CACCGCGCGC CAGGCCGCCA CTCTCGACCA GGCGCTGAAG
GGCCGCCTGC TGCTCAACGT GGTGCCGGGC GGCGATGCCA CCGAACTGGC CGCCGACGGT
GTGTTCCTCG ACCATGACGA GCGCTACGAG GCGGCCGACG AAGTGCTCAC CGTGTGGCGC
GACCTGCTGC AGGGCAAGAC CGTCGACTTC GCCGGCAAGC ACGTCACCGT CGAGGGCGCG
AAGAACTTCT TCCCGCCGGT GCAGCAACCC TATCCGCCGC TGTATTTCGG CGGCTCCTCG
CCGGCGGCCC ACGAACTGGC GGCCAAGCAC GTCGACGCCT ACCTGACCTG GGGCGAACCG
CCGGCGGCGG TGGCCGAGAA GATCGCCGAC GTGCGCGAGC GGGCGAAGAA GTACGGGCGC
AGCGTGCGCT TCGGCGTGCG CCTGCACGTG ATCGTGCGCG AGACCAACGA GGAAGCCTGG
GCCGCCGCCG AGAAGCTGAT CAGCCATCTC GACGACGAGA CCATCGCCAA GGCCCAGGCC
AACTACGCGG CCATGGACTC CGAGGGCCAG CGGCGCATGG CCGCGCTGCA CGGCGGGCGG
CGCGACAAAC TGGAAGTCAG CCCCAACCTC TGGGCCGGGG TCGGCCTGGT GCGCGGCGGC
GCCGGCACCG CGCTGGTCGG CGACCCGCAG ACCGTGGCCG CGCGCCTCAA GGAATACGCC
GACCTCGGCG TCGACAGCTT CGTGCTCTCC GGCTATCCGC ACCTGGAGGA GGCCATTCGC
TTCGCCGAAC TGGTGTTCCC GCTGCTGCCC GGCAAGCAGC CGGTGACCGT CGAGGAGGAA
CTGACCGGCG GCGCCTTCGA CGTGCGCGCC ACCAAGAGCG AGGCCGCCGC ATGA
 
Protein sequence
MSLDIFWFLP TSGDTRYLGK SSSGRPATNE YMRQIAVTAE SLGYDGLLIP TGSSCLDPWV 
TAASLVPVTS RIKLLVALRT SVSGPTATAR QAATLDQALK GRLLLNVVPG GDATELAADG
VFLDHDERYE AADEVLTVWR DLLQGKTVDF AGKHVTVEGA KNFFPPVQQP YPPLYFGGSS
PAAHELAAKH VDAYLTWGEP PAAVAEKIAD VRERAKKYGR SVRFGVRLHV IVRETNEEAW
AAAEKLISHL DDETIAKAQA NYAAMDSEGQ RRMAALHGGR RDKLEVSPNL WAGVGLVRGG
AGTALVGDPQ TVAARLKEYA DLGVDSFVLS GYPHLEEAIR FAELVFPLLP GKQPVTVEEE
LTGGAFDVRA TKSEAAA