Gene Aave_3044 details


Gene Information

Locus tag: Aave_3044
Symbol:
ID: 4667904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax citrulli AAC00-1
Kingdom: Bacteria
Replicon accession: NC_008752
Strand:
Start bp: 3352945
End bp: 3354120
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 72%
IMG OID: 639824246
Product: alkanesulfonate monooxygenase
Protein accession: YP_971385
Protein GI: 120611707
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent
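
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 3352945
END_BP = 3354120


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1176 391
```

Both results match the Gene Length (1176 bp) and Protein Length (391 aa) fields listed in the record.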


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.301199
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0454974
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGTGT TCTGGTTCAT TCCCACCCAC GGCGACAGCC GCTACCTCGG GACGTCCGAG 
GGTGCCCGCG CCGTCCACTA CGACTACCTG CGCCAGGTGG CCACGGCCGC CGATACCCTG
GGCTACGAAG GGGTGCTGAT CCCCACGGGC CGCTCCTGCG AGGACCCCTG GGTGGTCGCC
TCGGCACTCG CGCCCGTGAC ACGGCGCCTG AAGTTCCTCG TGGCGGTGCG CCCGGGCCTG
CACCAGCCGG CCCTGGCCGC TCGCATGGCC GCGACCTTCG ACCGCCTGTC GGGCGGGCGC
CTGCTCATCA ACCTCGTGAC GGGCGGCGAC CGCACGGAAC TGGAGGGCGA CGGTGTCTTC
CTGGACCATG CCCAGCGCTA TGCACAGTCG GAGGAATTCA TCCGTATCTG GCGCGAGATC
CTGTCGCGCA GCCACGAGGG CGGCACCTTC GACTACGAGG GAGAGCACCT GTCGGTGAAG
GGCGCCAAGC TGCTCTACCC GCCGGTGCAG AAGCCCTATC CGCCGGTGTA CTTCGGCGGC
TCGTCCGAGG CCGCGCACGA CCTCGCGGCG GAGCAGGTCG ATACCTACCT CACCTGGGGC
GAGCCGCCGG CCGCGGTGGC GCAGAAGGTG GCCGACGTGC GCGCCCGCGC CGCGCAACGG
GGCCGGACGG TGCGCTTCGG CATACGGCTG CACGTGATCG TGCGCGAGAC CGATGCGGCC
GCATGGGCCG CGGCGGAAGA GCTCATCAGC CGCGTGCAGG ACGAGACCGT GGCCCAGGCG
CAGGCCGTGT TCTCGCGCAT GGATTCCGAA GGGCAGCGCC GCATGGCCGC GCTGCACGCC
GGGGGCACCC GCCGCTCCCG CGCGGACCTG GAGATCAGCC CCAACCTCTG GGCCGGCGTG
GGCCTGGTGC GCGGCGGCGC GGGCACGGCG CTGGTGGGCG ATCCGCAGAC CGTGGCCGCG
CGCATGCAGG AGTACGCGGA CCTGGGCATC GACACCTTCG TGCTCTCCGG CTATCCGCAC
CTGGAAGAGG CCTACCGCTT CGCGGAACTG GTGTTCCCGC TGCTGCCCGC CGAGGTGCGC
GAGCGCATCG GCGGCGGCCG CGCGGCGGGG CCCCTGACCG GGCCTTTCGG CGAAATCGTG
GGCAACCAGT ACGTGCCGCG CGCGGCGCAG AGCTGA
 
Protein sequence
MHVFWFIPTH GDSRYLGTSE GARAVHYDYL RQVATAADTL GYEGVLIPTG RSCEDPWVVA 
SALAPVTRRL KFLVAVRPGL HQPALAARMA ATFDRLSGGR LLINLVTGGD RTELEGDGVF
LDHAQRYAQS EEFIRIWREI LSRSHEGGTF DYEGEHLSVK GAKLLYPPVQ KPYPPVYFGG
SSEAAHDLAA EQVDTYLTWG EPPAAVAQKV ADVRARAAQR GRTVRFGIRL HVIVRETDAA
AWAAAEELIS RVQDETVAQA QAVFSRMDSE GQRRMAALHA GGTRRSRADL EISPNLWAGV
GLVRGGAGTA LVGDPQTVAA RMQEYADLGI DTFVLSGYPH LEEAYRFAEL VFPLLPAEVR
ERIGGGRAAG PLTGPFGEIV GNQYVPRAAQ S
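
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-residue mapping of translation table 11, which matches the standard genetic code for internal codons. It does not handle alternative start codons, which is not an issue here because this gene begins with ATG. The `translate` helper is a hypothetical name written for this example and is not taken from any library.

```python
BASES = "TCAG"
# Residues for the 64 codons enumerated in TCAG order (standard mapping,
# which translation table 11 shares for internal codons). "*" marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence listed above.
first_groups = "ATGCACGTGT TCTGGTTCAT TCCCACCCAC"
print(translate(first_groups))  # MHVFWFIPTH
```

The output corresponds to the first ten residues of the protein sequence. Passing the full gene sequence in the same way should reproduce the full protein, stopping at the terminal TGA codon.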