Gene Ava_5016 details

Gene Information

Locus tag: Ava_5016
Symbol:
ID: 3679029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007413
Strand:
Start bp: 6304439
End bp: 6305584
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 48%
IMG OID: 637720376
Product: aliphatic sulfonate monooxygenase
Protein accession: YP_325508
Protein GI: 75911212
COG category: [C] Energy production and conversion
COG ID: [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases
TIGRFAM ID: [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent
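
The length fields above are consistent with the coordinates if the start and end positions are read as inclusive, 1-based bounds: the gene length is End minus Start plus 1, and the protein length is the number of codons minus the terminal stop codon. A minimal Python sketch of that check follows; the function names are illustrative and not part of the database, and the inclusive-coordinate convention is an assumption.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6304439, 6305584)
print(length, protein_length(length))  # prints: 1146 381
```

Both values match the Gene Length and Protein Length fields of the record.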


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000170849
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.183828
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAGCTAC ATTGGTTTAT TCCCACTCAC GGTGAAGGAC GCTATCTAGG TACTGCCATA 
GGCGGGCGCG TAGTCAACTT TGATTACTGG CGGCAGATTG CTCAAGCTGT AGATCACTTA
GGTTTTACAG GTGCATTATT ACCTACAGGC CGTTCTTGTG AAGATGCTTG GATTTTGGCA
TCAACCTTGG TAACACACAC CAAACAGATG AAATTTTTGG TGGCGATACG TCCGGGGTTG
ATGTCACCGG GAGTTGCAGC CAGAATGGCA GCAACTTTTG ATCGCATTTC TGGAGGACGT
TTGCTAATTA ACGTCGTTAC AGGTGGTGAT CCAGTGGAAT TGGCAGGGGA TGGATTGCAT
CTTACCCATG ACGATCGCTA CAGACTAACA GATGAATTTT TGGCAGTGTG GCGGGAGATT
GCTGCCGGAG AAACCGCTAA CTTCCAGGGC GATTATCTCA ATATTCAAGA CGGCAAGCTG
TTATTTCCCC CAGTCCAGAA ACCTCATCCA CCTCTGTGGT TTGGCGGTTC TTCCCCCATT
GCTCAAGAAA TCGCTGCCAA ACACGTAGAT GTATATTTAA CTTGGGGCGA ACCACCAGAA
CAAGTTGCCC AAAAGATTGC CTCTGTACGC CGACTAGCAG CAGCACAAGG AAGAACATTA
AGTTTTGGGA TTCGTCTCCA TGTCATTGTC CGGGAGACTG AAAGCCAGGC TTGGGATGCA
GCCAACAATT TAATTCGCTA CGTAGATGAT TCGGCGATCG CCCAAGCCCA ACAAGCATAC
TCTCGCATGG ATTCCGAAGG ACAACGCCGG ATGAAAGAAT TACACAACGG CAGTCGAGAA
GCTTTAGAAA TTAGCCCGAA TTTATGGGCG GGAATTGGTT TAGTGCGCGG CGGTGCGGGG
ACTGCCTTAG TTGGTGACCC CGATACAGTA GCTCAAAGAA TGCTGGAATA TCAAAAATTA
GGTATCGATA CCTTTATCTT CTCCGGTTAT CCCCATCTAG AAGAAGCCTA TCGAGTAGCA
GAGTTACTCT TTCCACGGTT ACCCCTACAG CATCAACCAA CAGTAGCACC ACAACTATTG
AGTCCCTTTG GCGAAGTTGT AGCCAACCAA GACTTCCCTA AACAGCAAAT CGCCACTGTA
GATTAA
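
The GC content field can be recomputed from a gene sequence laid out in 10-base blocks like the one above. The sketch below assumes GC content is defined as the percentage of G and C bases over all bases, with whitespace ignored; `gc_content` is a hypothetical helper name, and how the database rounds the reported value is not stated.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence: 5 of 10 bases are G or C.
print(gc_content("ATGCAGCTAC"))  # prints: 50.0
```

Passing the full 1146 bp sequence (spaces and line breaks included) gives the gene-wide figure.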
 
Protein sequence
MQLHWFIPTH GEGRYLGTAI GGRVVNFDYW RQIAQAVDHL GFTGALLPTG RSCEDAWILA 
STLVTHTKQM KFLVAIRPGL MSPGVAARMA ATFDRISGGR LLINVVTGGD PVELAGDGLH
LTHDDRYRLT DEFLAVWREI AAGETANFQG DYLNIQDGKL LFPPVQKPHP PLWFGGSSPI
AQEIAAKHVD VYLTWGEPPE QVAQKIASVR RLAAAQGRTL SFGIRLHVIV RETESQAWDA
ANNLIRYVDD SAIAQAQQAY SRMDSEGQRR MKELHNGSRE ALEISPNLWA GIGLVRGGAG
TALVGDPDTV AQRMLEYQKL GIDTFIFSGY PHLEEAYRVA ELLFPRLPLQ HQPTVAPQLL
SPFGEVVANQ DFPKQQIATV D