Gene Information |
Locus tag | Ava_5016 |
Symbol | |
ID | 3679029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anabaena variabilis ATCC 29413 |
Kingdom | Bacteria |
Replicon accession | NC_007413 |
Strand | + |
Start bp | 6304439 |
End bp | 6305584 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 637720376 |
Product | aliphatic sulfonate monooxygenase |
Protein accession | YP_325508 |
Protein GI | 75911212 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
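
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a final codon that is a stop codon. The record does not state either convention explicitly, and the variable names are illustrative only.

```python
# Values copied from the record above; variable names are illustrative only.
start_bp = 6304439
end_bp = 6305584

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon,
# so it does not contribute an amino acid to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1146 0 381
```

Under those assumptions the result agrees with the reported Gene Length (1146 bp) and Protein Length (381 aa).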
| |
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0000170849 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.183828 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCTAC ATTGGTTTAT TCCCACTCAC GGTGAAGGAC GCTATCTAGG TACTGCCATA GGCGGGCGCG TAGTCAACTT TGATTACTGG CGGCAGATTG CTCAAGCTGT AGATCACTTA GGTTTTACAG GTGCATTATT ACCTACAGGC CGTTCTTGTG AAGATGCTTG GATTTTGGCA TCAACCTTGG TAACACACAC CAAACAGATG AAATTTTTGG TGGCGATACG TCCGGGGTTG ATGTCACCGG GAGTTGCAGC CAGAATGGCA GCAACTTTTG ATCGCATTTC TGGAGGACGT TTGCTAATTA ACGTCGTTAC AGGTGGTGAT CCAGTGGAAT TGGCAGGGGA TGGATTGCAT CTTACCCATG ACGATCGCTA CAGACTAACA GATGAATTTT TGGCAGTGTG GCGGGAGATT GCTGCCGGAG AAACCGCTAA CTTCCAGGGC GATTATCTCA ATATTCAAGA CGGCAAGCTG TTATTTCCCC CAGTCCAGAA ACCTCATCCA CCTCTGTGGT TTGGCGGTTC TTCCCCCATT GCTCAAGAAA TCGCTGCCAA ACACGTAGAT GTATATTTAA CTTGGGGCGA ACCACCAGAA CAAGTTGCCC AAAAGATTGC CTCTGTACGC CGACTAGCAG CAGCACAAGG AAGAACATTA AGTTTTGGGA TTCGTCTCCA TGTCATTGTC CGGGAGACTG AAAGCCAGGC TTGGGATGCA GCCAACAATT TAATTCGCTA CGTAGATGAT TCGGCGATCG CCCAAGCCCA ACAAGCATAC TCTCGCATGG ATTCCGAAGG ACAACGCCGG ATGAAAGAAT TACACAACGG CAGTCGAGAA GCTTTAGAAA TTAGCCCGAA TTTATGGGCG GGAATTGGTT TAGTGCGCGG CGGTGCGGGG ACTGCCTTAG TTGGTGACCC CGATACAGTA GCTCAAAGAA TGCTGGAATA TCAAAAATTA GGTATCGATA CCTTTATCTT CTCCGGTTAT CCCCATCTAG AAGAAGCCTA TCGAGTAGCA GAGTTACTCT TTCCACGGTT ACCCCTACAG CATCAACCAA CAGTAGCACC ACAACTATTG AGTCCCTTTG GCGAAGTTGT AGCCAACCAA GACTTCCCTA AACAGCAAAT CGCCACTGTA GATTAA
Protein sequence | MQLHWFIPTH GEGRYLGTAI GGRVVNFDYW RQIAQAVDHL GFTGALLPTG RSCEDAWILA STLVTHTKQM KFLVAIRPGL MSPGVAARMA ATFDRISGGR LLINVVTGGD PVELAGDGLH LTHDDRYRLT DEFLAVWREI AAGETANFQG DYLNIQDGKL LFPPVQKPHP PLWFGGSSPI AQEIAAKHVD VYLTWGEPPE QVAQKIASVR RLAAAQGRTL SFGIRLHVIV RETESQAWDA ANNLIRYVDD SAIAQAQQAY SRMDSEGQRR MKELHNGSRE ALEISPNLWA GIGLVRGGAG TALVGDPDTV AQRMLEYQKL GIDTFIFSGY PHLEEAYRVA ELLFPRLPLQ HQPTVAPQLL SPFGEVVANQ DFPKQQIATV D
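
The sequences above are displayed in space-separated blocks of 10 characters, so they need light cleaning before use. The following is a minimal sketch, not the database's own tooling: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. It assumes the codon-to-amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which this sketch does not model), and only short excerpts of the gene are used.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: these also apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(display: str) -> str:
    """Remove the spacing used in the 10-character display blocks."""
    return "".join(display.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence shown above.
head = clean("ATGCAGCTAC ATTGGTTTAT TCCCACTCAC")
tail = clean("GTAGATTAA")

print(translate(head))  # prints: MQLHWFIPTH
print(translate(tail))  # prints: VD
```

Both excerpts agree with the start (MQLHWFIPTH) and end (...ATV D) of the protein sequence in the record. Passing the full cleaned gene sequence to `gc_fraction` would give a value to compare against the reported GC content.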