Gene Information |
Locus tag | Vapar_3871 |
Symbol | |
ID | 7969728 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 4100395 |
End bp | 4101504 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644794457 |
Product | Alkanesulfonate monooxygenase |
Protein accession | YP_002945751 |
Protein GI | 239816841 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
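The length fields in this record can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; the record does not state either convention. The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Coordinates taken from the Start bp and End bp rows of this record.
length = gene_length(4100395, 4101504)
print(length, protein_length(length))  # prints "1110 369"
```

Under these assumptions the result agrees with the Gene Length (1110 bp) and Protein Length (369 aa) rows.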
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAGA ACGACGTTGA ATTCATCGGC ATGATCCAGG GCCAGAAGGT CTCGGAGATC CACGCGGCCA AGGGCCCGGC CATCGACCGC GACTACGTGC GCGCCTTTGC CCAGGCGCAC GAGAACGCGG GCTTCGACCG CGTGCTGGTG CCGCACCATT CCACCGGCCC CGACGCCACG CTGACCGTGG CCCATGCCGC ATCGGTCACC GAGCGCATCC ACTTCATGCT TGCGCACCGC CCGGGCTTCG TGGCGCCCAC GCTCGCGGCG CGCCAGTTCG CCTCGCTCGA CCAGTTCAGC GGCGGCCGGC TCGCGGTGCA CTACATCTCG GGCGGCTCCG ACGAGGAACA GCGCCGCGAC GGCGACTGGC TCGGCCATGA CCAGCGCTAT GCGCGCACCG ACGAATACCT CGACGTGCTG CACAAGGTGT GGACCGCCGA CAAGCCCTTC GACCATGAAG GCGCGCACTA CCGCTTCCAG AACGCTTTCT CCGAAGTGAA GCCGCTGCAG ACGCGCAACG GCAAGCCGCA TGTGCCGGTG TACTTTGGCG GTGCCTCGGA GGCGGCGATT CCGGTGGCCG GCAAGTACGC CGACGTGTAT GCGCTGTGGG GCGAGTCGCT CGACCAGGCG CGCGAACTCA CGGCGCGCGT GCGTGCAGAG GCCGCGAAGC ACGGGCGCAG CGTGCGCTTC TCGGTGTCGT TCCGCCCGAT CCTGGCGCAG ACCGAAGAGG CCGCATGGGC GCGCGCCGAG AACATCCTGG CCGAGACGAA GCGCCTGCGC GTGGTGCAGG GCTACAACCG CGGCGGGCCG CAGCAGAGCG AAGGTGCGAA GCGCCTGCTC GCCGCGGCCG AGAAGGGCAC GCGGCTCGAC AAGCGGCTGT GGACGGCGGT GGCGCAGGAG ATCGGCGGCC GCTCCAACAG CACGGCGCTG GTCGGCACGC CCGAGCAGGT GGCCGATGCG CTGCTCGACT ACTACGACCT GGGCGTGACC ACCTTCCTGA TCCGCGGCTT CGATCCGCTG GAAGACGCGG TCGACTATGG ACGCGAGCTG ATTCCGCGTG TGCGCGAGCT GGTGGCGCAG CGTGCGGCTT CGGCTCGCAA AGCGGCGTGA
|
Protein sequence | MSQNDVEFIG MIQGQKVSEI HAAKGPAIDR DYVRAFAQAH ENAGFDRVLV PHHSTGPDAT LTVAHAASVT ERIHFMLAHR PGFVAPTLAA RQFASLDQFS GGRLAVHYIS GGSDEEQRRD GDWLGHDQRY ARTDEYLDVL HKVWTADKPF DHEGAHYRFQ NAFSEVKPLQ TRNGKPHVPV YFGGASEAAI PVAGKYADVY ALWGESLDQA RELTARVRAE AAKHGRSVRF SVSFRPILAQ TEEAAWARAE NILAETKRLR VVQGYNRGGP QQSEGAKRLL AAAEKGTRLD KRLWTAVAQE IGGRSNSTAL VGTPEQVADA LLDYYDLGVT TFLIRGFDPL EDAVDYGREL IPRVRELVAQ RAASARKAA
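The gene and protein sequences can be compared with a short script. This is a minimal sketch, not the method the database used: it strips the spaces between the 10-character blocks, computes the GC fraction, and translates codon by codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, so the standard codon layout is used here. The sequence as listed starts with ATG and ends with TGA, so it is treated as already being in coding orientation and no reverse complement is applied. All function and constant names are hypothetical.

```python
# Standard codon layout in NCBI order (first, second, third base cycle over TCAG).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Assumes the first codon is ATG; alternative start codons allowed by
    table 11 (for example GTG) are not remapped to M in this sketch.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed in this record.
print(translate("ATGAGCCAGA ACGACGTTGA ATTCATCGGC"))  # prints "MSQNDVEFIG"
```

Passing the full Gene sequence value to `translate` and `gc_fraction` gives values that can be compared with the Protein sequence and GC content rows of this record.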