Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_3957 |
Symbol | |
ID | 7970386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 4200378 |
End bp | 4201712 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644794543 |
Product | tryptophan halogenase |
Protein accession | YP_002945837 |
Protein GI | 239816927 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCCCG GACCTTTCAC GAGTACAGAG CCCGACGTCG TGGTCATCGG CGGGGGCCCG GCGGGTTCCA CCGTGGCCGC GTTGCTGGCG GACAAGGGCC ACGACGTGGT GCTGTTCGAG AAGGCGCACC ATCCGCGCTT TCACATCGGC GAATCGCTGC TGCCGATGAA CATGCCGCTG TTCGACCGCC TGGGCGTGCG CACCGAGGTC GAGGCCATCG GCATTGCCAA GCACGGCGCC GAGTTCGTCT CGCCCTGGCA TGACCACACC AGCCACTTCT TCTTCGGCGA GGCCATGGAC AAGAGCTTTC CGTATGCGGT GCACGTGCGC CGCTCGGAGT TCGACGAGCT GCTGTTCCGC CATGCCGGCA AGCGCGGCGC GCGTACCTTC GAAGGCCAGC GCGTCATGGG CGTGGACATG GACGCCGGCA AGGGCGCGGA CAAGCGCGCG CTGGTGAAGA TCAAGGCCGA GGACGGCAGC GAAACCAGCT GGCGCCCGCG CTTCGTGATC GACGCGAGCG GCCGCGACAC GGTGCTGTCG AACCAGTTCG ACGCCAAGCA GCGCAATCGC AAGCACGCCA GCGCGGCCCT GTTCGGCCAC TTTGCCAATG CGGAGCGCCG GCCCGGCCGC TACGAAGGCA ACATCTCGCT CTTCTGGTTC GACCACGGCT GGTTCTGGTA CATCCCGCTG AAGGACGGCA CCGTGAGCGT GGGCGCCGTG GCGTCGCCCG CGTATTTCAA GCGGCGCAAG GGCACGCTCG AGGAGTTCCT GATGGAGACC ATCGCGCTCG CGCCCAAGCT GGCCGCGCGC CTGAAGAACG CCACGCTGAT GGAAGGCGCC ACCACCACCG GCAACTACGC CTACGACTCC AAGTTCTGCC GCGGCGACCG TTTCATGATG GTTGGCGACG CCTATGCCTT CGTCGACCCG ATGTTCTCGT CGGGCGTGTA CCTGGCAATG AACAGCGCCT TCGAGAGCGC TACTGCGGCC GACCTCTGGC TCAGGGGCGA GATGAAGGAA GCCGAGAAGG CCTTCCGCCG CTTCGACAAG GTGATGAAGC ACGGCCCCAA GATGTTCTCG TGGTTCATCT ACCGCATCAC CTCGCCGGCC ATCCGCCGGC TGTTCATGAA CCCGCGCAAC ATCTGGCGCA TGCAGGAGGC GCTGCTGTCC ATCCTGGCGG GCGACCTGTT CCGCAACACG CCCATCGGCC CGCGCTTCTG GGGCTTCAAG ATCACCTACT ACGCCTCGTG CATCGGCATC CTGCCGCAGG CCATCAAGAC CTGGGCATGG CGCCGGCGCA ACCTGAAGGA ATCGCTGGAA GCAGCGAAAA CTTGA
|
Protein sequence | MTPGPFTSTE PDVVVIGGGP AGSTVAALLA DKGHDVVLFE KAHHPRFHIG ESLLPMNMPL FDRLGVRTEV EAIGIAKHGA EFVSPWHDHT SHFFFGEAMD KSFPYAVHVR RSEFDELLFR HAGKRGARTF EGQRVMGVDM DAGKGADKRA LVKIKAEDGS ETSWRPRFVI DASGRDTVLS NQFDAKQRNR KHASAALFGH FANAERRPGR YEGNISLFWF DHGWFWYIPL KDGTVSVGAV ASPAYFKRRK GTLEEFLMET IALAPKLAAR LKNATLMEGA TTTGNYAYDS KFCRGDRFMM VGDAYAFVDP MFSSGVYLAM NSAFESATAA DLWLRGEMKE AEKAFRRFDK VMKHGPKMFS WFIYRITSPA IRRLFMNPRN IWRMQEALLS ILAGDLFRNT PIGPRFWGFK ITYYASCIGI LPQAIKTWAW RRRNLKESLE AAKT
|
| |