Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Vapar_2969 |
Symbol | |
ID | 7972245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Variovorax paradoxus S110 |
Kingdom | Bacteria |
Replicon accession | NC_012791 |
Strand | - |
Start bp | 3127547 |
End bp | 3128914 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644793554 |
Product | tryptophan halogenase |
Protein accession | YP_002944855 |
Protein GI | 239815945 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.339747 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATTGC CTCAAGCCAC TTCCTCCATT TCTTCCGCGC CCCCGGCGGC CGACGAGTCC TGCGACGTGC TCGTGATCGG TGGCGGCCCC GCGGGCTCCA CGATCTCGGC GCTGCTCGCA CAGCAGGGCC GCAAGGTGGT GCTGCTCGAG AAGGAGCACC ATCCGCGCTT TCACATTGGC GAATCGCTGC TGCCCGCGAA CGTGGAACTG TTCGACAGGC TCGGCGTGCG CGACCAAGTC GAGAAGATCG GCATGCCCAA GTTCGGCATC GAGTTCGTCT CGCCCGAGCA TGAGCACCGC AGCTACGTGG ACTTTGCCGA AGGCTGGGAC AAGTCGCTCG ATTCGGCCTG GCAGGTGCGC CGCTCGGAGC TCGACGAGCT GCTGTTCCGC AATGCCGCCG CGCGCGGCGC GCAGGCACTC GAAGGCTGCA AGGTGCGCGA CGTGGCCTTC GATGCAAATG GCGCCACGGT TCAGGCCCAG ATGGACGACG GCGCCAGGCG CAGCTGGCGC GCGCGCTTCG TGGTCGATGC CACGGGCCGC GACACGCTGC TGGCCAACAA ATTCCGCTGC AAGGAAAAGA ACCCCGACCA CAACAGCACG GCGCTGTTCG GCCATTTCAC CAACGCCGAG CGGCTGGAGG GCAAGAAGGA AGGCAACATC AGCATCTGCT GGTTTGCGCA CGGCTGGTTC TGGTTCATTC CGCTGGCCGA CGGAACCACC AGCGTCGGTG CGGTCTGCTG GCCGTACTAC CTGAAGACAC GCGACAAGCC GCTGAAAGAT TTCTTCTACG ACACCATCGC GCTGTGCCCG GTGCTGGTGG ACCGGCTGAA GCACGCCACG CTGGTGGACG ATGCGGTGCA TGCCACCGGC AACTTCTCGT ATTCGAGCAC GCACGCCACG GGCGACCGCT ACCTGATGCT GGGCGATGCC TTCACCTTCA TCGACCCGAT GTTCTCGTCG GGCGTGTACC TGGCGATGCA CAGCGCCTTC GACGGCGCCG GGCTGGTGGC CACCGCGCTC GACCGGCCGG CCGAACTGGC GCCCGTGCGC GAGAACTTCG AGGCCATGAT GCGCAAGGGG CCGCGTGAGT ATTCGTGGTT CATCTACCGC GTGACCAACC CGACCATCCG CGACATGTTC ATGCACCCGG GCAATCCCTT CCGCGTGAAG GAGGGGCTGA TGTCGCTGCT GGCCGGCGAC ATCTACCGCG GCACGCCGAT CTGGCGGGCG CTCGGCATGT TCAAGTTTCT CTACTACTTC ATCTCGATCA CCCACCTGCG CCGCACCTGG GCCGGCTGGA AGCGGCACCG CTTCAACATC CGAGACATGG GCGCACTGAA GGGCGAGACG ATCCTCAAGT CGGAGTAG
|
Protein sequence | MSLPQATSSI SSAPPAADES CDVLVIGGGP AGSTISALLA QQGRKVVLLE KEHHPRFHIG ESLLPANVEL FDRLGVRDQV EKIGMPKFGI EFVSPEHEHR SYVDFAEGWD KSLDSAWQVR RSELDELLFR NAAARGAQAL EGCKVRDVAF DANGATVQAQ MDDGARRSWR ARFVVDATGR DTLLANKFRC KEKNPDHNST ALFGHFTNAE RLEGKKEGNI SICWFAHGWF WFIPLADGTT SVGAVCWPYY LKTRDKPLKD FFYDTIALCP VLVDRLKHAT LVDDAVHATG NFSYSSTHAT GDRYLMLGDA FTFIDPMFSS GVYLAMHSAF DGAGLVATAL DRPAELAPVR ENFEAMMRKG PREYSWFIYR VTNPTIRDMF MHPGNPFRVK EGLMSLLAGD IYRGTPIWRA LGMFKFLYYF ISITHLRRTW AGWKRHRFNI RDMGALKGET ILKSE
|
| |