Gene Vapar_3957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVapar_3957 
Symbol 
ID7970386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVariovorax paradoxus S110 
KingdomBacteria 
Replicon accessionNC_012791 
Strand
Start bp4200378 
End bp4201712 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content66% 
IMG OID644794543 
Producttryptophan halogenase 
Protein accessionYP_002945837 
Protein GI239816927 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCCCG GACCTTTCAC GAGTACAGAG CCCGACGTCG TGGTCATCGG CGGGGGCCCG 
GCGGGTTCCA CCGTGGCCGC GTTGCTGGCG GACAAGGGCC ACGACGTGGT GCTGTTCGAG
AAGGCGCACC ATCCGCGCTT TCACATCGGC GAATCGCTGC TGCCGATGAA CATGCCGCTG
TTCGACCGCC TGGGCGTGCG CACCGAGGTC GAGGCCATCG GCATTGCCAA GCACGGCGCC
GAGTTCGTCT CGCCCTGGCA TGACCACACC AGCCACTTCT TCTTCGGCGA GGCCATGGAC
AAGAGCTTTC CGTATGCGGT GCACGTGCGC CGCTCGGAGT TCGACGAGCT GCTGTTCCGC
CATGCCGGCA AGCGCGGCGC GCGTACCTTC GAAGGCCAGC GCGTCATGGG CGTGGACATG
GACGCCGGCA AGGGCGCGGA CAAGCGCGCG CTGGTGAAGA TCAAGGCCGA GGACGGCAGC
GAAACCAGCT GGCGCCCGCG CTTCGTGATC GACGCGAGCG GCCGCGACAC GGTGCTGTCG
AACCAGTTCG ACGCCAAGCA GCGCAATCGC AAGCACGCCA GCGCGGCCCT GTTCGGCCAC
TTTGCCAATG CGGAGCGCCG GCCCGGCCGC TACGAAGGCA ACATCTCGCT CTTCTGGTTC
GACCACGGCT GGTTCTGGTA CATCCCGCTG AAGGACGGCA CCGTGAGCGT GGGCGCCGTG
GCGTCGCCCG CGTATTTCAA GCGGCGCAAG GGCACGCTCG AGGAGTTCCT GATGGAGACC
ATCGCGCTCG CGCCCAAGCT GGCCGCGCGC CTGAAGAACG CCACGCTGAT GGAAGGCGCC
ACCACCACCG GCAACTACGC CTACGACTCC AAGTTCTGCC GCGGCGACCG TTTCATGATG
GTTGGCGACG CCTATGCCTT CGTCGACCCG ATGTTCTCGT CGGGCGTGTA CCTGGCAATG
AACAGCGCCT TCGAGAGCGC TACTGCGGCC GACCTCTGGC TCAGGGGCGA GATGAAGGAA
GCCGAGAAGG CCTTCCGCCG CTTCGACAAG GTGATGAAGC ACGGCCCCAA GATGTTCTCG
TGGTTCATCT ACCGCATCAC CTCGCCGGCC ATCCGCCGGC TGTTCATGAA CCCGCGCAAC
ATCTGGCGCA TGCAGGAGGC GCTGCTGTCC ATCCTGGCGG GCGACCTGTT CCGCAACACG
CCCATCGGCC CGCGCTTCTG GGGCTTCAAG ATCACCTACT ACGCCTCGTG CATCGGCATC
CTGCCGCAGG CCATCAAGAC CTGGGCATGG CGCCGGCGCA ACCTGAAGGA ATCGCTGGAA
GCAGCGAAAA CTTGA
 
Protein sequence
MTPGPFTSTE PDVVVIGGGP AGSTVAALLA DKGHDVVLFE KAHHPRFHIG ESLLPMNMPL 
FDRLGVRTEV EAIGIAKHGA EFVSPWHDHT SHFFFGEAMD KSFPYAVHVR RSEFDELLFR
HAGKRGARTF EGQRVMGVDM DAGKGADKRA LVKIKAEDGS ETSWRPRFVI DASGRDTVLS
NQFDAKQRNR KHASAALFGH FANAERRPGR YEGNISLFWF DHGWFWYIPL KDGTVSVGAV
ASPAYFKRRK GTLEEFLMET IALAPKLAAR LKNATLMEGA TTTGNYAYDS KFCRGDRFMM
VGDAYAFVDP MFSSGVYLAM NSAFESATAA DLWLRGEMKE AEKAFRRFDK VMKHGPKMFS
WFIYRITSPA IRRLFMNPRN IWRMQEALLS ILAGDLFRNT PIGPRFWGFK ITYYASCIGI
LPQAIKTWAW RRRNLKESLE AAKT