Gene Vapar_2969 details


Gene Information

Locus tag: Vapar_2969
Symbol:
ID: 7972245
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Variovorax paradoxus S110
Kingdom: Bacteria
Replicon accession: NC_012791
Strand:
Start bp: 3127547
End bp: 3128914
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 65%
IMG OID: 644793554
Product: tryptophan halogenase
Protein accession: YP_002944855
Protein GI: 239815945
COG category: [C] Energy production and conversion
COG ID: [COG0644] Dehydrogenases (flavoproteins)
TIGRFAM ID:
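
The coordinates and lengths listed above agree with each other under two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive positions, and that the protein length excludes the terminal stop codon. A minimal Python sketch of that arithmetic follows; the function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(3127547, 3128914)
print(gene_length)                  # 1368, matching "Gene Length"
print(protein_length(gene_length))  # 455, matching "Protein Length"
```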


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.339747
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCATTGC CTCAAGCCAC TTCCTCCATT TCTTCCGCGC CCCCGGCGGC CGACGAGTCC 
TGCGACGTGC TCGTGATCGG TGGCGGCCCC GCGGGCTCCA CGATCTCGGC GCTGCTCGCA
CAGCAGGGCC GCAAGGTGGT GCTGCTCGAG AAGGAGCACC ATCCGCGCTT TCACATTGGC
GAATCGCTGC TGCCCGCGAA CGTGGAACTG TTCGACAGGC TCGGCGTGCG CGACCAAGTC
GAGAAGATCG GCATGCCCAA GTTCGGCATC GAGTTCGTCT CGCCCGAGCA TGAGCACCGC
AGCTACGTGG ACTTTGCCGA AGGCTGGGAC AAGTCGCTCG ATTCGGCCTG GCAGGTGCGC
CGCTCGGAGC TCGACGAGCT GCTGTTCCGC AATGCCGCCG CGCGCGGCGC GCAGGCACTC
GAAGGCTGCA AGGTGCGCGA CGTGGCCTTC GATGCAAATG GCGCCACGGT TCAGGCCCAG
ATGGACGACG GCGCCAGGCG CAGCTGGCGC GCGCGCTTCG TGGTCGATGC CACGGGCCGC
GACACGCTGC TGGCCAACAA ATTCCGCTGC AAGGAAAAGA ACCCCGACCA CAACAGCACG
GCGCTGTTCG GCCATTTCAC CAACGCCGAG CGGCTGGAGG GCAAGAAGGA AGGCAACATC
AGCATCTGCT GGTTTGCGCA CGGCTGGTTC TGGTTCATTC CGCTGGCCGA CGGAACCACC
AGCGTCGGTG CGGTCTGCTG GCCGTACTAC CTGAAGACAC GCGACAAGCC GCTGAAAGAT
TTCTTCTACG ACACCATCGC GCTGTGCCCG GTGCTGGTGG ACCGGCTGAA GCACGCCACG
CTGGTGGACG ATGCGGTGCA TGCCACCGGC AACTTCTCGT ATTCGAGCAC GCACGCCACG
GGCGACCGCT ACCTGATGCT GGGCGATGCC TTCACCTTCA TCGACCCGAT GTTCTCGTCG
GGCGTGTACC TGGCGATGCA CAGCGCCTTC GACGGCGCCG GGCTGGTGGC CACCGCGCTC
GACCGGCCGG CCGAACTGGC GCCCGTGCGC GAGAACTTCG AGGCCATGAT GCGCAAGGGG
CCGCGTGAGT ATTCGTGGTT CATCTACCGC GTGACCAACC CGACCATCCG CGACATGTTC
ATGCACCCGG GCAATCCCTT CCGCGTGAAG GAGGGGCTGA TGTCGCTGCT GGCCGGCGAC
ATCTACCGCG GCACGCCGAT CTGGCGGGCG CTCGGCATGT TCAAGTTTCT CTACTACTTC
ATCTCGATCA CCCACCTGCG CCGCACCTGG GCCGGCTGGA AGCGGCACCG CTTCAACATC
CGAGACATGG GCGCACTGAA GGGCGAGACG ATCCTCAAGT CGGAGTAG
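
The gene sequence is printed in space-separated blocks of 10 bases, so the whitespace has to be removed before computing a statistic such as the GC content reported in the record. The sketch below shows one way to do that in Python. The helper names are hypothetical, and the example runs on only the first two blocks of the sequence, so it illustrates the calculation without reproducing the 65% figure for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used only as a small example.
fragment = "ATGTCATTGC CTCAAGCCAC"
print(gc_fraction(fragment))  # 0.5 (10 of the 20 bases are G or C)
```

Passing the full gene sequence text to `gc_fraction` gives the value for the whole gene, which can then be compared with the reported GC content.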
 
Protein sequence
MSLPQATSSI SSAPPAADES CDVLVIGGGP AGSTISALLA QQGRKVVLLE KEHHPRFHIG 
ESLLPANVEL FDRLGVRDQV EKIGMPKFGI EFVSPEHEHR SYVDFAEGWD KSLDSAWQVR
RSELDELLFR NAAARGAQAL EGCKVRDVAF DANGATVQAQ MDDGARRSWR ARFVVDATGR
DTLLANKFRC KEKNPDHNST ALFGHFTNAE RLEGKKEGNI SICWFAHGWF WFIPLADGTT
SVGAVCWPYY LKTRDKPLKD FFYDTIALCP VLVDRLKHAT LVDDAVHATG NFSYSSTHAT
GDRYLMLGDA FTFIDPMFSS GVYLAMHSAF DGAGLVATAL DRPAELAPVR ENFEAMMRKG
PREYSWFIYR VTNPTIRDMF MHPGNPFRVK EGLMSLLAGD IYRGTPIWRA LGMFKFLYYF
ISITHLRRTW AGWKRHRFNI RDMGALKGET ILKSE
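
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. Translation table 11 assigns the same amino acids to codons as the standard genetic code and differs in which codons may act as start codons. Because this gene begins with ATG, the standard codon assignments are enough for a sketch. The Python code below is a minimal illustration and not the database's own pipeline; `CODON_TABLE` and `translate` are names chosen here.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering
# (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene give the first 10 residues of the protein.
print(translate("ATGTCATTGC CTCAAGCCAC TTCCTCCATT"))  # MSLPQATSSI
# Last 18 bases of the gene, ending in the TAG stop codon.
print(translate("ATCCTCAAGT CGGAGTAG"))               # ILKSE
```

The sketch does not handle ambiguous bases such as N, nor the table 11 rule that an alternative start codon such as GTG or TTG is read as methionine at the initiation position.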