Gene Plav_1761 details


Gene Information

Locus tag: Plav_1761
Symbol:
ID: 5455622
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 1913585
End bp: 1914997
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table: 11
GC content: 57%
IMG OID: 640877342
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001413037
Protein GI: 154252213
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID: [TIGR03229] benzoate 1,2-dioxygenase, large subunit
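
The start, end, and length fields above can be cross-checked with a short calculation. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS is unspliced and ends in a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1913585, 1914997)
print(length, protein_length_aa(length))  # prints: 1413 470
```

Under these assumptions the result agrees with the listed gene length (1413 bp) and protein length (470 aa).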


Plasmid Coverage information

Num covering plasmid clones: 39
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.104126
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTACAT CCGGCGTGAG CAACCCATTC GATCGCATTC GCCACGGTCT CGATTCATGG 
CTTGTCGAGG ATACGTCGAG CGGGCGCTTC ATGCTCGACC GCGCCATATT CACCGATCCC
GAGTTGTTCG CTCTTGAGAT GAAGCACATC TTCGAGCGCA ACTGGGTGTT TCTCGCCCAT
GAAAGCCAGC TCGCCAACGA AAGCGCCTTT CTGACCCTGA CAATCGGTCG TCAACCTGTG
CTTCTGACCC GGAACCGGTC GGGCGAATTG AAATGCTTTA TAAATGCATG CACGCATCGC
GGCGCGCGGC TGTGCCGGGA AAAACGGGGC AGGCGAGCGA CATTCACATG CCCGTTTCAC
GGCTGGACGT TCAGCAACGA TGGCGCGCTG CTGAATGTCA CAGACGAGGA AAATGGCGCA
TATCCGGAAC ATTTCGACCG GAGCGATCTG GGTCTCCGCG AAGTCGGACG CCTGGAATCC
TACAAGGGAT TCGTTTTCGC CAGCCTTTCA GAGGACGTGC CACCGCTGAG CGATTACCTT
GCGGGTACGA AGACCTTCAT CGATCTCATG GTGGATCAGT CGCCGGAAGG GAAGCTTGAA
ATCATCCGCG GGGTCGGGCG GTATACCTAT CGCGGCAACT GGAAGATGCA GACCGAGAAC
GGTCTCGACG GGTATCACGT GCCGACCGTG CATTCCAACT ACCTGTTGAC CGTCGCAAAC
CGAATGACGG GCGGTTCAAC GAACACCACC AAGAGCGCAA ACGTATCCGC GTGGATGAAG
ACGGATCAGG GCGGTTTTTT CTCATTCGAT CACGGACACG CTTTGATCTG GAACGCGTCG
GCCAGTTCTG CCTCCCGGCC CAATTTTGAG TTTGTCGACC AGTACAAGCG GAAATTCGGC
GAGGAGCGCT CCCGCTGGAT GACCGAAACC ACACGAAATC TTCTTTTGTT TCCGAATGTG
TTCCTGATGG ATCAGGTCAG CACACAAGTC CGTATCGTGC GTCCGATCTC GGTCGATGAA
ACGGAAATAA CGACATATTG CATCGCGCCG GTTGGAGAAA GTGCGCGGGC GCGGGCGATG
CGCATTCGAC AGTATGAGGA TTTCTTCAAT GCCAGCGGCA TGGCCACGCC CGACGATTTG
ACGGAGTTCA ACAATTGCCA GATCGCGTAT GGCCGCGATG GTGGTCGCTA TAACGATTTG
TCGCGCGGGG CAACGAGAAG CGTTCAAGGG TCGGGGAAGT TTGGCAAGGG GCTCAACATC
GACGCGACCA TGAGCAGCAT CAATGTCGCC GATGAAGGAC TTTTCGTCAC CATGCACAAG
GAGTGGATCG AGCGGATGAA ACACGCGATC GCGCAGGAGC GGGCGGAAGC GTCCCTCACT
CGCGAAAACG ATCGTTCGAG GGCCGCAGAA TGA
 
Protein sequence
MSTSGVSNPF DRIRHGLDSW LVEDTSSGRF MLDRAIFTDP ELFALEMKHI FERNWVFLAH 
ESQLANESAF LTLTIGRQPV LLTRNRSGEL KCFINACTHR GARLCREKRG RRATFTCPFH
GWTFSNDGAL LNVTDEENGA YPEHFDRSDL GLREVGRLES YKGFVFASLS EDVPPLSDYL
AGTKTFIDLM VDQSPEGKLE IIRGVGRYTY RGNWKMQTEN GLDGYHVPTV HSNYLLTVAN
RMTGGSTNTT KSANVSAWMK TDQGGFFSFD HGHALIWNAS ASSASRPNFE FVDQYKRKFG
EERSRWMTET TRNLLLFPNV FLMDQVSTQV RIVRPISVDE TEITTYCIAP VGESARARAM
RIRQYEDFFN ASGMATPDDL TEFNNCQIAY GRDGGRYNDL SRGATRSVQG SGKFGKGLNI
DATMSSINVA DEGLFVTMHK EWIERMKHAI AQERAEASLT RENDRSRAAE
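
The relationship between the gene and protein sequences above can be checked with a small translation routine. This is a minimal sketch, not the database's own pipeline: it assumes the amino-acid assignments of NCBI translation table 11 (identical to the standard code for internal codons), ignores alternative start codons (this gene begins with ATG), and uses hypothetical helper names `clean`, `translate`, and `gc_percent`.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino-acid
# assignments are those of the standard code, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First display row of the gene sequence above.
first_row = "ATGAGTACAT CCGGCGTGAG CAACCCATTC GATCGCATTC GCCACGGTCT CGATTCATGG"
print(translate(first_row))  # prints: MSTSGVSNPFDRIRHGLDSW
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way allows a comparison with the listed protein sequence and GC content.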