Gene Plav_0159 details


Gene Information

Locus tag: Plav_0159
Symbol:
ID: 5456503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 173954
End bp: 174976
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 58%
IMG OID: 640875720
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001411439
Protein GI: 154250615
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
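
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the reported gene length) and that the last codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 173954
end_bp = 174976
gene_length_bp = 1023
protein_length_aa = 340

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons. The final codon is
# assumed to be the stop codon, which is not translated.
codons, remainder = divmod(span, 3)
translated_codons = codons - 1

print(span, remainder, translated_codons)  # prints: 1023 0 340
```

Under these assumptions the span matches the reported gene length, and the number of translated codons matches the reported protein length.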


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 68
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGATA CCCTTGTCAA ATCCAGCCCG GAAACACTGC CCGCCGATTG GTACTGGAGC 
GAGGACGCTT GGCAAATAGA ACGCCGTGAA ATCTGGGCAA AGCACTGGTT CCTGATAGCC
CGTACGCAAC AGTTCGATGT AGCGGGCGAT TATGTTTCAG CGACGTTTGC GGGCTACCCG
ATTTTCGCGA TCAAAGGCAA GGACGGCTTG GTGCGCGCCT TTCACAATGC CTGCCGCCAC
CGCGCTTCCC CCTTGGTGCA AGAATGCGCG GGACATGTGG AGCGACTCAG TTGCCCCTAT
CACCGCTGGC TTTACGATTT CGAGGGCCGG CTGCGTGGCG CCCCCACCAT GGAGATCGAC
AAGGAGGAGT ACGGACTCTT TCCCATCCGC TGCGAGATAT GGCGCGGACT GGTCTTCGTG
TCGCTCGATC TTGCGGCGAT GCCGCTCTCC GGCTGGCTGG CAGATATCGC CACCGCGGCG
CTACGCTATC CCCTGGAGGA GATGCAGCTC GCTCGCGAGT TCACGATCGA GGCGGATGTG
AACTGGAAAA CCTATGGCGA CAACTATGCC GAGGCATGGC ACATACCTAC GATCCATCCG
GGGCTCAACG CAGCGATTGA CATGGCGAGC TATAAAATCG CGACGGTAGG ACATACGCTG
CAGAGCCACA TGGCGGACGC GCGTGACGGC GGCAAGACGG ATGGCTTCTG GGTATGGCGG
CTGCCGGGGC TCTTTTTCAA TATGTACAAC TGGGGAATGA ACGTCGCGCA GCTCGAACCT
TTGGGGCCTC GCCGCATGAA GCTGACCTAT CGCTACTTCG TGAAGGACCT CGATCCGACG
AAGCAAGCAG AGCGAGATGC ACTTATCGAC TGGGCCTACA TGGTGGCAAA GGAAGACATC
GATATCTGCA TTGCCGTACA AAGAAATCTC GAAACCGGCA TCTACGAACG GGGACGGCTT
TCCTCCGTTC ACGAGAATGG TGTCATCCAG TTTCAGGAAA TGGTGGCCTC GGCGCATCGT
TGA
 
Protein sequence
MSDTLVKSSP ETLPADWYWS EDAWQIERRE IWAKHWFLIA RTQQFDVAGD YVSATFAGYP 
IFAIKGKDGL VRAFHNACRH RASPLVQECA GHVERLSCPY HRWLYDFEGR LRGAPTMEID
KEEYGLFPIR CEIWRGLVFV SLDLAAMPLS GWLADIATAA LRYPLEEMQL AREFTIEADV
NWKTYGDNYA EAWHIPTIHP GLNAAIDMAS YKIATVGHTL QSHMADARDG GKTDGFWVWR
LPGLFFNMYN WGMNVAQLEP LGPRRMKLTY RYFVKDLDPT KQAERDALID WAYMVAKEDI
DICIAVQRNL ETGIYERGRL SSVHENGVIQ FQEMVASAHR
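
The sequences above are grouped into blocks of ten residues separated by spaces, so they need to be cleaned before any downstream use. The following is a minimal sketch of that cleanup together with a GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as a short sample.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks that group residues into blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in ("G", "C"))
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a short sample.
first_line = "ATGTCCGATA CCCTTGTCAA ATCCAGCCCG GAAACACTGC CCGCCGATTG GTACTGGAGC"
sample = clean_sequence(first_line)

print(len(sample), sample[:3])        # length of the sample and its start codon
print(round(gc_percent(sample), 1))   # GC percentage of the sample only
```

Applying the same two helpers to the full gene sequence would allow a comparison with the Gene Length and GC content fields of the record.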