Gene Plav_3021 details


Gene Information

Locus tag: Plav_3021
Symbol:
ID: 5456080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 3222268
End bp: 3223458
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 63%
IMG OID: 640878609
Product: putative deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_001414285
Protein GI: 154253461
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
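
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3222268
end_bp = 3223458
gene_length_bp = 1191
protein_length_aa = 396

# Assumption: coordinates are 1-based and inclusive, so the span
# covers end - start + 1 bases.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span_bp, 3)

# Assumption: the last codon is a stop codon and adds no residue.
expected_protein_length = codons - 1

print(span_bp, remainder, expected_protein_length)
```

Under these assumptions the span equals the reported gene length and the codon count minus one equals the reported protein length.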


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0844637
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 57
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGCCGATAG CCAATCCCGC CGCGACCGCG CCCTATGCCA CGCGAGCCGA GGAGACGCGA 
GGACGGCTCT TTCAGGAGCC GGAAAGCGCC ACCCGCACGG CCTTTCAGCG GGACCGCGAC
AGGATCATTC ATTCCGGCGC CTTCCGGCGC CTCAAATACA AGACGCAGGT TTTCGTCTAT
CACGAAGGCG ACAATTACCG GACCCGCCTG TCTCACTCTC TCGAAGTGTC GCAAATCGCG
CGGTCCGTCG CACGTGTGTT TGGTCTCGAT GAAGACCTCT CGGAGACGCT GGCGCTGGCG
CACGATCTGG GCCACACGCC CTTCGGCCAT GCCGGCGAAA CCGCGCTGGA CAGTTGCATG
CGCGACTTCG GCGGCTTCGA TCACAACGCC CAGACGCTTC GCATCGTCAC CAAGCTCGAA
CATCGCTATG CGCGTTTCGA TGGTCTCAAT CTCACCTGGG AAACGCTGGA AGGGCTCGTG
AAGCACAATG GGCCGGTGGT GACGCCGGGC CGCAGCATCG CGGATTTGCC ACGCGCCATT
GCCGAATATG CGGAGACGCA GGATCTCGAA CTGGCCACCT ATGCCGGCCC GGAAGCACAG
GTCGCTGCGC TGGCCGACGA CATTGCCTAC AACAACCACG ACATCGATGA CGGGCTTCGT
GCCGGCCTTT TCGACATCGA GGACCTGATG GCTCTGCCGC TCGTTGGCGA TGTGTTTCAG
CGCGTGATGG ATCGCTATCC AGGCCTCGAA ACCACGCGTG TGATCCATGA GGCAGTGCGC
GAGCTTATAG GCACGATGAT CGAGGACCTT CTCAGCGAGA CCAGAAGCCG CCTTGCCGAG
GCCCGGCCCC GATCGGCGGC GGATGTCCGC GCGATGAGCC GGCCGCTGGT CGGCTTCACG
GCGGAAATGA CGGAGCACAA TGCGGCCCTC AAGGCGTTCC TGTTCGAGCG CATGTACCGG
CACTACCGGG TCAACCGTTC CATGAGCAAG GCGCAGCGGA TCGTCCGCGA CCTGTTCTCC
TTGCTCCATG GAGAGCCGGA TCAGTTGGCG CCGGAATGGC AGGCAGGCTG CGACGGGCCC
GGCGGCATCA AGACGGCCCG GCGGGTCTGC GATTTCATCG CCGGAATGAC CGACAAATTC
GCCATTGAGG AGCATGCACG GCTCTTCGAC CTCCACGACC CCCGCGCTTG A
 
Protein sequence
MPIANPAATA PYATRAEETR GRLFQEPESA TRTAFQRDRD RIIHSGAFRR LKYKTQVFVY 
HEGDNYRTRL SHSLEVSQIA RSVARVFGLD EDLSETLALA HDLGHTPFGH AGETALDSCM
RDFGGFDHNA QTLRIVTKLE HRYARFDGLN LTWETLEGLV KHNGPVVTPG RSIADLPRAI
AEYAETQDLE LATYAGPEAQ VAALADDIAY NNHDIDDGLR AGLFDIEDLM ALPLVGDVFQ
RVMDRYPGLE TTRVIHEAVR ELIGTMIEDL LSETRSRLAE ARPRSAADVR AMSRPLVGFT
AEMTEHNAAL KAFLFERMYR HYRVNRSMSK AQRIVRDLFS LLHGEPDQLA PEWQAGCDGP
GGIKTARRVC DFIAGMTDKF AIEEHARLFD LHDPRA
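
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is given 5' to 3' on the coding strand and uses the amino-acid assignments of translation table 11, which match the standard code and additionally allow alternative start codons such as GTG to be read as methionine. The function name `translate_cds` and its parameter are hypothetical helpers written for this example; only the first 30 and the last 51 bases of the gene sequence are used.

```python
from itertools import product

# Amino-acid assignments for translation table 11, in TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for table 11; each is read as M at position 1.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, first_codon_is_start=True):
    """Translate a coding-strand DNA string; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    if first_codon_is_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First 30 bases of the gene sequence (starts with GTG).
n_terminus = translate_cds("GTGCCGATAG CCAATCCCGC CGCGACCGCG")
# Last 51 bases of the gene sequence (ends with the TGA stop codon).
c_terminus = translate_cds(
    "GCCATTGAGG AGCATGCACG GCTCTTCGAC CTCCACGACC CCCGCGCTTG A",
    first_codon_is_start=False,
)
print(n_terminus)
print(c_terminus)
```

The two fragments correspond to the first ten and the last sixteen residues of the protein sequence, with the trailing `*` marking the stop codon that is not part of the 396 aa protein.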