Gene Plav_2071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2071 
Symbol 
ID5456868 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2256428 
End bp2257537 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content67% 
IMG OID640877648 
Productphosphoribosylaminoimidazole carboxylase, ATPase subunit 
Protein accessionYP_001413342 
Protein GI154252518 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value4.39377e-16 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGCGACCA TTCCACCCGG CGGGACCATC GGCATTCTCG GCGGCGGCCA GCTTGGGCGG 
ATGCTGGCGA TGGCGGCGGC GCAGCTGGGG CTCGCCACGC ATATCTATTG CCCGGATGAG
GAATTGCCCG CGGCGGATGT GGCCGGCGAG GTCACGCGCG CGGCTTATGA CGACGAGGCG
GCGCTGGTGC GCTTTGCCGG GAGCGTCGAG GTCGTCACTT ATGAATTCGA GAATGTGCCG
GCGGAGACGG CGCGCATTTT GAGCGAGCGC GGCATCGTGC GGCCTGGGCC GCTTGCGCTG
GCGACGGCAC AGGACCGCGT GGTCGAAAAG AATTTCCTCG TGAGCCACGG CATCGCGACC
GCACCCTTCG CGGATGTGGC GGACGAAGCG GGCCTGCGCA GCGCCATGGA AGCGATCGGC
ACGCCGTCGA TCCTGAAGAC GCGGCGCTTC GGCTATGACG GCAAGGGACA GGCGAAGATC
GCATCGGCGG CGGACGCGCT TGCCGCCTAT GACGAGATCG GCCGCGCCCC CGCCATCCTC
GAAGGCTTCG TGCCCTTCGA ACGGGAAATC TCCGTGATCG TCGCACGCGG GCTCGATGGA
CGGACGGCGG CTTACGATCC CGTCGAGAAC ATCCACAAGA ACCACATTCT CGACCGCACG
CTGGCGCCCG CCGCGCTGAC CAGGGCGCTT TCCGACGAGG CCTGCGCAAT CGCGGCGCGC
ATCGTCTCCG AACTCGATTA TGTGGGCGTG ATGGGCGTCG AGCTGTTTCT GCTGCCGGAA
AGCGGAAGCA AGAGGCGGCT GCTCGTCAAC GAGATCGCGC CGCGCGTCCA CAATTCCGGC
CACTGGACGA TGGATGCCTG CGCAGTGAGC CAGTTCGAGC AGCATATTCG CGCGATCTGC
GGCTGGCCGC TTGGAAGCCC GGCGCGCCAC TCCGACGCGG TGATGACCAA TCTGATCGGC
GAAGAGGCGG CGGATTGGGC GCGGCTCGCG GCGACGCCGG ACACGGCCCT CCATCTCTAC
GGCAAGCGGG AAGCCCGGCC CGGCCGCAAG ATGGGCCATG CGACAAGGCT TTACCCGCTC
GGAACACGGC CGCCGGTTAC GCCTTCTTAA
 
Protein sequence
MATIPPGGTI GILGGGQLGR MLAMAAAQLG LATHIYCPDE ELPAADVAGE VTRAAYDDEA 
ALVRFAGSVE VVTYEFENVP AETARILSER GIVRPGPLAL ATAQDRVVEK NFLVSHGIAT
APFADVADEA GLRSAMEAIG TPSILKTRRF GYDGKGQAKI ASAADALAAY DEIGRAPAIL
EGFVPFEREI SVIVARGLDG RTAAYDPVEN IHKNHILDRT LAPAALTRAL SDEACAIAAR
IVSELDYVGV MGVELFLLPE SGSKRRLLVN EIAPRVHNSG HWTMDACAVS QFEQHIRAIC
GWPLGSPARH SDAVMTNLIG EEAADWARLA ATPDTALHLY GKREARPGRK MGHATRLYPL
GTRPPVTPS