Gene Plav_1096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1096 
Symbol 
ID5456690 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp1201654 
End bp1203048 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content65% 
IMG OID640876666 
Productphospho-2-dehydro-3-deoxyheptonate aldolase 
Protein accessionYP_001412374 
Protein GI154251550 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3200] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase 
TIGRFAM ID[TIGR01358] 3-deoxy-7-phosphoheptulonate synthase, class II 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.966899 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGATA CCTGGAAACC GGAGAGCTGG CGCGCCAAAC CCGCGAAGCA TCTCCCGAGC 
TACCCGGATG AAGCGGCTCT GGCCGCCGTG GAAGCGCGCC TGCGCTCCTA TCCGCCGCTC
GTTTTTGCAG GTGAGGCCCG CAAGCTGAAG GCCGACCTCG CCGAGGTCTG CGAGGGCCGT
GCATTCCTCC TCCAGGGCGG TGATTGCGCC GAAAGCTTCG CCGAATTCTC CGCCGACAAT
ATCCGCGACA CCTTCCGCGT GCTGCTGCAG ATGGCGGTTG TGCTCACCTT CGCCGCCGCC
TCGCCGGTCG TGAAGGTCGG CCGCATCGCC GGCCAGTTCG CGAAGCCGCG TTCGTCCGCC
ACCGAAACCA TCGGCGACGT GACGCTGCCG TCCTATCTCG GCGACAACAT CAACGGTATC
GAGTTCGACG AGAAATCGCG CGTGCCCGAT CCCGAAAGGC TGCTCCGCGC CTATTCGCAA
TCGGCCTCGA CGCTGAATCT CATTCGCGCC TTCGCGAATG GCGGCTATGC CGATCTCGAT
TTCGTGCATC GCTGGAATCT GGGCTTCGTC TCCGACAGCG CCGAAGGTGC GCGCTACGAG
GAACTGGCGA ACCGCATCAC GGAAGCGCTC GATTTCATGC GCGCCTGTGG CATCGACAGC
GCCACGCAGC CCCAGCTTCA CACGACGGAT TTCTACACCA GCCACGAGGC CCTGCTGCTC
GGCTACGAAC AGGCGATGAC GCGCATCGAC AGCACGACGG GCGATTGGTA CGACACCTCC
GCCCACATGC TCTGGATCGG CGACCGCACG CGCCAGCCCG ATCACGGCCA TGTCGAATAC
ATGCGTGGCA TCAAGAACCC GATCGGCATG AAATGCGGCC CCTCCCTCGA TCCCGAGGAG
CTTGTGCGTC TCACCGACAT TCTCAATCCG AAGAACGAGC CGGGCCGCTT GACGCTCATC
TGCCGCTTCG GCGCCGAGAA TGTCGAGAAG CACCTGCCGC AGCTCATCCG CGCCATCGAG
CGCGAGGGCA AGAAGGTGGT CTGGTCCTGC GACCCGATGC ACGGCAACAC CATCAAGGCG
TCGTCCGGCT ACAAGACGCG GCCCGTGGAC CGCATCCTCG CCGAAGTACA GGCCTTCATG
GCCGTGCACC GCGCCGAAGG CACCCATGCC GGCGGCGTCC ATTTCGAAAT GACCGGCCAG
AACGTCACCG AATGCATCGG CGGCGCGCAG GCGATTTCGG AAACGCAACT CGGCGACCGC
TACCACACGC ATTGCGACCC GCGCCTCAAC GCCAGCCAGT CGCTGGAACT CGCCTTCCTC
ATCGCGGAAG GCCTGAAGAA GGAGCGCCTG GAAGCCCTGC GCGCCGAACC CGTCGCCGCC
CTCGGCGCCT GGTAA
 
Protein sequence
MADTWKPESW RAKPAKHLPS YPDEAALAAV EARLRSYPPL VFAGEARKLK ADLAEVCEGR 
AFLLQGGDCA ESFAEFSADN IRDTFRVLLQ MAVVLTFAAA SPVVKVGRIA GQFAKPRSSA
TETIGDVTLP SYLGDNINGI EFDEKSRVPD PERLLRAYSQ SASTLNLIRA FANGGYADLD
FVHRWNLGFV SDSAEGARYE ELANRITEAL DFMRACGIDS ATQPQLHTTD FYTSHEALLL
GYEQAMTRID STTGDWYDTS AHMLWIGDRT RQPDHGHVEY MRGIKNPIGM KCGPSLDPEE
LVRLTDILNP KNEPGRLTLI CRFGAENVEK HLPQLIRAIE REGKKVVWSC DPMHGNTIKA
SSGYKTRPVD RILAEVQAFM AVHRAEGTHA GGVHFEMTGQ NVTECIGGAQ AISETQLGDR
YHTHCDPRLN ASQSLELAFL IAEGLKKERL EALRAEPVAA LGAW