Gene Plav_1915 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_1915 
Symbol 
ID5455078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2084720 
End bp2085787 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content60% 
IMG OID640877492 
Productpolysaccharide pyruvyl transferase 
Protein accessionYP_001413187 
Protein GI154252363 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5039] Exopolysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACC TTTCCGCACC GCGCCATGCA GGCATGACGG CTCTTGCCGC GCGGCATGAC 
GAGATTGCCG GACTTGTCCC CGGCGACAGG CGCATCGCCT ATCTTGACTA CCCGCTCGGC
AAGAATGTCG GCGACCTGCT CATCATGCTG GGCACGCTGA AATTCTTCGA GCGGCATGGG
CTGAAAGTGC GGTTGTCGCG GACGCTGAAG AACACGCCGC CGCGCGGCCG CCTGCCGATC
GGTGAGGGGG ATACGATCGT CCTGCAGGGC GGCGGCAATT TCGGCGATCT CTACCCGCAT
ATCCAGAATT ACCGCGAGCG CATCATCGAG GAATATCCAG ACCACAAGGT TCTGATTTTT
CCGCAAACGA TCTTCTTCAA GGACAAGGAG AAGCTCAAGC GGTCCGCCGA GAAAATGATG
AGGCATCCCG ATCTCACATT TTTTGTCCGC GACAGGGCAA GCGAGGCGCT GGCGCGGCCG
CTGTTCGGCG AAAAGGTCAG GCTGGTGCCC GACATGGCGC ATCAGCTCTG GCCGGATCTC
CATCAGCGGA TCGGGAGTCG AGGAGAACAG GCGTCAAATC CGCTGTTCTT CATTCGAAAG
GACGAAGAAG CGGGGGATTC CTTCGCGGCG ATCGAGGCGC ACAAGGCGCA ATTTCTCGAC
TGGGAGGATG TGAACTACAC ATCCTTGCGC GTCTGGAAGC GCGTGTTTGA CGAACTCGCG
CAGCTGGAGC TGCGGCTGGG CTTCTCCTTC GGACTGGAAG CCGCCTACTT CAATATCATC
CGCGGCGAGG TGGATAAAGT TGCCCGCCGG ATGGCTCGGC ATGATGTGTG GATTACGTCG
CGCCTGCACG GCTTTATCCT CGGACTGCTG CTCGGCAAGC CGGTCTTCGC CATCGACAAC
AGCTACGGCA AGCTTTCGTC CTATGTGGAG ACGTGGCGGG AAGATATTCG CGACATCAAG
CTTCTCTCCG GCGAAACCGA TGCTGAAGAG GCAATCGCCT TCCTCAACAA GGCGCGCGGG
CTCGACCGCC ATGCTCTCTG GCAGGCCTAT AGCGACACTT GCCGGTAA
 
Protein sequence
MSDLSAPRHA GMTALAARHD EIAGLVPGDR RIAYLDYPLG KNVGDLLIML GTLKFFERHG 
LKVRLSRTLK NTPPRGRLPI GEGDTIVLQG GGNFGDLYPH IQNYRERIIE EYPDHKVLIF
PQTIFFKDKE KLKRSAEKMM RHPDLTFFVR DRASEALARP LFGEKVRLVP DMAHQLWPDL
HQRIGSRGEQ ASNPLFFIRK DEEAGDSFAA IEAHKAQFLD WEDVNYTSLR VWKRVFDELA
QLELRLGFSF GLEAAYFNII RGEVDKVARR MARHDVWITS RLHGFILGLL LGKPVFAIDN
SYGKLSSYVE TWREDIRDIK LLSGETDAEE AIAFLNKARG LDRHALWQAY SDTCR