Gene Plav_2621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2621 
Symbol 
ID5454260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2827161 
End bp2828699 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content62% 
IMG OID640878198 
Productsulfatase 
Protein accessionYP_001413886 
Protein GI154253062 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0161574 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.118013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGGA AAATTCTTTT CATCACCACC GATCAGATGC GCTTCGATGC CATCGGCGCG 
AATGGTCAGA AGGTCGCGCG CACACCCGCC ATCGACGCGC TGGCAAAAGC CGGCATCAAC
TACACCCGCG CGCATAATCA GAACGTCGTC TGCATGCCCG CCCGCTCCAC CATGATCACC
GGGCAATATG TGTCGACGCA TGGCGTCTGG ATGAACGGCG TGCCGCTTCC CGTCGATGCG
CCCTCCGTCG CGCAATATCT CAACGAAAAA GGCGGCTACA AGACGGCGCT GATCGGCAAG
GCGCATTTCG AGCCCTTCCT CGATCTCCAT CAGCAATTCT ACGAAAGCCA GATGGCGCGG
CGAGGCGAAA ACGGTCCGCA TCGCGGCTTC GACTACATGG AGCTCGCCAC GCATTCGCCG
CTCATCCTTC ACTACAATGA ATGGATGAAG AAGAACGAAC CCGAGGCGCT CAATTATTTC
TACCAGAACC TCAACGACAA GTTTCAGGTG AACGCTGCCG GCGGCGGCGA GACCGGCGGC
TGCCAGCTCC ATTTCAACAA GATCGCGCGC GAGCACTACC ACACCGACTG GGTCGCCGAC
CGCACCATCG ACTGGCTCGC CTCCGTCGGC GCAGGCGACG ACTGGTTCTG CTGGATGAGC
TTCCCCGATC CGCACCACCC GTGGGACCCG CCGCAATCCG AACTTCACCG TCATCCCTGG
CGCGATACGC CGCTGCCGGA ATTCTATCCG GGCTCGAAGG AAAAGATCGA AGCCGTCCTC
GCGGACAAGC CGCGCCACTG GATGGAATGG TACACCGGCG AGCGCGTGAC GAACTTCGAA
GCCCCGCCCG AATTCCGCGC GCAGGACATG ACCGCCGATC AGGTGCAGGA GATCAACGCC
TTCACCCATG TCGAAAACGA ATTGATCGAC GAAGCCATCG CGAAAGTCAT GGCCTATGTC
GAAAAGCGCG GCTGGGGCGA TGATGTCGAT GTCGTCTTCA CCACCGACCA CGGCGAATTC
CAGGGCGAAT TCGGCCTGCT CTTCAAGGGC CCCTATCACG TCGATGCGCT GATGCGCCTC
CCCATGATCT GGCGCCCCGC GAAATCCGCG AAGGTCGCGC CCGCCGCCGT CGAAAAACCC
GTCGGCCAGG TCGACCTCGC GCCCACCTTC TGCGAAATCG CCGGCCTCCC CGTGCCCGAA
TGGATGCAGG GAAAGCCGAT GCCGAAAACC GATGCCGAAG GCGACGCCCA GGGCCGCGAG
CGCGTCTTCA CCGAATGGGA CTGCAAACAT GTCGACGGCA CCACCGTCGG CCTCCGCACC
ATCTATCGCG ACGGCTACAC CATCACCGCC TATCTCCCCG GCACCATCTA CGACGGCAGC
GAAGGCGAGC TTTACGACCA CGCCAACGAT CCGCGGCAGT TCCGCAACCT CTGGAACGAC
CCGGCCTACG CCAAGCTGAA ATCCGATCTT CTCGCCGATC TGAAAGACAA CCTCCCCCCC
GTCCGCGACC CCCAGCTCGA ATACGTCGCC CCTGTTTAA
 
Protein sequence
MGRKILFITT DQMRFDAIGA NGQKVARTPA IDALAKAGIN YTRAHNQNVV CMPARSTMIT 
GQYVSTHGVW MNGVPLPVDA PSVAQYLNEK GGYKTALIGK AHFEPFLDLH QQFYESQMAR
RGENGPHRGF DYMELATHSP LILHYNEWMK KNEPEALNYF YQNLNDKFQV NAAGGGETGG
CQLHFNKIAR EHYHTDWVAD RTIDWLASVG AGDDWFCWMS FPDPHHPWDP PQSELHRHPW
RDTPLPEFYP GSKEKIEAVL ADKPRHWMEW YTGERVTNFE APPEFRAQDM TADQVQEINA
FTHVENELID EAIAKVMAYV EKRGWGDDVD VVFTTDHGEF QGEFGLLFKG PYHVDALMRL
PMIWRPAKSA KVAPAAVEKP VGQVDLAPTF CEIAGLPVPE WMQGKPMPKT DAEGDAQGRE
RVFTEWDCKH VDGTTVGLRT IYRDGYTITA YLPGTIYDGS EGELYDHAND PRQFRNLWND
PAYAKLKSDL LADLKDNLPP VRDPQLEYVA PV