Gene Plav_2519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlav_2519 
Symbol 
ID5455757 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameParvibaculum lavamentivorans DS-1 
KingdomBacteria 
Replicon accessionNC_009719 
Strand
Start bp2721016 
End bp2722539 
Gene Length1524 bp 
Protein Length507 aa 
Translation table11 
GC content64% 
IMG OID640878096 
Productsulfatase 
Protein accessionYP_001413785 
Protein GI154252961 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3119] Arylsulfatase A and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.602817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.0521471 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACC AGCTCGACAT TCTTTTCATC ACCGCCGACC AGTGGCGCGG CGAGTGCCTC 
TCCGCCCTCG GCCACCCGAT GGTAAAGACC CCGAACCTCG ACGCGCTGGC GGCGGACGGC
GTTCTCTTCA AGCGGCACTA TGCGAACGCC GTTCCCTGCG GCCCGTCGCG CGCCTCGCTT
CACACCGGCA TGTATCTCCA GAACCACCGC TCCGGCACCA ACGGCACACC GCTCGACGCG
CGCCATACCA ACTGGGCGAA GGAGGCGGCG CGCATCGGCT ACGACCCCGT GCTCTTCGGC
TATACGGATA CGAGCCAGGA TCCGCGCGAG GAAGACCCGG AAAGCCCGTG GCTTAGGACT
TATGAGGGCC CGCTCCCCGG CATCCGCCCC GTCTGCATGA TGGGCACATG GCCGACGCCC
TGGACAAACT GGCTGAAGGA AGAGGGCTAC GAAGTGCCGG AGGACATCCG CTTCGCCTAT
GGCACGCGGA CACCGGGCGA CGATTATGAG GACGGCGCGC CCGTGCCGCG CCCGCTCATC
TATCCTCAGG AGGCGGACGA TACGAGCTTC CTCACCAACC GGCTGATGGA CTACATCGCG
GAAACGAAGG GGCGCTTCGT CGCGCATCTC TCGCTGCTGC GCCCTCACCC GCCCTTCGTC
GCCTCGGAAC CCTGGAACGC GATGTACGAC CCGGAAGCAG TGCCGGGCTT CACGCGCAAG
GAGAAGCCCG CGGATGAAGC GGAGCAGCAC CCCTGGCTCG AACATCAGCT CGGCCGGAAA
CTTTACCGCG CGCCCGGAAA CGAAAGGAAA CTGCGGCGCA TGAAGGCCGT CTATTACGGG
CTGATGTCGG AAGTGGACGC AGCCCTCGGC CGCGTCTTCG ATTTCCTGAA AGCGAGCGGC
CGCTGGAACC GCACGCTCAT CATCTTCACG TCCGATCACG GCGAACAGAT GGGCGATCAC
TGGCTGCTCG GCAAATGCGG CTACTTCGAT GCCTCCTATC GCATTCCGCT GATCATCCGC
GATCCGCGCA AGGCGGCGGA CGGCGCGCGC GGAAGCGTGG TCGACCGCTT CACGGAGAAT
GTCGACATCA TGCCAACCAT GCTCGAACTC ATCGGCGCGG AGATACCGGT GCAATGCGAC
GGCGCATCGC TCCGCCCCTT CCTCGAGGCG CGCGAACCCA CGACATGGCG GCGCGAGGCG
CATTGGGAAT TCGACTTCCG CGACCCTGCC GACGACAGCG CCGAGAAGCG GCTCGGCCTC
ACCATGCATC AATGCACGAT GAACATCATT CGCGACGAGA AATACAAATA TGTCCACTTC
ACGAAACTGC CGCCGCTCTT CTTCGATCTC GAAAAGGACC CGGACGAATT CGTGAACCGC
GCCACCGACC CGGACTACCT GCCGCTGGTG CTGGAATACG CCCAGAAGCT CCTCTCCTGG
CGCATGAACC ACGACGAGCA GACCCTGACC CATATCGCAA TCACCGATGA CGGCCCGGTC
GAACGCCGCG CCGCGAAATA TTGA
 
Protein sequence
MTDQLDILFI TADQWRGECL SALGHPMVKT PNLDALAADG VLFKRHYANA VPCGPSRASL 
HTGMYLQNHR SGTNGTPLDA RHTNWAKEAA RIGYDPVLFG YTDTSQDPRE EDPESPWLRT
YEGPLPGIRP VCMMGTWPTP WTNWLKEEGY EVPEDIRFAY GTRTPGDDYE DGAPVPRPLI
YPQEADDTSF LTNRLMDYIA ETKGRFVAHL SLLRPHPPFV ASEPWNAMYD PEAVPGFTRK
EKPADEAEQH PWLEHQLGRK LYRAPGNERK LRRMKAVYYG LMSEVDAALG RVFDFLKASG
RWNRTLIIFT SDHGEQMGDH WLLGKCGYFD ASYRIPLIIR DPRKAADGAR GSVVDRFTEN
VDIMPTMLEL IGAEIPVQCD GASLRPFLEA REPTTWRREA HWEFDFRDPA DDSAEKRLGL
TMHQCTMNII RDEKYKYVHF TKLPPLFFDL EKDPDEFVNR ATDPDYLPLV LEYAQKLLSW
RMNHDEQTLT HIAITDDGPV ERRAAKY