Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_2519 |
Symbol | |
ID | 5455757 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 2721016 |
End bp | 2722539 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640878096 |
Product | sulfatase |
Protein accession | YP_001413785 |
Protein GI | 154252961 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.602817 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.0521471 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGACC AGCTCGACAT TCTTTTCATC ACCGCCGACC AGTGGCGCGG CGAGTGCCTC TCCGCCCTCG GCCACCCGAT GGTAAAGACC CCGAACCTCG ACGCGCTGGC GGCGGACGGC GTTCTCTTCA AGCGGCACTA TGCGAACGCC GTTCCCTGCG GCCCGTCGCG CGCCTCGCTT CACACCGGCA TGTATCTCCA GAACCACCGC TCCGGCACCA ACGGCACACC GCTCGACGCG CGCCATACCA ACTGGGCGAA GGAGGCGGCG CGCATCGGCT ACGACCCCGT GCTCTTCGGC TATACGGATA CGAGCCAGGA TCCGCGCGAG GAAGACCCGG AAAGCCCGTG GCTTAGGACT TATGAGGGCC CGCTCCCCGG CATCCGCCCC GTCTGCATGA TGGGCACATG GCCGACGCCC TGGACAAACT GGCTGAAGGA AGAGGGCTAC GAAGTGCCGG AGGACATCCG CTTCGCCTAT GGCACGCGGA CACCGGGCGA CGATTATGAG GACGGCGCGC CCGTGCCGCG CCCGCTCATC TATCCTCAGG AGGCGGACGA TACGAGCTTC CTCACCAACC GGCTGATGGA CTACATCGCG GAAACGAAGG GGCGCTTCGT CGCGCATCTC TCGCTGCTGC GCCCTCACCC GCCCTTCGTC GCCTCGGAAC CCTGGAACGC GATGTACGAC CCGGAAGCAG TGCCGGGCTT CACGCGCAAG GAGAAGCCCG CGGATGAAGC GGAGCAGCAC CCCTGGCTCG AACATCAGCT CGGCCGGAAA CTTTACCGCG CGCCCGGAAA CGAAAGGAAA CTGCGGCGCA TGAAGGCCGT CTATTACGGG CTGATGTCGG AAGTGGACGC AGCCCTCGGC CGCGTCTTCG ATTTCCTGAA AGCGAGCGGC CGCTGGAACC GCACGCTCAT CATCTTCACG TCCGATCACG GCGAACAGAT GGGCGATCAC TGGCTGCTCG GCAAATGCGG CTACTTCGAT GCCTCCTATC GCATTCCGCT GATCATCCGC GATCCGCGCA AGGCGGCGGA CGGCGCGCGC GGAAGCGTGG TCGACCGCTT CACGGAGAAT GTCGACATCA TGCCAACCAT GCTCGAACTC ATCGGCGCGG AGATACCGGT GCAATGCGAC GGCGCATCGC TCCGCCCCTT CCTCGAGGCG CGCGAACCCA CGACATGGCG GCGCGAGGCG CATTGGGAAT TCGACTTCCG CGACCCTGCC GACGACAGCG CCGAGAAGCG GCTCGGCCTC ACCATGCATC AATGCACGAT GAACATCATT CGCGACGAGA AATACAAATA TGTCCACTTC ACGAAACTGC CGCCGCTCTT CTTCGATCTC GAAAAGGACC CGGACGAATT CGTGAACCGC GCCACCGACC CGGACTACCT GCCGCTGGTG CTGGAATACG CCCAGAAGCT CCTCTCCTGG CGCATGAACC ACGACGAGCA GACCCTGACC CATATCGCAA TCACCGATGA CGGCCCGGTC GAACGCCGCG CCGCGAAATA TTGA
|
Protein sequence | MTDQLDILFI TADQWRGECL SALGHPMVKT PNLDALAADG VLFKRHYANA VPCGPSRASL HTGMYLQNHR SGTNGTPLDA RHTNWAKEAA RIGYDPVLFG YTDTSQDPRE EDPESPWLRT YEGPLPGIRP VCMMGTWPTP WTNWLKEEGY EVPEDIRFAY GTRTPGDDYE DGAPVPRPLI YPQEADDTSF LTNRLMDYIA ETKGRFVAHL SLLRPHPPFV ASEPWNAMYD PEAVPGFTRK EKPADEAEQH PWLEHQLGRK LYRAPGNERK LRRMKAVYYG LMSEVDAALG RVFDFLKASG RWNRTLIIFT SDHGEQMGDH WLLGKCGYFD ASYRIPLIIR DPRKAADGAR GSVVDRFTEN VDIMPTMLEL IGAEIPVQCD GASLRPFLEA REPTTWRREA HWEFDFRDPA DDSAEKRLGL TMHQCTMNII RDEKYKYVHF TKLPPLFFDL EKDPDEFVNR ATDPDYLPLV LEYAQKLLSW RMNHDEQTLT HIAITDDGPV ERRAAKY
|
| |