Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_2077 |
Symbol | |
ID | 5455213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | - |
Start bp | 2262530 |
End bp | 2263555 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640877654 |
Product | sulfate ABC transporter, periplasmic sulfate-binding protein |
Protein accession | YP_001413348 |
Protein GI | 154252524 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1613] ABC-type sulfate transport system, periplasmic component |
TIGRFAM ID | [TIGR00971] sulfate/thiosulfate-binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000000000402733 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00000000000000351214 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACTTTTT CAGATTTGAC CAAACGCTCC GCCGTTCTTC TGGCAGGCAT TTTCCTGCTG CTCGCCATGA CCTTCGAGGT CCGGGCCGAA ACCACGCTTC TCAACGTGTC CTACGACCCG ACGCGCGAGC TTTATCGCGA TTTCAACGCG GCCTTCGTGG AGCATTGGAA GAAGGAAACC GGCGAGACCG TTTCCATCGA GCAGTCGCAT GGCGGCTCGG GCAAGCAGGC CCGCGCGGTG ATTGACGGGC TCGAAGCCGA TGTCGTCACG CTGGCGCTGG CCGGCGACAT AGACGAGATC GCCGATGCGA CGGGCAAGCT GCCGAAGGAC TGGCAGAAGT CCCTCCCCTA CAACTCATCG CCCTATACCT CGACCATCGT CTTCCTCGTT CGCGACGGCA ACCCGGAAGG CATCAAGGAC TGGGACGACC TGGTGAAGCC GGGGATCGAA GTCATCACGC CGAACCCGAA GACCTCGGGC GGCGCGCGCT GGAACTACCT CGCGGCATAC GCCTATGCGC TCGAACATTC CGGCAATGAC GACGCGAAGG CGCGCGAATT CGTCGGCAAG CTTTTCAGGA ATGTGCCGGT GCTCGACACG GGCGCGCGCG GCTCCACCAC CACCTTCGTC CAGCGCGGCA TCGGCGACGT CTTCATTTCG TGGGAGAACG AAGCCTTCCT CGCCCAGAAG GAATTTCCCG GCAAGTTCGA GATCGTCGTG CCGACGCTCT CGATCCGCGC CGAGCCGCCC GTCGCAATCG TGACCGGCAA TACGGACAAG CGCGGCACGA CCAAGCTCGC CCGCGCCTAT CTCGAATATC TTTATTCCTC GACCGGCCAG AACCTCGCAG CGAAACATTT CTATCGCCCC GTGAAGCCGG AATTCGCCGA CAAGGAAGAC CTCAAGCGTT TCCCGACCGT GAAGCTCGTC TCGATCGATG ATGTCTTCGG CGGCTGGGTA AAGGCGCAGC CCGAGCATTT CGGCGACGGC GGCGTGTTCG ACCAGATTTA TCAGCCGGGC CGTTAA
|
Protein sequence | MTFSDLTKRS AVLLAGIFLL LAMTFEVRAE TTLLNVSYDP TRELYRDFNA AFVEHWKKET GETVSIEQSH GGSGKQARAV IDGLEADVVT LALAGDIDEI ADATGKLPKD WQKSLPYNSS PYTSTIVFLV RDGNPEGIKD WDDLVKPGIE VITPNPKTSG GARWNYLAAY AYALEHSGND DAKAREFVGK LFRNVPVLDT GARGSTTTFV QRGIGDVFIS WENEAFLAQK EFPGKFEIVV PTLSIRAEPP VAIVTGNTDK RGTTKLARAY LEYLYSSTGQ NLAAKHFYRP VKPEFADKED LKRFPTVKLV SIDDVFGGWV KAQPEHFGDG GVFDQIYQPG R
|
| |