Gene Plav_2077 details


Gene Information

Locus tag: Plav_2077
Symbol:
ID: 5455213
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 2262530
End bp: 2263555
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 62%
IMG OID: 640877654
Product: sulfate ABC transporter, periplasmic sulfate-binding protein
Protein accession: YP_001413348
Protein GI: 154252524
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1613] ABC-type sulfate transport system, periplasmic component
TIGRFAM ID: [TIGR00971] sulfate/thiosulfate-binding protein
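
The coordinate and length fields in this record agree with each other, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts the stop codon; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2262530, 2263555)
print(length)                  # 1026
print(protein_length(length))  # 341
```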


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000402733
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000000000000351214
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGACTTTTT CAGATTTGAC CAAACGCTCC GCCGTTCTTC TGGCAGGCAT TTTCCTGCTG 
CTCGCCATGA CCTTCGAGGT CCGGGCCGAA ACCACGCTTC TCAACGTGTC CTACGACCCG
ACGCGCGAGC TTTATCGCGA TTTCAACGCG GCCTTCGTGG AGCATTGGAA GAAGGAAACC
GGCGAGACCG TTTCCATCGA GCAGTCGCAT GGCGGCTCGG GCAAGCAGGC CCGCGCGGTG
ATTGACGGGC TCGAAGCCGA TGTCGTCACG CTGGCGCTGG CCGGCGACAT AGACGAGATC
GCCGATGCGA CGGGCAAGCT GCCGAAGGAC TGGCAGAAGT CCCTCCCCTA CAACTCATCG
CCCTATACCT CGACCATCGT CTTCCTCGTT CGCGACGGCA ACCCGGAAGG CATCAAGGAC
TGGGACGACC TGGTGAAGCC GGGGATCGAA GTCATCACGC CGAACCCGAA GACCTCGGGC
GGCGCGCGCT GGAACTACCT CGCGGCATAC GCCTATGCGC TCGAACATTC CGGCAATGAC
GACGCGAAGG CGCGCGAATT CGTCGGCAAG CTTTTCAGGA ATGTGCCGGT GCTCGACACG
GGCGCGCGCG GCTCCACCAC CACCTTCGTC CAGCGCGGCA TCGGCGACGT CTTCATTTCG
TGGGAGAACG AAGCCTTCCT CGCCCAGAAG GAATTTCCCG GCAAGTTCGA GATCGTCGTG
CCGACGCTCT CGATCCGCGC CGAGCCGCCC GTCGCAATCG TGACCGGCAA TACGGACAAG
CGCGGCACGA CCAAGCTCGC CCGCGCCTAT CTCGAATATC TTTATTCCTC GACCGGCCAG
AACCTCGCAG CGAAACATTT CTATCGCCCC GTGAAGCCGG AATTCGCCGA CAAGGAAGAC
CTCAAGCGTT TCCCGACCGT GAAGCTCGTC TCGATCGATG ATGTCTTCGG CGGCTGGGTA
AAGGCGCAGC CCGAGCATTT CGGCGACGGC GGCGTGTTCG ACCAGATTTA TCAGCCGGGC
CGTTAA
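
The GC content field in the gene information can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C among the A/C/G/T bases and that the spaces and line breaks in the listing are formatting only; the helper name gc_content is hypothetical, and the page's own rounding rule is not stated.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases in a DNA string, ignoring spaces and line breaks."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First ten bases of the gene sequence: 3 of 10 are G or C.
print(gc_content("GTGACTTTTT"))
```

Passing the full 1026 bp sequence as one string (whitespace included) gives the gene-wide fraction, which can then be compared with the reported percentage.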
 
Protein sequence
MTFSDLTKRS AVLLAGIFLL LAMTFEVRAE TTLLNVSYDP TRELYRDFNA AFVEHWKKET 
GETVSIEQSH GGSGKQARAV IDGLEADVVT LALAGDIDEI ADATGKLPKD WQKSLPYNSS
PYTSTIVFLV RDGNPEGIKD WDDLVKPGIE VITPNPKTSG GARWNYLAAY AYALEHSGND
DAKAREFVGK LFRNVPVLDT GARGSTTTFV QRGIGDVFIS WENEAFLAQK EFPGKFEIVV
PTLSIRAEPP VAIVTGNTDK RGTTKLARAY LEYLYSSTGQ NLAAKHFYRP VKPEFADKED
LKRFPTVKLV SIDDVFGGWV KAQPEHFGDG GVFDQIYQPG R
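
The protein sequence follows from the gene sequence under translation table 11, in which the GTG start codon is read as methionine. The sketch below illustrates this without external libraries. It assumes the codon-to-residue assignments of table 11 match the standard code and that its start codons are ATG, GTG, TTG, CTG, ATT, ATC and ATA, as listed by NCBI; the translate helper is illustrative only.

```python
BASES = "TCAG"
# Residues for all 64 codons in NCBI order (first, second, third base over T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(seq: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at a stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces and line breaks of the listing
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # alternative start codons still initiate with methionine
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("GTGACTTTTT CAGATTTGAC CAAACGCTCC"))  # MTFSDLTKRS
```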