Gene Plav_0541 details


Gene Information

Locus tag: Plav_0541
Symbol:
ID: 5454146
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Parvibaculum lavamentivorans DS-1
Kingdom: Bacteria
Replicon accession: NC_009719
Strand:
Start bp: 584628
End bp: 587813
Gene Length: 3186 bp
Protein Length: 1061 aa
Translation table: 11
GC content: 62%
IMG OID: 640876109
Product: trehalose synthase
Protein accession: YP_001411821
Protein GI: 154250997
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
        [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis
TIGRFAM ID: [TIGR02456] trehalose synthase
            [TIGR02457] trehalose synthase-fused probable maltokinase
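
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # subtract the stop codon
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
gene_len, protein_len = cds_lengths(584628, 587813)
print(gene_len, protein_len)  # 3186 1061
```

Under those assumptions, the result agrees with the Gene Length (3186 bp) and Protein Length (1061 aa) fields.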


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00661441
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.174056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGAAC AGGCGCTCGA GACCGATCCC CTCTGGTACA AGGACGCGGT GATCTACCAG 
CTTCATGTGA AGTCGTTCTT CGATGCCAAT AATGACGGCA TCGGCGATTT CGCCGGGCTG
ATGCGTAAGC TCGACTACAT CGCCGATCTC GGCGTGACCG CGATCTGGCT GCTGCCTTTC
TACCCGAGCC CCCGCCGCGA CGACGGCTAC GACATCGGCG AGTACCGCGA TGTGAGCCCG
GACTACGGCA CCTTTGAAGA GATGCGCGCC TTTGTGCAGG CGGCGCATGG GCGGGGCATT
CGCGTCATCA CCGAGCTGGT CATCAACCAC ACATCGGACC AGCATCCCTG GTTTCAGGCG
GCGCGCCGCG CACCGCCGGG AAGCCCCGAG CGGGATTTCT ATGTCTGGTC GGACAGCGAC
AAGAATTACG CCGGCACGCG GATCATCTTC TGCGATACGG AGAAATCGAA CTGGACCTGG
GACGAGGAGG CTGGCGCCTA TTTCTGGCAT CGCTTCTATT CGCACCAGCC GGACCTGAAT
TTCGACAATC CTGCGGTGCT GAAGGAAGTC TTGTCCGTGA TGCATTTCTG GCTCGATGCG
GGCGTCGACG GGCTGCGGCT CGACGCCATC CCCTATCTGA TCGAGCGGGA GGGAACATCG
AACGAAAATC TGCCGGAGAC GCATGAAGTC CTTAAGAAGA TCCGCGCCGA CCTCGATCGA
CACTACAAGG ACCGCGTATT GCTGGCGGAA GCGAATATGT GGCCGGAAGA CACTCAGCAA
TATTTCGGCA CGGACGGCGA CGAATGCCAC ATGGCGTTTC ACTTTCCGCT TATGCCGCGC
ATGTACATGG CGGTGGCGCA GGAAGATCGC TTCCCGATCA CCGACATCAT GCGCCAGACC
CCCGACATAG ATGAAAGCTG CCAATGGGCG ATTTTCCTGC GCAATCATGA CGAGTTGACG
CTCGAAATGG TGACGGACGC GGAGCGCGAC TATCTCTGGA ACACCTATGC CGCGGACCGG
CGGGCGCGGA TCAATCTCGG CATTCGCCGC AGGCTGGCGC CGCTGATGGA GCGGGACAGG
CGCAGGATCG AGTTGATGAA TGCGCTGCTG CTGACCATGC CGGGCACGCC GGTGCTCTAC
TACGGCGATG AAATCGGCAT GGGCGACAAT GTCTATCTCG GCGACAGGGA TGGTGTGCGC
ACGCCGATGC AGTGGTCGCC CGACCGCAAT GGCGGCTTCT CGCTCGCGGA CCCGGCGACA
CTGGCGCTGC CCGCGATCAT GGACCCGCTT TACGGCTTTC AGGCCGTCAA TGTCGAGGCG
CAGGAACGGG ACCGGCACTC GCTGCTGAAC TGGCTGAAGC GGATGCTGGC GGTGCGGCGG
GAGCACCGCG CGTTCGGACG CGGTGCGCAA CGCTTTCTCA GGCCAGCGAA CCGGAAAGTG
CTCGCCTATC TGCGCGAGCA TGACGGAGAC ATCATTCTCT GCGTCGCCAA TCTCAGCCGG
ACCGCGCAGG CGGTGGAGCT GGACCTGCAT GAATATGAAA ACCGGACGCC CATGGAACTG
AGCGGCGGCG CGGCTTTTCC GGCCATCGGG CAGTTGCCTT ATCTCCTGAC GCTGCCGCCT
TACGGCTTTC TCTGGTTTCG CCTCGCGGAG GATGCGACCT CGCCCGAATG GGCGAGCGGG
GCGCCCGGCA TGGAGATCGA GAGATACACA TTCGTGCTGC GCCCCTCGCT GACGGATGTC
ACGCAGGGGA ACAACCGGCA TGTTCTCGAG AACGATGTGC TGCCCGCCTA TCTTCCCTAT
CGCCGCTGGT TTGGCGCCAA GGACGAGACG CTGCGCGGCG TTCGCGTTGC GAAAACGGCG
GCGCTGCCCG GCGCGGAGGA TTTGCTGTGG ACGGAACTCG AGGTGACGAC CGATAGCGGC
ACGAATTCAT ATGCATTGCC TCTCGGCATC GTCTGGGAGG GAGAGCAGGC GGGCCCCTTC
GCCTCCAACC TCGCCCTCGC GCGCGTGCGG CGCGGGCGGC ATGTGGGGCT GCTGACGGAT
GGCTTCTCGC TCGAGACCTT CTCGCAGACG GTGGTGAAGG CAATGCAAAG CGGGGCGGAA
CTGCCGCTCG ACGGAAGCGT GGTGAAGTTC ACCGCTCATC CGGGCTTCGA CATTGAACCC
GACCTCAAGC CGCATTGGCT GGCGGCGGAG CAATCGAACA GCACGATGGT GCTGGGCGAG
CGAGCGGTAC TGAAAGTTCT GCGCAAGATA CAGAAGGGCA TTCACCCTGA GGTCGAGATG
GTGCGCCATC TGACGGACAA GGGATTTGAA AATGTCCCGG CCATTCTCGC CGAAGCGAAT
CGTATTGATG ATGACGGCAC GAGCCTGCTG ATGCTGATGC AGACCTTCGT CTATAACCAG
GGCGACGGAT GGCAGTGGAC GCTGGGCGCG CTTGAGCGGA TGGCGACCGA CATGGATTGG
AGTTTCTCCA ACTACGCCAA TTTCGCAAGG AACCTTGGCC AGAGGCTTGC GGAGATGCAC
GCCGTGCTCG CGCAACCGGC AAAGCATCCG GCCTTTGCAC CCGAGACCAT GGATGCGGCG
GCCGCTTCCG AAATGGGCGA GCGCATCGCG AACGAGGTTT CTACCGCGCT CGACCTGACT
CCGCAGACCG GAACCGAGGA CGAAACGCTG CTCGGCCAGC GCGAGATATT GCTCATGCGC
ATCCGCGAGC TCGCCATTTC CGCTGAAGGG CGCGTGCGCA CAAGAATTCA TGGCGATCTG
CATCTGGGGC AGGTGCTCGT GACCGGCAGC GACGTGATGC TGATCGATTT CGAAGGCGAG
CCGGCAAAGC CGCTTGCCGA CCGGCGGAAG AAAGATATTC CACTGCGCGA TGTGGCCGGG
CTGCTCCGCT CGTTCGATTA TGCGGCGGCG GTTGCCGAGC GGCAGCGCCC CGCAAGCGCG
GAGACGGAGG AAATCCGGAC GGGTGAACGC TATGCGCGCT TCCGCGTTCA GGCGGTGGAG
GCGTTCCTGG ACGGATATGC CGGAGAGCGC GACATCGGCA ACGACCCCTT GCTCAATCTC
CTCGTTCTGG AGAAGGCGGC ATATGAGGTT GCCTACGAGG CGGCCAACCG GCCGGACTGG
ATCGACGTTC CCGTTGCGGG GTTTGCACGC GTGGCGGAAG CGATACTCAA CGGAAAGGGA
ATTTGA
 
Protein sequence
MTEQALETDP LWYKDAVIYQ LHVKSFFDAN NDGIGDFAGL MRKLDYIADL GVTAIWLLPF 
YPSPRRDDGY DIGEYRDVSP DYGTFEEMRA FVQAAHGRGI RVITELVINH TSDQHPWFQA
ARRAPPGSPE RDFYVWSDSD KNYAGTRIIF CDTEKSNWTW DEEAGAYFWH RFYSHQPDLN
FDNPAVLKEV LSVMHFWLDA GVDGLRLDAI PYLIEREGTS NENLPETHEV LKKIRADLDR
HYKDRVLLAE ANMWPEDTQQ YFGTDGDECH MAFHFPLMPR MYMAVAQEDR FPITDIMRQT
PDIDESCQWA IFLRNHDELT LEMVTDAERD YLWNTYAADR RARINLGIRR RLAPLMERDR
RRIELMNALL LTMPGTPVLY YGDEIGMGDN VYLGDRDGVR TPMQWSPDRN GGFSLADPAT
LALPAIMDPL YGFQAVNVEA QERDRHSLLN WLKRMLAVRR EHRAFGRGAQ RFLRPANRKV
LAYLREHDGD IILCVANLSR TAQAVELDLH EYENRTPMEL SGGAAFPAIG QLPYLLTLPP
YGFLWFRLAE DATSPEWASG APGMEIERYT FVLRPSLTDV TQGNNRHVLE NDVLPAYLPY
RRWFGAKDET LRGVRVAKTA ALPGAEDLLW TELEVTTDSG TNSYALPLGI VWEGEQAGPF
ASNLALARVR RGRHVGLLTD GFSLETFSQT VVKAMQSGAE LPLDGSVVKF TAHPGFDIEP
DLKPHWLAAE QSNSTMVLGE RAVLKVLRKI QKGIHPEVEM VRHLTDKGFE NVPAILAEAN
RIDDDGTSLL MLMQTFVYNQ GDGWQWTLGA LERMATDMDW SFSNYANFAR NLGQRLAEMH
AVLAQPAKHP AFAPETMDAA AASEMGERIA NEVSTALDLT PQTGTEDETL LGQREILLMR
IRELAISAEG RVRTRIHGDL HLGQVLVTGS DVMLIDFEGE PAKPLADRRK KDIPLRDVAG
LLRSFDYAAA VAERQRPASA ETEEIRTGER YARFRVQAVE AFLDGYAGER DIGNDPLLNL
LVLEKAAYEV AYEAANRPDW IDVPVAGFAR VAEAILNGKG I
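
Both sequences are listed in blocks of ten characters, six blocks per line. A minimal sketch of turning that layout back into a contiguous string is shown below, using only the last two lines of the gene sequence as input. The helper name `join_blocks` is hypothetical, and the stop-codon set is the standard one (TAA, TAG, TGA), which translation table 11 also uses.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and newlines) from a block-formatted sequence."""
    return "".join(text.split())


# Final two lines of the gene sequence as listed in this record.
gene_tail = join_blocks("""
ATCGACGTTC CCGTTGCGGG GTTTGCACGC GTGGCGGAAG CGATACTCAA CGGAAAGGGA
ATTTGA
""")

STOP_CODONS = {"TAA", "TAG", "TGA"}

print(len(gene_tail))                 # 66
print(gene_tail[-3:] in STOP_CODONS)  # True
```

The same helper can be applied to the full gene or protein listing before comparing its length with the Gene Length and Protein Length fields.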