Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Plav_0541 |
Symbol | |
ID | 5454146 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Parvibaculum lavamentivorans DS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009719 |
Strand | + |
Start bp | 584628 |
End bp | 587813 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640876109 |
Product | trehalose synthase |
Protein accession | YP_001411821 |
Protein GI | 154250997 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00661441 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.174056 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGAAC AGGCGCTCGA GACCGATCCC CTCTGGTACA AGGACGCGGT GATCTACCAG CTTCATGTGA AGTCGTTCTT CGATGCCAAT AATGACGGCA TCGGCGATTT CGCCGGGCTG ATGCGTAAGC TCGACTACAT CGCCGATCTC GGCGTGACCG CGATCTGGCT GCTGCCTTTC TACCCGAGCC CCCGCCGCGA CGACGGCTAC GACATCGGCG AGTACCGCGA TGTGAGCCCG GACTACGGCA CCTTTGAAGA GATGCGCGCC TTTGTGCAGG CGGCGCATGG GCGGGGCATT CGCGTCATCA CCGAGCTGGT CATCAACCAC ACATCGGACC AGCATCCCTG GTTTCAGGCG GCGCGCCGCG CACCGCCGGG AAGCCCCGAG CGGGATTTCT ATGTCTGGTC GGACAGCGAC AAGAATTACG CCGGCACGCG GATCATCTTC TGCGATACGG AGAAATCGAA CTGGACCTGG GACGAGGAGG CTGGCGCCTA TTTCTGGCAT CGCTTCTATT CGCACCAGCC GGACCTGAAT TTCGACAATC CTGCGGTGCT GAAGGAAGTC TTGTCCGTGA TGCATTTCTG GCTCGATGCG GGCGTCGACG GGCTGCGGCT CGACGCCATC CCCTATCTGA TCGAGCGGGA GGGAACATCG AACGAAAATC TGCCGGAGAC GCATGAAGTC CTTAAGAAGA TCCGCGCCGA CCTCGATCGA CACTACAAGG ACCGCGTATT GCTGGCGGAA GCGAATATGT GGCCGGAAGA CACTCAGCAA TATTTCGGCA CGGACGGCGA CGAATGCCAC ATGGCGTTTC ACTTTCCGCT TATGCCGCGC ATGTACATGG CGGTGGCGCA GGAAGATCGC TTCCCGATCA CCGACATCAT GCGCCAGACC CCCGACATAG ATGAAAGCTG CCAATGGGCG ATTTTCCTGC GCAATCATGA CGAGTTGACG CTCGAAATGG TGACGGACGC GGAGCGCGAC TATCTCTGGA ACACCTATGC CGCGGACCGG CGGGCGCGGA TCAATCTCGG CATTCGCCGC AGGCTGGCGC CGCTGATGGA GCGGGACAGG CGCAGGATCG AGTTGATGAA TGCGCTGCTG CTGACCATGC CGGGCACGCC GGTGCTCTAC TACGGCGATG AAATCGGCAT GGGCGACAAT GTCTATCTCG GCGACAGGGA TGGTGTGCGC ACGCCGATGC AGTGGTCGCC CGACCGCAAT GGCGGCTTCT CGCTCGCGGA CCCGGCGACA CTGGCGCTGC CCGCGATCAT GGACCCGCTT TACGGCTTTC AGGCCGTCAA TGTCGAGGCG CAGGAACGGG ACCGGCACTC GCTGCTGAAC TGGCTGAAGC GGATGCTGGC GGTGCGGCGG GAGCACCGCG CGTTCGGACG CGGTGCGCAA CGCTTTCTCA GGCCAGCGAA CCGGAAAGTG CTCGCCTATC TGCGCGAGCA TGACGGAGAC ATCATTCTCT GCGTCGCCAA TCTCAGCCGG ACCGCGCAGG CGGTGGAGCT GGACCTGCAT GAATATGAAA ACCGGACGCC CATGGAACTG AGCGGCGGCG CGGCTTTTCC GGCCATCGGG CAGTTGCCTT ATCTCCTGAC GCTGCCGCCT TACGGCTTTC TCTGGTTTCG CCTCGCGGAG GATGCGACCT CGCCCGAATG GGCGAGCGGG GCGCCCGGCA TGGAGATCGA GAGATACACA TTCGTGCTGC GCCCCTCGCT GACGGATGTC ACGCAGGGGA ACAACCGGCA TGTTCTCGAG AACGATGTGC TGCCCGCCTA TCTTCCCTAT CGCCGCTGGT TTGGCGCCAA GGACGAGACG CTGCGCGGCG TTCGCGTTGC GAAAACGGCG GCGCTGCCCG GCGCGGAGGA TTTGCTGTGG ACGGAACTCG AGGTGACGAC CGATAGCGGC ACGAATTCAT ATGCATTGCC TCTCGGCATC GTCTGGGAGG GAGAGCAGGC GGGCCCCTTC GCCTCCAACC TCGCCCTCGC GCGCGTGCGG CGCGGGCGGC ATGTGGGGCT GCTGACGGAT GGCTTCTCGC TCGAGACCTT CTCGCAGACG GTGGTGAAGG CAATGCAAAG CGGGGCGGAA CTGCCGCTCG ACGGAAGCGT GGTGAAGTTC ACCGCTCATC CGGGCTTCGA CATTGAACCC GACCTCAAGC CGCATTGGCT GGCGGCGGAG CAATCGAACA GCACGATGGT GCTGGGCGAG CGAGCGGTAC TGAAAGTTCT GCGCAAGATA CAGAAGGGCA TTCACCCTGA GGTCGAGATG GTGCGCCATC TGACGGACAA GGGATTTGAA AATGTCCCGG CCATTCTCGC CGAAGCGAAT CGTATTGATG ATGACGGCAC GAGCCTGCTG ATGCTGATGC AGACCTTCGT CTATAACCAG GGCGACGGAT GGCAGTGGAC GCTGGGCGCG CTTGAGCGGA TGGCGACCGA CATGGATTGG AGTTTCTCCA ACTACGCCAA TTTCGCAAGG AACCTTGGCC AGAGGCTTGC GGAGATGCAC GCCGTGCTCG CGCAACCGGC AAAGCATCCG GCCTTTGCAC CCGAGACCAT GGATGCGGCG GCCGCTTCCG AAATGGGCGA GCGCATCGCG AACGAGGTTT CTACCGCGCT CGACCTGACT CCGCAGACCG GAACCGAGGA CGAAACGCTG CTCGGCCAGC GCGAGATATT GCTCATGCGC ATCCGCGAGC TCGCCATTTC CGCTGAAGGG CGCGTGCGCA CAAGAATTCA TGGCGATCTG CATCTGGGGC AGGTGCTCGT GACCGGCAGC GACGTGATGC TGATCGATTT CGAAGGCGAG CCGGCAAAGC CGCTTGCCGA CCGGCGGAAG AAAGATATTC CACTGCGCGA TGTGGCCGGG CTGCTCCGCT CGTTCGATTA TGCGGCGGCG GTTGCCGAGC GGCAGCGCCC CGCAAGCGCG GAGACGGAGG AAATCCGGAC GGGTGAACGC TATGCGCGCT TCCGCGTTCA GGCGGTGGAG GCGTTCCTGG ACGGATATGC CGGAGAGCGC GACATCGGCA ACGACCCCTT GCTCAATCTC CTCGTTCTGG AGAAGGCGGC ATATGAGGTT GCCTACGAGG CGGCCAACCG GCCGGACTGG ATCGACGTTC CCGTTGCGGG GTTTGCACGC GTGGCGGAAG CGATACTCAA CGGAAAGGGA ATTTGA
|
Protein sequence | MTEQALETDP LWYKDAVIYQ LHVKSFFDAN NDGIGDFAGL MRKLDYIADL GVTAIWLLPF YPSPRRDDGY DIGEYRDVSP DYGTFEEMRA FVQAAHGRGI RVITELVINH TSDQHPWFQA ARRAPPGSPE RDFYVWSDSD KNYAGTRIIF CDTEKSNWTW DEEAGAYFWH RFYSHQPDLN FDNPAVLKEV LSVMHFWLDA GVDGLRLDAI PYLIEREGTS NENLPETHEV LKKIRADLDR HYKDRVLLAE ANMWPEDTQQ YFGTDGDECH MAFHFPLMPR MYMAVAQEDR FPITDIMRQT PDIDESCQWA IFLRNHDELT LEMVTDAERD YLWNTYAADR RARINLGIRR RLAPLMERDR RRIELMNALL LTMPGTPVLY YGDEIGMGDN VYLGDRDGVR TPMQWSPDRN GGFSLADPAT LALPAIMDPL YGFQAVNVEA QERDRHSLLN WLKRMLAVRR EHRAFGRGAQ RFLRPANRKV LAYLREHDGD IILCVANLSR TAQAVELDLH EYENRTPMEL SGGAAFPAIG QLPYLLTLPP YGFLWFRLAE DATSPEWASG APGMEIERYT FVLRPSLTDV TQGNNRHVLE NDVLPAYLPY RRWFGAKDET LRGVRVAKTA ALPGAEDLLW TELEVTTDSG TNSYALPLGI VWEGEQAGPF ASNLALARVR RGRHVGLLTD GFSLETFSQT VVKAMQSGAE LPLDGSVVKF TAHPGFDIEP DLKPHWLAAE QSNSTMVLGE RAVLKVLRKI QKGIHPEVEM VRHLTDKGFE NVPAILAEAN RIDDDGTSLL MLMQTFVYNQ GDGWQWTLGA LERMATDMDW SFSNYANFAR NLGQRLAEMH AVLAQPAKHP AFAPETMDAA AASEMGERIA NEVSTALDLT PQTGTEDETL LGQREILLMR IRELAISAEG RVRTRIHGDL HLGQVLVTGS DVMLIDFEGE PAKPLADRRK KDIPLRDVAG LLRSFDYAAA VAERQRPASA ETEEIRTGER YARFRVQAVE AFLDGYAGER DIGNDPLLNL LVLEKAAYEV AYEAANRPDW IDVPVAGFAR VAEAILNGKG I
|
| |