Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3679 |
Symbol | |
ID | 3969616 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 4096451 |
End bp | 4099750 |
Gene Length | 3300 bp |
Protein Length | 1099 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637926789 |
Product | trehalose synthase-like |
Protein accession | YP_533533 |
Protein GI | 90425163 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases [COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis |
TIGRFAM ID | [TIGR02456] trehalose synthase [TIGR02457] trehalose synthase-fused probable maltokinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATATCA TGCCCTCGAT CGAGCCCAAC GGCTCGCAGT CCAGCGTCGA GACCGACGAA TTGTGGTACA AAGACGCCAT CATCTATCAG CTGCACGTCA AGGCGTTTGC CGACAGCAAC AATGACGGCA TCGGCGATTT CGCCGGCCTG ACCGAAAAGC TGGACTATCT GCAGGAGCTC GGCGTCACCG CGCTGTGGCT GATGCCGTTC TATCCGTCGC CCGGCCGCGA CGACGGCTAC GACATCGCCG ATTACGGCGC GGTCAGCCCC GACTACGGCA CCATGAAGGA TTTCCGCCGC TTCATCGTCG AGGCGAAGAA GCGCGGGCTG CGCGTCATCA CCGAATTGGT CATCAACCAC ACCTCGGATC AGCACGACTG GTTCAAACGC GCCCGCCGCA GCCCGAAGGA CTCGAGCGCG CGGGACTGGT ACGTCTGGAG CGACACCGAC CAGAAATATG CCGGCACCCG CATCATCTTC ACCGACACCG AGAAGTCGAA CTGGTCGTGG GACCCCGAGG CCAAGGCGTA TTATTGGCAC CGCTTCTTCT CGCACCAGCC GGATCTGAAC TTCGACAACC CGCGGGTGGT GTCCGCGGTG ATCCAGGTGA TGAAGCGCTG GCTCGACACC GGGGTCGACG GCTTCCGGCT CGACGCCATC CCCTATCTGT GCGAGCGCGA CGGCACCAAC AACGAGAACC TCCCGGAGAC CCACGCGGTC ATCAAGACGT TGCGCGCCGA ACTCGACGCC TATGCCAAGG GCAAGGTGCT GCTCGCCGAG GCCAACCAGT GGCCGGAGGA CGTCCAGCAA TATTTCGGCC AGGGCGACGA ATGCCACATG GCCTATCATT TCCCGCTGAT GCCCCGAATC TACATGGCGC TGGCGCAGGA AGACCGCTTT CCGATCACCG ACATCCTGCG GCAGACCCCG GACATCCCGG CGAATTGCCA ATGGGCGATG TTCCTGCGCA ACCATGACGA ACTGACGCTG GAAATGGTCA CCGATGTCGA GCGCGATTAT CTGTGGTCGA CCTATGCCAA CGACCCGCGG GCGCGGATCA ATGTCGGCAT CCGCAGGCGC CTTGCGCCAT TGATGGACAA CGACCGCCGC AAGATCGAAC TGATGAATTC GCTGCTGCTG TCGTTTCCCG GCACGCCGAT CATCTATTAC GGCGACGAAA TCGGGATGGG CGACAACATC TATCTCGGCG ACCGCAACGG CGTGCGCACC CCGATGCAGT GGACGCCCGA CCGCAACGGC GGCTTCTCCC GCGCCGATCC CGCACGGCTC TATGCGCCGA CCATCATGGA CCCGGTGTAC GGCTACGAGG CGGTCAACGT CGAAGCGCAG TCGCGCAGCC TGTCGTCGCT GCTCAGCGCC ACCAAGCGGC TGATCTCGGT GCGCAAGTCG ACGCCGGCGT TCGGCCGCGG CACCATGAGT TTCATCCGGC CGTCGAACCG CGCGGTGTTG GCCTATGTCC GGCAATACGA GGACGAAGTG ATTCTCTGCG TCGCCAATCT GTCACGCTCC GCGCAGGCCA CCGAACTCGA TCTGTCGGCC TGGAAAGACC GGATCCCGCA GGAAATGCTG GGACGAACCC GGTTTCCGGC GATCGGCGAA CTGCCCTACA TGATCACGCT GGCGCCCTAC GGGTTCTATT GGTTCCGGCT GTGCGAGCGC GACGCCTCGC AGCATCCGGC GCCCTCCGTG GTGCCGGAAT TCGAGACCCT GGTGGTGCCG CTCGGCGCCA CCTGGATGAC GCTCGGCCGC ACCCGCGGCG TGTTCGAGCG CGACGTGCTG CCGGCGCATC TGGCGCGCAC CCGGTGGTAT CCCGAACGCT CGGCGCGGGC GATCCATCCG CGGCTGACCT CGGCGATCCC GTTCTGCATC GAGGGCGACA ACCGGCCCTG GCTGGCGTTC TTCGAAGCCA CCCAGCGCGG GGTGACCTCG CGTTACGTGC TGCCGATGCA GATCGCCTGG GTGCGGTTCG ATCGCGAACG CTTCAATCCG CGCGCCTTCG CCGCGGTGCG CCAGGGCGCC CGCGAAGGCA CCCTGCTCGA CGTCGCCTCC GATCCCGACT TCGTGACGCT GCTGTTGCAG AATCTGCGGG CGTCGCTGAC CGTCGAAGAG CAAGGCGTGC GGCTGGTGTT CCGGCCGACC GCGCGGTTCG CCGAGAAGCC GGACAAGCCG TTGCAGAACA TTCGCGCCGT CGAAACCGAG CAGTCCAACA GCACCACGCT GGTCGATGGC GATTATGTGG TGAAACTCTA TCGCCGGCTG GAGACCGGGA TCAATCCCGA GATCGAGATG GGCCGCTTCC TGACCGACGT CGCCGGCTTC GCCAACACCC CGGCGCTGAT GGGCAGCGTC GAACTGATCG AGGGCGACGC CACCAGCGCG GTGGCGGTGG TGCACGCCTT CGTCGGCAAC CAGGGCGACG GCTGGACCGT GACCCAGGCC TATCTCGACC GCTTCGTCGA CGAGCAGCGG CTGCTGACGA CCGCCGAGCA GCCCGGCGAG AGCGACGAAC AGGTGCCATA TCTGCGCTAC ATGTCGCAAA CCGGTAAACG CGTCGCAGAG ATGCATGTCG CGCTGGCCGC GCATCCGGAA GAGCCGGAGT TCGCTCCCGA ACCGATCACC GCCGAGCAAG TGCAGGGCTG GGTCGACAGC GTCACCGGCT ACGCCGAACG GGTGATCGAC GACTTGAAGC GGCGGCGCGA TCAGCTCAAG GAGGGCGATC GCCGGCTGGT CGACCAATTG GCGGCGCTGC GCGAATCGAT CCGCGAACGG CTGCGCAGCC TGCTGCACGA AGACGGCGGC GGCCTCAACA TCCGCCACCA CGGCGATTTT CATCTCGGCC AGATGCTGAT CGTCAAGGAC GACATCTCGA TCATCGACTT CGAAGGCGAG CCGCGGCGCA GCCAGGCCGA ACGACGCCGC AAGGCGCCGG CGGCGCGCGA CGTCGCCGGA TTGATCCGTT CGATTGACTA TTCGGCCACC GCGGCGTTGG ATCGCGCGCT GAAAGCCGCC CCGGACGAGC AGGGCAAGCT TGCCTCCGCG CTCGAAGCGT GGCGTAACCG GTCGACCGCC GCGTTTCTCA CGGCCTACCG CGAGTCCATG ACCGATACCC GGTTATGGCC TGGTCATCCT GATGCAGCCG GTGGGATTCT CGATTTCTTC CTGCTAGAAA AAGCCTTCTA CGAAATCGAA TACGAACTCG CCCACCGCCC CGATTGGCTC CGCGTCCCGC TTGCCGGCGT CATTCGAATT TTGTCCCAGC GTTCCCAGGA GGTCACATGA
|
Protein sequence | MNIMPSIEPN GSQSSVETDE LWYKDAIIYQ LHVKAFADSN NDGIGDFAGL TEKLDYLQEL GVTALWLMPF YPSPGRDDGY DIADYGAVSP DYGTMKDFRR FIVEAKKRGL RVITELVINH TSDQHDWFKR ARRSPKDSSA RDWYVWSDTD QKYAGTRIIF TDTEKSNWSW DPEAKAYYWH RFFSHQPDLN FDNPRVVSAV IQVMKRWLDT GVDGFRLDAI PYLCERDGTN NENLPETHAV IKTLRAELDA YAKGKVLLAE ANQWPEDVQQ YFGQGDECHM AYHFPLMPRI YMALAQEDRF PITDILRQTP DIPANCQWAM FLRNHDELTL EMVTDVERDY LWSTYANDPR ARINVGIRRR LAPLMDNDRR KIELMNSLLL SFPGTPIIYY GDEIGMGDNI YLGDRNGVRT PMQWTPDRNG GFSRADPARL YAPTIMDPVY GYEAVNVEAQ SRSLSSLLSA TKRLISVRKS TPAFGRGTMS FIRPSNRAVL AYVRQYEDEV ILCVANLSRS AQATELDLSA WKDRIPQEML GRTRFPAIGE LPYMITLAPY GFYWFRLCER DASQHPAPSV VPEFETLVVP LGATWMTLGR TRGVFERDVL PAHLARTRWY PERSARAIHP RLTSAIPFCI EGDNRPWLAF FEATQRGVTS RYVLPMQIAW VRFDRERFNP RAFAAVRQGA REGTLLDVAS DPDFVTLLLQ NLRASLTVEE QGVRLVFRPT ARFAEKPDKP LQNIRAVETE QSNSTTLVDG DYVVKLYRRL ETGINPEIEM GRFLTDVAGF ANTPALMGSV ELIEGDATSA VAVVHAFVGN QGDGWTVTQA YLDRFVDEQR LLTTAEQPGE SDEQVPYLRY MSQTGKRVAE MHVALAAHPE EPEFAPEPIT AEQVQGWVDS VTGYAERVID DLKRRRDQLK EGDRRLVDQL AALRESIRER LRSLLHEDGG GLNIRHHGDF HLGQMLIVKD DISIIDFEGE PRRSQAERRR KAPAARDVAG LIRSIDYSAT AALDRALKAA PDEQGKLASA LEAWRNRSTA AFLTAYRESM TDTRLWPGHP DAAGGILDFF LLEKAFYEIE YELAHRPDWL RVPLAGVIRI LSQRSQEVT
|
| |