Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_0917 |
Symbol | |
ID | 3909770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1058173 |
End bp | 1059657 |
Gene Length | 1485 bp |
Protein Length | 494 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637882810 |
Product | Alpha,alpha-trehalose-phosphate synthase |
Protein accession | YP_484539 |
Protein GI | 86748043 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0380] Trehalose-6-phosphate synthase |
TIGRFAM ID | [TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.424622 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAACCTAG TCGTCGTTTC AAACCGGGTG GCGAGGGCGT CTTCGAACGA GCCGATGACC GGTGGTCTGG CCGCCGCCTT GCTGCCCGTG GTCGAAAAGT CCGGCGCCAT CTGGGTCGGT TCCAGCGGCC GGGTGCGCGA TGGCGCGCAA CGGGAACCCT TTGCGGAGAT TGAACAACTC GGCGCCGGCG CGCTGGCGAT GCTGGATCTG CCGGCGGCGC ATTACGGCGG TTATTACGAG GGCTTCGCCA ATTCCGCGCT GTGGCCGGCG CTGCATTCCC GCGCCGATCT GATCCGCGTC TCGCAGGACG ACTACCGTTC CTATCGCGAG GTCAATAGCT TCATGGCGCG CGCCCTGCTG CGCTTCCGCA AGCCCGACAC TGCGTTCTGG ATACAGGATT ACCATTTTCT CGCCCTCGGC GCGGAGTTGC GCGCCCTCGG CGTGACCCAG CCGATCGGCT TCTTCCTGCA CACGCCGTGG GCGTCCCCGG CGACGATGGG CTGCGTGCCG CACAATCGCG AGCTGGTCGA GGCGATGCTC GCCTATGATC TGATCGGATT CCAGACCGAA GAGGACCGGA GCAATTTTCT GGCCTATGGC AAAGCCGAAC TCGGCTTTGC CATCGCCGAC GGCGTCGTCA CGACGCCGTA CGGCACATCG CGTTGTGAAG TGTTCCCGAT CGGCATCGAT GCCGATCTGT TTGCGCAGCA GGCGCAGAAG GCGACCGCGC ATCCCGATGT GTCGCGGCTG CGCAAGAGCC TCAACGGCGA GAAGCTCGTG ATCGGCGTCG ACCGCCTGGA TTATTCGAAG GGTTTGATCA ACCGCGTCAA TGCCTTCGAC CGAATGCTGA CCATGCGACC GTCGCTGCAA CGCACCGTGT CATTGCTGCA GATCGCGACC CCGTCGCGCG GCACGATCGA GGCCTATGGC AATCTGCAGG GCGAACTCGC CAAGCTCGTC AGCGACGTCA ACGGCCGGCT CGGCGAGGCC GACTGGACCC CGATCCGCTA TCTCAACAAG GGCTTCCGTC AGGGCGTGCT GGCGGGGCTG TACCGCACCG CGCAGGTCGG TCTTGTCACG CCGCTGCAGG ACGGCATGAA TCTGGTGGCG AAGGAATACG TCGCCGCGCA GAATCCGATC GACCCCGGCG TGCTGGTGCT GTCGAAATTC GCGGGCGCCG CCAACGAGCT CGATACCGCG CTGCTGGTCA ATCCGCACGA CGTCGAAGGC ATGGCCCGCG CCATCGCCAC CGCATTGTCG ATGCCCCTGA CCGAGCGCCG GCTGCGCTGG GAAGCGATGA TGGCCAAGCT GCGCCGCGGC AGCGTCCAGT CCTGGTTCGC GGATTTCGTG GCGTCGCTGG AAGACGCCCA TACCGCCAAC AGCGACGCCG CCGGCGCCCT CGCCAGCCAG CCGCCGGCGC TGAAAATGGC CGGCGGCTGG TCGCGGGGCG GGCCTCTGGC CTTCGGCGGC GCGCGGCTGC AATAG
|
Protein sequence | MNLVVVSNRV ARASSNEPMT GGLAAALLPV VEKSGAIWVG SSGRVRDGAQ REPFAEIEQL GAGALAMLDL PAAHYGGYYE GFANSALWPA LHSRADLIRV SQDDYRSYRE VNSFMARALL RFRKPDTAFW IQDYHFLALG AELRALGVTQ PIGFFLHTPW ASPATMGCVP HNRELVEAML AYDLIGFQTE EDRSNFLAYG KAELGFAIAD GVVTTPYGTS RCEVFPIGID ADLFAQQAQK ATAHPDVSRL RKSLNGEKLV IGVDRLDYSK GLINRVNAFD RMLTMRPSLQ RTVSLLQIAT PSRGTIEAYG NLQGELAKLV SDVNGRLGEA DWTPIRYLNK GFRQGVLAGL YRTAQVGLVT PLQDGMNLVA KEYVAAQNPI DPGVLVLSKF AGAANELDTA LLVNPHDVEG MARAIATALS MPLTERRLRW EAMMAKLRRG SVQSWFADFV ASLEDAHTAN SDAAGALASQ PPALKMAGGW SRGGPLAFGG ARLQ
|
| |