Gene Information |
Locus tag | RPB_1885 |
Symbol | |
ID | 3908080 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 2156642 |
End bp | 2158615 |
Gene Length | 1974 bp |
Protein Length | 657 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637883779 |
Product | 4-alpha-glucanotransferase |
Protein accession | YP_485504 |
Protein GI | 86749008 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1640] 4-alpha-glucanotransferase |
TIGRFAM ID | [TIGR00217] 4-alpha-glucanotransferase |
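
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon (neither convention is stated in the record itself). The function names are hypothetical and used only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 2156642
END_BP = 2158615
GENE_LENGTH_BP = 1974
PROTEIN_LENGTH_AA = 657


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))   # 1974
print(protein_length(GENE_LENGTH_BP))  # 657
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.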
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTTAT TGACCAAAGC TGCAGAACTC GGAATTCAAA CCGAATTCTT CGATGGGCAG GGGCAGCGCC ACGTCACTCC GCCCGACGCC CTCAAGATCG TTCTCGATGC GATGCCGGTG CGGTCGACGC ATCGATTGCT CGACCGGATC GCCGTTGTTC GGGCCGGGGA AGAGGCTGCT GCAGATCTGA GCGATGCAGC GGTCTTGCCG GTGGCGTGGA AAATTGTCGC AGGCGACGCC TGCGTGGCGG AAGGGACGAC CCAGCAGCGC CGCGTCACCT GGCCGTCGCA TTTGCCGGTC GGCAGCTATC GTCTGCATCT TTCAGACGCG GGCGGTCACA CTGAGGAGGC GCCGCTCCTG GTGACGCCGC CCACCGCCTT TCGCGGCGAG TTCGATCGCT CCTGGATCAT CGCGGTGCAA CTCTACGGAC TGCGCTCCGC GCGCAATTGG GGCATCGGCG ACTTCACCGA TCTGGCGCGA TTGATCGAGA TTGCGGCCGG ATGGGGCGCA GGCGGCATCG GCCTCAATCC GTTGCATGCG CTGTTCGACG ATCGGCCGGG CGACTGCAGT CCGTATTCGC CCAACAGCCG CATGTTTCTC AACCCGCTGT ATATCGATGT CGAGCGACTG CCCGAATTCA AACTCGCCGA TCTGCCGGGC GCCGCGGACG CGCTCGCGCC GCTGCGCGAC GCGGAACTGA TCGACTATCC GAAATCGGCC GATCTCAAAT GGCGTGCGCT ACGGATGGCT CACGCGGCGT TTCGCGCCGC GCCGGCGCCG GATCGACGCG CCGCGTTCGA GGCCTTTCGC CGCGAGCGCG CTCCGCTGCT GACCCGGTTC GCCTGCTTCG AGGCGCTGAG GCATCGGTTC GCCAAGCCGT GGTGGGAATG GCCGGATGAA TGGCAGCATC CCGACGATGT CCGTTGCGCC CGCTTGCGTG ACGGTGCCGA CGGCGCCGAG ATCGAATTCA TCGAATACGT CCAATGGTGC GCCGACGATC AGTTGCGCGC CTGCCACGAC CTCGCTGCCG CGCGCGGCAT GGATGTCGGG CTTTATCTCG ACGTTGCGGT CGGCGTGCAG AGCGACGGCT TCGACGCCTG GAACGAGCAG GTCGCGATCT CCCGGCATCT GTCGGTCGGC GCGCCGCCGG ACCAGTTGAA CTCGGCCGGG CAAAACTGGG GCCTCGCGGG CTTCAACGCC GCGGGGCTCG AACTCACCGG CTTTGCGTCG TTCCGCGAGA TGCTGCGCGC CTCGATGCGC TATGCCGGCG CGATCCGGCT CGATCATGTT CTCGGCTTGA ACCGGCTGTA TCTGGTGCCG CACGGCTACG CCGCCGACAG CGGCGTGTAT GTGCAGATGC CGTTCGAAGC GCTGCTGGCG GTCACCGCGC AGGAGAGTGT CGCCAATCGC TGCGTGGTGA TCGGCGAGGA TCTCGGCACG GTGCCGGAAG GCTTCCGCGA GCGCCTGGCG GCGTGGGGCG TGTGGTCGTA TCGCGTGATG ATGTTCGAGC GCGACTTCCA TCAGGGGTGG TTCTTCGGCG TCGATCATTA TCTGCCGGAG GCGCTGGTGA CCTTCAACAC CCACGACCTC GCGACCTATT CGGGCTGGCG CGCCTCGACA GACCTGCAGT TCAAGCATTC GGTGGGGATC GATCCCGGCG AGAGCGACGA TGCCAGGCGA TACGCCTTCG CGATGCTCGG CGACGTGCTG CGGCAGCAGG GCATCGAACA GGAGGATATC TATGCGGTGC TCACCTTCCT GGCGCGAACC CGGTCGCGAC TGCTGGCGAT CTCGCTCGAA 
GATCTGCTCG GCGTGATCGA TCAGCCCAAC GTGCCCGGCA CCGTGTTCCA GCATCCGAAC TGGCGCCGTC GCCTGCCGCG CGCGCTGGAC GAGATCGCGT CGGCGATCAA TCAGCAGGCG CTGTCCGCCG CGACGCGGGA GCGCCGGTCC GCGGTGACAT CCGTTGCCGG GTAG
|
Protein sequence | MDLLTKAAEL GIQTEFFDGQ GQRHVTPPDA LKIVLDAMPV RSTHRLLDRI AVVRAGEEAA ADLSDAAVLP VAWKIVAGDA CVAEGTTQQR RVTWPSHLPV GSYRLHLSDA GGHTEEAPLL VTPPTAFRGE FDRSWIIAVQ LYGLRSARNW GIGDFTDLAR LIEIAAGWGA GGIGLNPLHA LFDDRPGDCS PYSPNSRMFL NPLYIDVERL PEFKLADLPG AADALAPLRD AELIDYPKSA DLKWRALRMA HAAFRAAPAP DRRAAFEAFR RERAPLLTRF ACFEALRHRF AKPWWEWPDE WQHPDDVRCA RLRDGADGAE IEFIEYVQWC ADDQLRACHD LAAARGMDVG LYLDVAVGVQ SDGFDAWNEQ VAISRHLSVG APPDQLNSAG QNWGLAGFNA AGLELTGFAS FREMLRASMR YAGAIRLDHV LGLNRLYLVP HGYAADSGVY VQMPFEALLA VTAQESVANR CVVIGEDLGT VPEGFRERLA AWGVWSYRVM MFERDFHQGW FFGVDHYLPE ALVTFNTHDL ATYSGWRAST DLQFKHSVGI DPGESDDARR YAFAMLGDVL RQQGIEQEDI YAVLTFLART RSRLLAISLE DLLGVIDQPN VPGTVFQHPN WRRRLPRALD EIASAINQQA LSAATRERRS AVTSVAG
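
The protein sequence can be reproduced from the gene sequence using the codon assignments of translation table 11. The sketch below assumes the sequence is pasted in as shown in this record (blocks of ten bases separated by spaces) and builds the codon table from the NCBI amino-acid string for table 11, whose codon assignments are the same as the standard code. It does not handle the alternative start codons that table 11 allows, which is not an issue here because this gene starts with ATG. The names `clean` and `translate` are hypothetical helpers.

```python
# Codon table in NCBI order: bases T, C, A, G at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
print(translate("ATGGATTTAT TGACCAAAGC TGCAGAACTC"))  # MDLLTKAAEL
```

The output matches the first block of the protein sequence. Passing the full gene sequence should yield the full 657-residue protein under the same assumptions.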