Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_2780 |
Symbol | mdoG |
ID | 3910573 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3167609 |
End bp | 3169120 |
Gene Length | 1512 bp |
Protein Length | 503 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637884680 |
Product | glucan biosynthesis protein G |
Protein accession | YP_486393 |
Protein GI | 86749897 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3131] Periplasmic glucans biosynthesis protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.309239 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.811126 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAATCGTC GGCAGGTTCT CACTGGTCTT GCAGCGATTC CGCTGTTGCA GTCCCAGGCC CGCTCGTCGT TCGCGCAGGC CGCGGATAAA TCCCAACCCT TCGATCCCTC GGTGGTGCGC CAGTTGGCGC GCGAGCTCGC GGGCAAGCCG TATCAGACGC CCGACAGCAA GCTGCCATCA CCGCTCGCTA ACCTCAACTA CGACGCCTAC CGCGAAATCC GCTTCAATCC GGAGCGTGCG CTGTGGCGCA GCGATCACCT GCCGTTTCAG GTGCAGTTCT TCCATCGCGG CTTTCTCTAC AACAACCGCA TCAACATCTA CGAAGTGACA GGTGGTCAGG CGAAGCCTGT CCCATATCGC GCCGGCGACT TCTCGTTCGG TGACACGCCG CCGCCATCGC CGGACGCCGA TCTCGGCTTC GGCGGTTTCC GGATTCACGC CCCGATTAAC AAGCCGGACT ACTACGACGA GCTCTGCGTC TTCCTCGGCG CATCGTATTT TCGTGCGGTG GCGAAAGGCG AGCAGTACGG CCTGTCCGCG CGCGGACTGA CGGTCGACAC TGGGCAGAGC GGCGGCGAGG AGTTTCCGCT TTTCAAGGCG TTCTGGCTGG AGCGGCCATC TCCCGACGCT TCGTCGATGG TGGTGCACGC CCTACTCGAC AGCAAGAGCG TCGCCGGCGC CTATCGGTTC ACTATACGTC CGGGCGACAC CACGGTGTTC GACGTCGAAG CTGCGGTCTA TCCGCGCGTC GATCTGCAGC ATGCCGGCCT CGCGCCGATG ACCAGCATGT TCTTCTTCGG ACCGAACGAC CCCGCCGATG CGGCGGATTT TAGACCCGCG GTGCACGATT CCGAGGGGCT GGGAATCTTC AACGGCCGCG GCGAGCAATT GTGGCGCCCG CTGTGCAATC CGCGCGACCT GCAGATCAGT TCGTTCGCCG ACCAGAACCC GCGCGGCTTC GGCCTGATGC AGCGGGAGCG AAAATTCGAG GCCTATCAGG ATTTGGAGTC GCGCTTCGAA ATGCGGCCGA GCTTGTGGGC CGAGCCGATC GGCGACTGGG GCGAAGGCGT GGTGAAGCTG GTCGAGATTC CGACCAAGGA AGAAGTCCAC GACAACATCG CTTCGTTCTG GGAGCCGAAG GGGCCGCTCA AGGCCAAGGG CGAGCACATC TACACCTACC GCCTGCATTG GGGGCCGGAC ACGCCGAAGC CGTCCGCGCT GGCCCGCTTC ACCCGCACCG GCGTCAGTGC GCGCGCCGAC AACGACCGGC TGTTCGTGCT CGATATCACC GGCGACAAGC TGAGGGGCCT CGATCCCGGA GCGGTTCGAG GCCAAGTCAC GACCAACAAG GGGGAGATCC GCAACGTGGT CAGCCTGCCG AACCCGCTGA CTGACGGCTG GCGGCTCAGC TTCAACCTGA TCACCGATCA GCCGCCGATC GAGCTCCGCG CTATCCTGAT GGCGGGCGAC ACCGCGCTAT CCGAAGTCTG GGTCTATCGA TGGGTCCCGT AG
|
Protein sequence | MNRRQVLTGL AAIPLLQSQA RSSFAQAADK SQPFDPSVVR QLARELAGKP YQTPDSKLPS PLANLNYDAY REIRFNPERA LWRSDHLPFQ VQFFHRGFLY NNRINIYEVT GGQAKPVPYR AGDFSFGDTP PPSPDADLGF GGFRIHAPIN KPDYYDELCV FLGASYFRAV AKGEQYGLSA RGLTVDTGQS GGEEFPLFKA FWLERPSPDA SSMVVHALLD SKSVAGAYRF TIRPGDTTVF DVEAAVYPRV DLQHAGLAPM TSMFFFGPND PADAADFRPA VHDSEGLGIF NGRGEQLWRP LCNPRDLQIS SFADQNPRGF GLMQRERKFE AYQDLESRFE MRPSLWAEPI GDWGEGVVKL VEIPTKEEVH DNIASFWEPK GPLKAKGEHI YTYRLHWGPD TPKPSALARF TRTGVSARAD NDRLFVLDIT GDKLRGLDPG AVRGQVTTNK GEIRNVVSLP NPLTDGWRLS FNLITDQPPI ELRAILMAGD TALSEVWVYR WVP
|
| |