Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_1604 |
Symbol | |
ID | 3910075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 1807772 |
End bp | 1809727 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637883500 |
Product | polysaccharide biosynthesis protein CapD |
Protein accession | YP_485225 |
Protein GI | 86748729 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1086] Predicted nucleoside-diphosphate sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.43289 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGACTG GGATCCGTTC ATTTTCGTAT CGCAACTGGA TGATCGCAAT TCACGACGCC GTTGCGACCG CGGTTGCGGT TCTGCTGAGT TTTTTCCTGC GCTTCGACGG CGAAAATCTG CTGGACCGGC TGCCGTTGCT GCTCTGGATA CTGCCTTACT TCGTCGTCTT CAGTTTTTTC GTTTGTTACG CCTTTCAGCT GACCACGACG AAATGGCGAT TCATCTCGAT TCCCGATCTG CTCAACATCA TACGGGCGGC AAGCGTTCTC ACGCTTGCGC TGCTCGTGAT GGACTACATC TTCCTCGCCC CGAACGTTTA CGGTTCCTTC TTCCTGGGCA AGACGACCAT CGTCATCTAT TGGGTGCTCG AGGTCTTCCT TCTGTCCGGC TCGCGGATCG CGTATCGCTA TTTCCGCTAC ACGCGCACGC GCAACAAGGC CCACCAGTTG GACGCCGCGC CGGCGGTGCT GATCGGCCGT GCTGCCGACG CCGAGGTGCT GTTGCGCGGG ATCGAAAGCG GCGCCGTCAA GCGGTTGTGG CCGGTCGGGA TCCTTTCGCC CGCCAGATCG GATCGAGGGC AAACCATCCG CGGAATCCCC GTGCTGGGCG GGATCGACGA TCTGCCCAAC GTGGTCGAGG ACTTCGCTCA TCGCAAGCGG CCGATCGAGC GCGTGGTGAT GACGCCGTCG GCATTCGAGG CCGACGCCAA GCCGGAAGCG GTGCTGATGC GGGCACGCAA GCTGGGGCTC GCCGTCAGCC GTCTTCCATC GCTGGGGGAA AGTCGGGACA CCCCCCGCCT CTCGCCGGTG GCGGTGGAGG ATCTGCTGCT GCGGCCCAGC GTCGATATCG ACTATGGTCG GCTCGAGAAT CTGCTCAACG GCAAATCCAT CGTCGTCACC GGCGGTGGCG GCTCGATCGG ATTGGAGATG TGCGATCGGG TGACCACCTT CGGAGCCGCG CGCCTTCTCG TGATCGAGAA CTCGGAGCCG GCCCTGTACG CGGCGATGGA GGCGCTCTCC ACCAAGATCA CCAAGACCAA GATCGACGGC CATATCGCCG ACATTCGCGA TCGCGCGCGG ATCTTTCAGC TCATCATCGA ATTCCAGCCG GATCTGGTCT TTCACGCCGC GGCGCTGAAG CACGTCCCGA TCCTCGAACG CGACTGGGGT GAAGGCGTCA AGACCAACAT CTTCGGCTCG GTGAATGTCG CGGATGCGGC ACGCGCCGCC AACGCCCAAG CGATGGTGAT GATCTCGACC GACAAGGCGA TCGAGCCGGT GTCGATGCTG GGTCTCACCA AACGATTCGC CGAATTGTAC TGCCAGGGAA TCGATCGTGA ACTCTCCGGC GCCGCGGGCG ACGAGCCGGC GATGCGCCTG ATCTCCGTGC GCTTCGGAAA CGTCCTGGCA TCGAACGGCT CGGTGGTACC GAAGTTCAAG GCGCAGATCG AAGCCGGCGG GCCGGTGACG GTGACGCATC CCGACATGGT GCGTTACTTC ATGACGATCC GCGAAGCCTG CGATCTGGTG ATTACCGCCG CCACCCACGC GCTGAATCCG CAACATGCCG ACGCTTCGGT ATTCGTTCTG AGCATGGGGC AGCCGGTGAA GATCGTCGAT CTCGCGGATC GGATGATCCG GCTGTCCGGC CTGCAACCGG GCTACGACAT CGACATCGTC TTCACCGGCG TGAGGCCGGG CGAGCGGATG CACGAGATCC TGTTCGCCGA GCACGAATCC TTCATCGAGA TCGGGCTTCC CGGTGTGGTC GCCGCACGAC CGAAGGAATT GCCATTGAAG ACGCTGCGGC AATGGCTGAC CGAACTTGAG AAGGCCACAA CCGAAGGGCG CTACGACTGC GTCGTTGCCA TTCTCAAGGA CGCGGTTCCG GAGTATCAGG CCGGCGACGC CGCCCAGGAC CAGAACAGCA GTGCGTCAGG CAAGTTAGCT TTGTGA
|
Protein sequence | MLTGIRSFSY RNWMIAIHDA VATAVAVLLS FFLRFDGENL LDRLPLLLWI LPYFVVFSFF VCYAFQLTTT KWRFISIPDL LNIIRAASVL TLALLVMDYI FLAPNVYGSF FLGKTTIVIY WVLEVFLLSG SRIAYRYFRY TRTRNKAHQL DAAPAVLIGR AADAEVLLRG IESGAVKRLW PVGILSPARS DRGQTIRGIP VLGGIDDLPN VVEDFAHRKR PIERVVMTPS AFEADAKPEA VLMRARKLGL AVSRLPSLGE SRDTPRLSPV AVEDLLLRPS VDIDYGRLEN LLNGKSIVVT GGGGSIGLEM CDRVTTFGAA RLLVIENSEP ALYAAMEALS TKITKTKIDG HIADIRDRAR IFQLIIEFQP DLVFHAAALK HVPILERDWG EGVKTNIFGS VNVADAARAA NAQAMVMIST DKAIEPVSML GLTKRFAELY CQGIDRELSG AAGDEPAMRL ISVRFGNVLA SNGSVVPKFK AQIEAGGPVT VTHPDMVRYF MTIREACDLV ITAATHALNP QHADASVFVL SMGQPVKIVD LADRMIRLSG LQPGYDIDIV FTGVRPGERM HEILFAEHES FIEIGLPGVV AARPKELPLK TLRQWLTELE KATTEGRYDC VVAILKDAVP EYQAGDAAQD QNSSASGKLA L
|
| |