Gene RPB_1604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1604 
Symbol 
ID3910075 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1807772 
End bp1809727 
Gene Length1956 bp 
Protein Length651 aa 
Translation table11 
GC content62% 
IMG OID637883500 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_485225 
Protein GI86748729 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.43289 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGACTG GGATCCGTTC ATTTTCGTAT CGCAACTGGA TGATCGCAAT TCACGACGCC 
GTTGCGACCG CGGTTGCGGT TCTGCTGAGT TTTTTCCTGC GCTTCGACGG CGAAAATCTG
CTGGACCGGC TGCCGTTGCT GCTCTGGATA CTGCCTTACT TCGTCGTCTT CAGTTTTTTC
GTTTGTTACG CCTTTCAGCT GACCACGACG AAATGGCGAT TCATCTCGAT TCCCGATCTG
CTCAACATCA TACGGGCGGC AAGCGTTCTC ACGCTTGCGC TGCTCGTGAT GGACTACATC
TTCCTCGCCC CGAACGTTTA CGGTTCCTTC TTCCTGGGCA AGACGACCAT CGTCATCTAT
TGGGTGCTCG AGGTCTTCCT TCTGTCCGGC TCGCGGATCG CGTATCGCTA TTTCCGCTAC
ACGCGCACGC GCAACAAGGC CCACCAGTTG GACGCCGCGC CGGCGGTGCT GATCGGCCGT
GCTGCCGACG CCGAGGTGCT GTTGCGCGGG ATCGAAAGCG GCGCCGTCAA GCGGTTGTGG
CCGGTCGGGA TCCTTTCGCC CGCCAGATCG GATCGAGGGC AAACCATCCG CGGAATCCCC
GTGCTGGGCG GGATCGACGA TCTGCCCAAC GTGGTCGAGG ACTTCGCTCA TCGCAAGCGG
CCGATCGAGC GCGTGGTGAT GACGCCGTCG GCATTCGAGG CCGACGCCAA GCCGGAAGCG
GTGCTGATGC GGGCACGCAA GCTGGGGCTC GCCGTCAGCC GTCTTCCATC GCTGGGGGAA
AGTCGGGACA CCCCCCGCCT CTCGCCGGTG GCGGTGGAGG ATCTGCTGCT GCGGCCCAGC
GTCGATATCG ACTATGGTCG GCTCGAGAAT CTGCTCAACG GCAAATCCAT CGTCGTCACC
GGCGGTGGCG GCTCGATCGG ATTGGAGATG TGCGATCGGG TGACCACCTT CGGAGCCGCG
CGCCTTCTCG TGATCGAGAA CTCGGAGCCG GCCCTGTACG CGGCGATGGA GGCGCTCTCC
ACCAAGATCA CCAAGACCAA GATCGACGGC CATATCGCCG ACATTCGCGA TCGCGCGCGG
ATCTTTCAGC TCATCATCGA ATTCCAGCCG GATCTGGTCT TTCACGCCGC GGCGCTGAAG
CACGTCCCGA TCCTCGAACG CGACTGGGGT GAAGGCGTCA AGACCAACAT CTTCGGCTCG
GTGAATGTCG CGGATGCGGC ACGCGCCGCC AACGCCCAAG CGATGGTGAT GATCTCGACC
GACAAGGCGA TCGAGCCGGT GTCGATGCTG GGTCTCACCA AACGATTCGC CGAATTGTAC
TGCCAGGGAA TCGATCGTGA ACTCTCCGGC GCCGCGGGCG ACGAGCCGGC GATGCGCCTG
ATCTCCGTGC GCTTCGGAAA CGTCCTGGCA TCGAACGGCT CGGTGGTACC GAAGTTCAAG
GCGCAGATCG AAGCCGGCGG GCCGGTGACG GTGACGCATC CCGACATGGT GCGTTACTTC
ATGACGATCC GCGAAGCCTG CGATCTGGTG ATTACCGCCG CCACCCACGC GCTGAATCCG
CAACATGCCG ACGCTTCGGT ATTCGTTCTG AGCATGGGGC AGCCGGTGAA GATCGTCGAT
CTCGCGGATC GGATGATCCG GCTGTCCGGC CTGCAACCGG GCTACGACAT CGACATCGTC
TTCACCGGCG TGAGGCCGGG CGAGCGGATG CACGAGATCC TGTTCGCCGA GCACGAATCC
TTCATCGAGA TCGGGCTTCC CGGTGTGGTC GCCGCACGAC CGAAGGAATT GCCATTGAAG
ACGCTGCGGC AATGGCTGAC CGAACTTGAG AAGGCCACAA CCGAAGGGCG CTACGACTGC
GTCGTTGCCA TTCTCAAGGA CGCGGTTCCG GAGTATCAGG CCGGCGACGC CGCCCAGGAC
CAGAACAGCA GTGCGTCAGG CAAGTTAGCT TTGTGA
 
Protein sequence
MLTGIRSFSY RNWMIAIHDA VATAVAVLLS FFLRFDGENL LDRLPLLLWI LPYFVVFSFF 
VCYAFQLTTT KWRFISIPDL LNIIRAASVL TLALLVMDYI FLAPNVYGSF FLGKTTIVIY
WVLEVFLLSG SRIAYRYFRY TRTRNKAHQL DAAPAVLIGR AADAEVLLRG IESGAVKRLW
PVGILSPARS DRGQTIRGIP VLGGIDDLPN VVEDFAHRKR PIERVVMTPS AFEADAKPEA
VLMRARKLGL AVSRLPSLGE SRDTPRLSPV AVEDLLLRPS VDIDYGRLEN LLNGKSIVVT
GGGGSIGLEM CDRVTTFGAA RLLVIENSEP ALYAAMEALS TKITKTKIDG HIADIRDRAR
IFQLIIEFQP DLVFHAAALK HVPILERDWG EGVKTNIFGS VNVADAARAA NAQAMVMIST
DKAIEPVSML GLTKRFAELY CQGIDRELSG AAGDEPAMRL ISVRFGNVLA SNGSVVPKFK
AQIEAGGPVT VTHPDMVRYF MTIREACDLV ITAATHALNP QHADASVFVL SMGQPVKIVD
LADRMIRLSG LQPGYDIDIV FTGVRPGERM HEILFAEHES FIEIGLPGVV AARPKELPLK
TLRQWLTELE KATTEGRYDC VVAILKDAVP EYQAGDAAQD QNSSASGKLA L