Gene RPD_2698 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2698 
Symbol 
ID4023196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3015160 
End bp3016362 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content66% 
IMG OID637962897 
Producthypothetical protein 
Protein accessionYP_569828 
Protein GI91977169 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.402222 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.567569 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATGA TGGCCGTGCT GAGCGAAGGC CGGTCCGCGG GCCACGCGGC GGCGTCGGCG 
CAGCCCGGCC GGATTGCGCG CGTCGAAATC GTCCGCGAGA TGGCGGCGGC GGAAGCGATC
TGGCGTTCGC TCGAGCAACC CGAGCAGTTC TTCACGCCCT ATCAGCGCTT CGATTTCCTC
GATGCCTGGC AGCGGCACGT CGGCGTTGCG GAACAACTTG AACCCTTCAT CGTGGTGGCG
CGTGATGCCG AGCTTCGGCC GTTGATGTTG CTGCCGCTCG GACTCGAGCG CCGCTTCGGC
CTGCGCATCG CGCGCTTCCT CGGCGGCAAA CATGCCACGT TCAACATGCC GCTGTGGCGT
CGCGACGCGG CGCAGACTGC CGATGCGCGC GAACTCGATG CGCTCATCGC GGGCCTGCGC
GCACAGCCGG ACGGCGCCGA CGTGCTGGCG CTCCGCCAGC AGCCACTGCG CTGGCGCGAC
CTCGCCAATC CGCTGGCGCA ACTACCGCAT CAAGCCTCGG TCAACGAATG TCCGGTGCTG
TTGCTCGATC CCGCAGCGTC CCCGAGCGAT CGCATCAGCA ATGCGTCTCG CCGCCGTCTC
AAGACCAAGG AAAAGAAGCT GCAGGCGCTG CCCGGCTATC GTTACAGTCA GGCGACCAGC
GACGACGATG TTCGGCGCGT GCTCGATGCG TTCTTCCGGA TCAAGCCGGT CCGGATGGCG
GCGCAGAAGC TGCCGAACGT GTTCGCCGAT CCCGGCGTCG AGGATTTCAT CCGCCGGGCG
TGCCAGACCG AACTCGCCGG CGGCGGCCGT GCGATCGAAA TCCACGCGCT GGAATCCGAC
GACGATATGA TCGCGATGTT CGCCGGCGTC GCCGACGGCC ACCGCTATTC GATGATGTTC
AACACCTACA CGTTGTCCGA AGCCGCGCGC TACAGCCCCG GCCTGATCCT GATGCGCTCG
ATCATCGATC ACTACGCCGA GCTGGGCTAC AGTCGGCTCG ATCTCGGCAT CGGCTCCGAC
GATTACAAGA AGCAGTTCTG CAAGGATGAC GAGCCGATCT TCGACAGCTT CGTCGCCCTG
ACGCCGCGCG GCCGGATTGC GGCTTCGGCG ATGGCGTCGA TCGACCGCGC CAAACGCACG
GTCAAGCAGA CCCCTGCCCT GATGCAGATG GCGCAGGCGC TGCGCGGCGC GCTGTATCGC
TGA
 
Protein sequence
MTMMAVLSEG RSAGHAAASA QPGRIARVEI VREMAAAEAI WRSLEQPEQF FTPYQRFDFL 
DAWQRHVGVA EQLEPFIVVA RDAELRPLML LPLGLERRFG LRIARFLGGK HATFNMPLWR
RDAAQTADAR ELDALIAGLR AQPDGADVLA LRQQPLRWRD LANPLAQLPH QASVNECPVL
LLDPAASPSD RISNASRRRL KTKEKKLQAL PGYRYSQATS DDDVRRVLDA FFRIKPVRMA
AQKLPNVFAD PGVEDFIRRA CQTELAGGGR AIEIHALESD DDMIAMFAGV ADGHRYSMMF
NTYTLSEAAR YSPGLILMRS IIDHYAELGY SRLDLGIGSD DYKKQFCKDD EPIFDSFVAL
TPRGRIAASA MASIDRAKRT VKQTPALMQM AQALRGALYR