Gene RPC_2685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_2685 
Symbol 
ID3970355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp2915613 
End bp2916812 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content68% 
IMG OID637925796 
Producthypothetical protein 
Protein accessionYP_532553 
Protein GI90424183 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0527922 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATGA TGACAGCGGT GATCGACGGT TCGGCTGCGC AGGCCACGCC CACCGGCGCG 
GTCAGCAGGA TCGCCGACGT CGAGGTGTTG CAGGATCTGG CTTCGGCCGA ACCGATCTGG
CGGGCGCTGC AACAGCCGCC GCATCTTTCC ACCCCCTATC AGCGTTTCGA ACTGCTCGCG
GCCTGGCAGC GCAATGTCGG CGCCGGCCAG CAGATCTCGC CATTCATCGT GGTCGCCCGC
GACGCCGAGC AGCAGCCGCT CGTACTGCTG CCGCTCGGGC TGCAACGCGC CCACGGCTTG
CGCATCGCGA AATTTCTCGG CGGCAAGCAC ACCACCTTCA ACATGGCGCT GTGGCAGCAG
GACTTCGCGC AGCGCGCCGG CGCCGCCGAT CTCGCCGCGC TGATCGCCGG GTTGCGCCGT
CATGCCGACA AGGTCGACGT GCTGGCCTTG CTGCAGCAGC CGGCGAGCTG GCGCGGCGTC
GTCAATCCGC TGTCGCTCGT GCCGCAGCAA CCGTCCGCCA ACGACTGCCC GCTGCTCTGC
ATCGCGCCGG GCGCGACGCC AGTCAGCCTG ATCAGCAATT CGTTCCGCCG CCGGCTGAAG
AGCAAGGAGA AGAAGCTGCA GCCGCTTGCC GGCTATCGCT ACGGCATCGC CACCACCGAC
GCCGAGATCA CCGCGCTGCT GGATTGGTTT TTCGCCGTCA AGCCGCTGCG GATGGCGGCG
CAGAAACTGC CCGACGTGTT CGCCGAGCCG GGCATCGCCG GCTTCATCCG CGAGGCTTGT
TTGGCGAAGC TTGCCGATGG CGGCCGCGCC ATCGACATCC ACGCGCTGCA ATGCGACGAC
GAACCGATCG CGATCTTCGC CGGCGTCGCC GACGGCCAGC GCTTCTCGAT GATGTTCAAC
ACCTACACGC TGTCGGACAG CGCGCGCTAC AGCCCCGGCC TGATCCTGAT GCGCAACATC
ATCGATCACT ACGCCGCACT GGGCTACAAC GCGCTCGATC TCGGCATCGG CTCCGACGAC
TACAAGCGGC TGTTCTGCAA ATCCGACGAG CCGATCTTCG ACGGCTACAT CGCGCTCACC
GCGCGCGGCC GATTCGGCGC CGCAGCGCTG GCGGCGACGT CGCGCGCCAA GCGCGTGGTC
AAGCACAACG CGGCGCTGAT GCGGTTGGTG CAGCTGGCGC GCGGCGCGCT GCGACGATAG
 
Protein sequence
MAMMTAVIDG SAAQATPTGA VSRIADVEVL QDLASAEPIW RALQQPPHLS TPYQRFELLA 
AWQRNVGAGQ QISPFIVVAR DAEQQPLVLL PLGLQRAHGL RIAKFLGGKH TTFNMALWQQ
DFAQRAGAAD LAALIAGLRR HADKVDVLAL LQQPASWRGV VNPLSLVPQQ PSANDCPLLC
IAPGATPVSL ISNSFRRRLK SKEKKLQPLA GYRYGIATTD AEITALLDWF FAVKPLRMAA
QKLPDVFAEP GIAGFIREAC LAKLADGGRA IDIHALQCDD EPIAIFAGVA DGQRFSMMFN
TYTLSDSARY SPGLILMRNI IDHYAALGYN ALDLGIGSDD YKRLFCKSDE PIFDGYIALT
ARGRFGAAAL AATSRAKRVV KHNAALMRLV QLARGALRR