Gene RPC_4183 details

Gene Information

Locus tag: RPC_4183
Symbol:
ID: 3972540
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 4648272
End bp: 4649273
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 65%
IMG OID: 637927286
Product: glycosyl transferase family protein
Protein accession: YP_534027
Protein GI: 90425657
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
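
The start and end coordinates, the gene length, and the protein length listed above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive (the record does not state this explicitly). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4648272, 4649273)
print(length, protein_length(length))  # 1002 333
```

Under these assumptions the computed values agree with the listed Gene Length (1002 bp) and Protein Length (333 aa).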


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAGTG GCGACCGCCT GCAGATCATC GTGCCTTGTT ACAACGAGGA GCAGGTGCTG 
CAATCCACCG CCGCCACGCT GGCCTCCGTG ATCGAGTCCT GCATCAGCGC GGGCCTGATC
GCGCCGTCGA GCGCGGTGCT GTTTGTCGAT GATGGATCCT CGGACCAGAC ATGGCGTCTG
ATCGAAGACT TGCACGCCGC CGATCCGCAG CGTTTCGACG GCGTTCGGTT GTCCGCCAAT
CGCGGACACC AGGCGGCGCT GTGGGCGGGG CTGTCGACCG CGGATGCCGA TCTTGTCGTG
TCGATCGATG CCGATCTGCA GGACGACCCC CAGGCGATCG TCAGGATGAT CAAGGAATAC
GACGCCGGCG CAGACGTGGT ATTCGGACTG CGCTCGAATC GCGAAAGCGA CGGCTGGTTC
AAGCGCAGTT CGGCCACGCT GTTCTACCGC TTGTTGCGCC TGCTGGGCGT CAACATCGTG
CCGCAGCACG CCGATTTCCG GTTGATGAGC CGCCCGGCGA TCGACGCCTT GCTGCAATAC
TCGGAATCCA ACTTGTTCCT ACGGGCGTTG GTGCCGCAGC TCGGCTTCGC CACGGCGCAG
GTCAGCTACC CCCGAACGTC GCGGGCCGCG GGAACCACCA AGTACCCTAT CGGCAAGATG
CTCGGTCTGG CCATTGACGG GATCACCTCC TGGTCGGTGG CGCCGCTTCG GGCGATCGGG
TTGCTTGGCC TGACGGTGTC GGCAATGGCC TTCTTGCTCG GTCTGTGGGC GCTGTGGGCC
GCGCTGTTCA CCCATGCGAC CATCCCGGGC TGGGCCTCGA TCATGCTGCC GCTGCTGTTC
TCCCAGGGCT TGCAGTTCAT CTTCCTGGGC CTAATCGGCG AGTACATCGG TAAGATCTTC
GTGGAGACCA AGCGCCGGCC GAAATTCATC ATCCGCGCCC GGGCGGGGAC GAACCCGCGC
TCGGCCGCGG CCCGCGCCGA GCGCGCCGAG AAAGTGAACT GA
 
Protein sequence
MASGDRLQII VPCYNEEQVL QSTAATLASV IESCISAGLI APSSAVLFVD DGSSDQTWRL 
IEDLHAADPQ RFDGVRLSAN RGHQAALWAG LSTADADLVV SIDADLQDDP QAIVRMIKEY
DAGADVVFGL RSNRESDGWF KRSSATLFYR LLRLLGVNIV PQHADFRLMS RPAIDALLQY
SESNLFLRAL VPQLGFATAQ VSYPRTSRAA GTTKYPIGKM LGLAIDGITS WSVAPLRAIG
LLGLTVSAMA FLLGLWALWA ALFTHATIPG WASIMLPLLF SQGLQFIFLG LIGEYIGKIF
VETKRRPKFI IRARAGTNPR SAAARAERAE KVN
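
The protein sequence corresponds to a translation of the gene sequence, and the GC content is a property of the same nucleotide string. The sketch below shows one way to reproduce both from the text of this record. It is an illustration under stated assumptions, not the database's own pipeline: the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the codon table is written out by hand. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons. Since this gene starts with ATG, no alternative start handling is included.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# These elongation assignments are shared by the standard code and table 11.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 12 bases of the gene sequence, copied from this record.
print(translate("ATGGCAAGTG GCGACCGCCT GCAGATCATC"))  # MASGDRLQII
print(translate("AAAGTGAACT GA"))                     # KVN
```

Passing the full gene sequence to `translate` and `gc_percent` gives values that can be compared against the listed protein sequence and GC content.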