Gene RPC_2544 details


Gene Information

Locus tag: RPC_2544
Symbol:
ID: 3970973
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2758077
End bp: 2759822
Gene Length: 1746 bp
Protein Length: 581 aa
Translation table: 11
GC content: 68%
IMG OID: 637925652
Product: glycosyl transferase family protein
Protein accession: YP_532413
Protein GI: 90424043
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
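
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2758077
end_bp = 2759822

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 1746 bp and protein length of 581 aa.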


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.327974
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.26724
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGAGA CCTTTGCAAG ACCCCGATTT GGCATGCCGC CGGAGCCGAA GAACCGGATC 
AATCCGGGCC AGCGGCTGGT CGGCGTGATC GATTACGTCG CCGCCAGCCA TGGCCGCGCG
GTGGCCTTTC TGGCGCTGTG CGGGTTGTTG TTCACGCTGT CGGGATTTTT CACGATTCCG
CCGATCGACC GCGACGAAGC GCGCTTCGCA CAGGCCACCA AGCAGATGGT CGAGACCAGC
GATTTCGTCG ATATCCGGTT TCAGGGCGAG GTTCGCTATA AGAAGCCGGT CGGGATCTAT
TGGCTGCAGG CGACCGTGGT CGAAGCCGCC TCGGCGCTCG GCCTGCCGCG CGCCGAGGTG
CGGATCTGGC TGTACCGGGT GCCCTCGATG CTGGGCGCGA TCGGCGCGGT GTTGCTGACC
TATTGGACCG CCTTGGCGTT CGTGACGCGG CGCGGCGCGG TGCTCGCCGG CTTGATCCTT
TGCAGTTCGA TTTTGCTCGG CGTCGAAGCC CGGCTCGCCA AGACCGACGC GGTGCTGCTG
TTGACCGTGA TCGCCGCGAT GGGGGCGATG GCGCGGGTCT ATCTCGCCTG GCAGCGCGGC
GAGGATTCCG CGCGCGGCTC GTGGACCACG CCGGCGATCT TCTGGACCGC GCTGGCCGGC
GGCATCCTGC TGAAGGGGCC GCTGATCCTG ATGTTCGTCG GCCTCACCAT CGTCACCCTG
GCGATCTGCG ACCGCTCGCT GGCCTGGCTG CGGCGGCTGC GGCCGTTGTG GGGCGCGCTG
TGGATGCTGG CATTGGTGCT GCCGTGGTTC ATCGCCATCT TCCAGCGCGC CGGCGAGAGC
TTCTTTTCCG ACTCGGTCGG CGGCGACATG CTGAGCAAGA TCACCAGCGC CAAGGAATCC
CACGGCGCGC CGCCCGGAGT GTATTTCCTG CTGTTCTGGG TGACGTTCTG GCCCGGCGCG
CCGCTCGCCG CGATGGCGGC GCCGGCGGTG TGGCGGGCGC GGCGCGAGCC CGGCGCGCAA
TATCTGTTGG CTTGGCTGGT GCCGTCCTGG ATCGTGTTCG AACTGGTGAT GACCAAGCTG
CCGCATTACG TGCTGCCGCT GTATCCGGCG ATCGCCATCC TGCTGGTCGG GGCATTGGAG
CGGCGCGTGC TGTCGCGCGG CTGGCTGACC CGTGGGGCGG CGTGGTGGTT CGCGATTCCC
GCCGCGGTGC TGACCATCGC GGTGGTCGGC GCGGTGTGGC TGACACGGCA GCCGGCGTTC
CTGGCCTGGC CGTTCGTCGC GGTGTCGATG ATCTTCGGGC TGTTGGCCTG GCGGCTCTAT
GACGACATCC GCTCCGAGCA CGCCATGCTC AACGCGGTGG CGTCGTCGCT GTTTCTCAGC
GTTGCCGTGT ACGGCATCGT GGTGCCGTCG CTGACGCCGC TGTTTCCCAG CGTCGAGATC
GCCCGTGCGC TGCGCAACGT GGTCTGCGTC GGCCCGAAGG CGGCGGCGGC GGGCTATCAG
GAGCCCAGCC TGGTGTTCAT GACCGGGACC TCGACGCTGC TCACCGACGG CTCCGGCGCC
GCGGATTTCT TAGGCCAAGG CAGTTGCCGC TTCGCGCTGG TGGAATCCCG TACCGAACGG
GCGTTCGCGG CGCGCGCCGA AGCCATCGGC CTGCGCTACG ACGTCGCCGC ACGGATCGAC
GGCTATAATT TCTCGCAGGG CCGGGCGATC TCGGTGGCGG TGTTCCGCTC CGAGGGCACC
CAGTAA
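
The GC content field can be recomputed from the gene sequence. This is a minimal sketch, assuming the sequence is pasted as a string in the 10-base blocks shown above and that GC content means the percentage of G and C bases over all bases; `gc_content` is a hypothetical helper name, not something the record defines.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (the spaces and line breaks used to block the sequence)
    is ignored. Case is normalised to upper case.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Example on the first line of the gene sequence only.
first_line = "ATGAGCGAGA CCTTTGCAAG ACCCCGATTT GGCATGCCGC CGGAGCCGAA GAACCGGATC"
print(round(gc_content(first_line), 1))
```

Running the function on the full gene sequence gives a value that can be compared with the listed GC content of 68%.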
 
Protein sequence
MSETFARPRF GMPPEPKNRI NPGQRLVGVI DYVAASHGRA VAFLALCGLL FTLSGFFTIP 
PIDRDEARFA QATKQMVETS DFVDIRFQGE VRYKKPVGIY WLQATVVEAA SALGLPRAEV
RIWLYRVPSM LGAIGAVLLT YWTALAFVTR RGAVLAGLIL CSSILLGVEA RLAKTDAVLL
LTVIAAMGAM ARVYLAWQRG EDSARGSWTT PAIFWTALAG GILLKGPLIL MFVGLTIVTL
AICDRSLAWL RRLRPLWGAL WMLALVLPWF IAIFQRAGES FFSDSVGGDM LSKITSAKES
HGAPPGVYFL LFWVTFWPGA PLAAMAAPAV WRARREPGAQ YLLAWLVPSW IVFELVMTKL
PHYVLPLYPA IAILLVGALE RRVLSRGWLT RGAAWWFAIP AAVLTIAVVG AVWLTRQPAF
LAWPFVAVSM IFGLLAWRLY DDIRSEHAML NAVASSLFLS VAVYGIVVPS LTPLFPSVEI
ARALRNVVCV GPKAAAAGYQ EPSLVFMTGT STLLTDGSGA ADFLGQGSCR FALVESRTER
AFAARAEAIG LRYDVAARID GYNFSQGRAI SVAVFRSEGT Q
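
The protein sequence can be checked against the gene sequence by translating it. The sketch below is an illustration under stated assumptions: it builds the codon table from the standard NCBI base ordering, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard table and differs in which start codons are allowed. The helper name `translate` is hypothetical, and alternative start codons are not handled because this gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and the final line of the gene sequence.
print(translate("ATGAGCGAGA CCTTTGCAAG ACCCCGATTT"))
print(translate("CAGTAA"))
```

The first call reproduces the opening block of the protein sequence (MSETFARPRF), and the final codons CAG TAA give the terminal Q followed by a stop.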