Gene RPD_1141 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1141 
Symbol 
ID4021617 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1296521 
End bp1297588 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content64% 
IMG OID637961333 
Productglycosyl transferase, group 1 
Protein accessionYP_568280 
Protein GI91975621 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0760697 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCATTG CGCAGGTTGC TCCGCTGACG GAGGCTATCC CGCCCAAGCT CTACGGCGGT 
ACGGAAAGAG TCGTGCATTG GTTGACCGAA GAACTCGTCG CGCTCGGACA CGACGTGACG
CTGTTCGCCT CCGGCGATTC CACCACGTCG GCGAAACTCG AGGCGACCTG GCCGAGAGCG
CTCCGCCTCG ATGGCGCGGT GCGCGACGCC AACGCGCTGC ACATGGTCAT GCTGGAGCAG
GTGAGACAAC GGTGTGACAA AGAGGAATTC GATCTCCTCC ACTTCCATCT CGATTACTAT
CCCTGGTCGC TGTTTCGTCG ACAGCCGACG CCCTTCATTA CAACGCTGCA CGGCCGTCTC
GATTTGCCCG AGCATCAGCC GGTGTTCGCG GCTTTCGCAG ATGTGCCGGT GGTGTCGATT
TCGGATTCGC AGCGCCGCCC GGTGCCGAAG GCGAACTGGA TCCGCACCAT CCATCACGGG
CTTCCGGCCG ATCTGCTGAC GCCGCTGGTC CGCAAGCCGA GCTATCTCGC GGTACTCGGG
CGGATCGCGC CGGAGAAGGG CGTCGACCGT GCGATCCGGA TCGCGATCCG CGCCAATGTC
CCGCTGAAGA TCGCGGCGAA GGTCGACCGG GCCGACCTGG AGTATTTCGA ACAGGTCATC
GAGCCGATGT TGCTTCACCC GCTGATCGAG TTCATCGGCG AAATCGGCGA CCAGGAGAAA
TCCGAGTTTC TCAGCGGCGC GCTGGGATTG CTGCTGCCGC TGGATTGGCC GGAGCCGTTC
GGCCTGGTGA TGATCGAATC GCTCGCGTGC GGCGCGCCGG TGATCGCCTA TAACCGCGGC
TCAGTCCCCG AGATCATCGA ACAGGGACTG ACCGGATTCA TCGTCGAGGA CGAGACCAGC
GCGGTGACGG CTGTGCATCA ACTCGAAGAT CTCGATCGCT CCGCGATCCG CGCACGGTTC
GAGGAACGCT TCACAGCGCG GCGGATGGCG CTCGACTATC TGGCGGCCTA TCGAGGCCTG
CTCGCAAAGG CGGTCCCGCC GCGGATCAAG CTGGTGTCGG GCGAGTAA
 
Protein sequence
MRIAQVAPLT EAIPPKLYGG TERVVHWLTE ELVALGHDVT LFASGDSTTS AKLEATWPRA 
LRLDGAVRDA NALHMVMLEQ VRQRCDKEEF DLLHFHLDYY PWSLFRRQPT PFITTLHGRL
DLPEHQPVFA AFADVPVVSI SDSQRRPVPK ANWIRTIHHG LPADLLTPLV RKPSYLAVLG
RIAPEKGVDR AIRIAIRANV PLKIAAKVDR ADLEYFEQVI EPMLLHPLIE FIGEIGDQEK
SEFLSGALGL LLPLDWPEPF GLVMIESLAC GAPVIAYNRG SVPEIIEQGL TGFIVEDETS
AVTAVHQLED LDRSAIRARF EERFTARRMA LDYLAAYRGL LAKAVPPRIK LVSGE