Gene RPD_1209 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1209 
Symbol 
ID4021685 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1366541 
End bp1368127 
Gene Length1587 bp 
Protein Length528 aa 
Translation table11 
GC content70% 
IMG OID637961401 
Producthypothetical protein 
Protein accessionYP_568348 
Protein GI91975689 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.688773 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCGCGG CCAAGCTGGT TGCCTGTGGC GGCGGCGGGC TCTATCACGG GCCAACCGAA 
CTCCGCTGGC CCCTCATGCG CTTTACCTCC CTGATCATCG AGCTGATCCG CGCCCGGCCG
CGGCTGATGT TCTGGCTGGT GGTGTCGGCG CAGGCGCTGC TCTGGGTGCT GGTGCCGCTG
CTGGTCTATT CCAGTCCACC CGAAGGCGTC GCCACGGTCC TCGCCTATGG CCGCGAGTAT
CAGGTCGGCA GCGACCTCGG GCCGCCGCTG GCGTTCTGGC TCGCCGATAT CGCGTTTCGC
GCCGCGGGCG GCCATATGGT GGGCGTCTAT CTGCTGGCGC AGGTCTGTTT CATCATCACC
TTCTACGGGC TGTTTCAGCT CGCGCGCAGC ATGGTCGGCC CGCAGCACGC GGTGATCGCC
GTGCTGCTGA CGTCGACGGT GACCGCCTTT GCGGAGCAGG GCGCCGAATT CGGTCCGCTG
GTCCTGGCGC GGCCGCTCTG GGCGCTGGTG TTGTGGCATA GCTGGGAAAT CATCGGCCGC
GGCCGGCGCA GCGCCTGGTT CGCGCTGTCG ATTGAGGTCG GGCTGTTGCT GCTGACCACG
GTGGCTGCGC CGGCCCTGTT GCTGCTGCCG ATCGGCTTCG CGCTGTCGAC CGCGCGCAGC
CGGCGCGCTT TGATGTCGCT GGATCCGATG TTCAGCCTGC TGGTGGTCGC AGTGCTGGTG
CTGCCCTATG GGATCTGGCT GCTACGCGCC GACATTTTCG CGCTGCCGTC GCTGCCCGCG
TTGGGCGATC TCGGCGATCG TGCGCTGCTC GGCGTCGAGC TGTTCGGCGG CCTGGTGGTC
GCAATCGGCG GGATGGCGCT GCTGGTGCTG CTCAACACCA GCCGTTTTGA TCCGAGGCCG
GACGACGCGC CGGTCGTGTA TCGCGCGCCG GTCGATCCGC TGGCGCGGCA GTTCGTGTAC
TTCTTCGCAC TCGCGCCGGC GCTCCTCGGC GCCATCGTCG CCGGCCTGTT CGGGCTGAAG
CACGTCATTG GCGGGGCGGG GATCGCTCTG CTGATGGTCG GACTCGCGGT GGTGATCGCG
ACCGGCGACC TCATCCATCT GCGCCGGCAG CGCCTGCTGC GCGCGGCGTG GGCCGCGCTG
GTGGCGGCAC CCGCGTTGGT GGTGATTGTG GCGTCCGTGG TTCAGCCCTG GGTCAGCCAG
ACCGAACTCG CCACATCGCT GCCCGCCAAG GACATCGCCC GCTATTTCGG CGACAGCTTC
GAGCGCCGCA CCGGCCGGCC GCTATCGGCG GTGGCGGGCG ATCCAGAGCT TGCCGGGCTG
ATCGCGATGG GCGCGTCGCG GCCGCATCTG TTTCTCGACG CGACGCCGTC GCGCACGCCC
TGGGTGACGC CGGCGACGTT CAACGAGCGC GGCGGCGTGG TGGTGTGGCG CGCCGCCGAT
ACCGCGGGCA GGCCGCCGCC GGAACTCGCC ATGCGGTTTC CCGATATCGT GCCCGAGCTT
CCGCGCGCGT TCGAGCGGAT GATCGCCGGG CGTCAGCCCT TGCTGCGGAT CGGTTGGGCG
ATCGTGCGGC CGAAAGCGGC GCCTTAA
 
Protein sequence
MSAAKLVACG GGGLYHGPTE LRWPLMRFTS LIIELIRARP RLMFWLVVSA QALLWVLVPL 
LVYSSPPEGV ATVLAYGREY QVGSDLGPPL AFWLADIAFR AAGGHMVGVY LLAQVCFIIT
FYGLFQLARS MVGPQHAVIA VLLTSTVTAF AEQGAEFGPL VLARPLWALV LWHSWEIIGR
GRRSAWFALS IEVGLLLLTT VAAPALLLLP IGFALSTARS RRALMSLDPM FSLLVVAVLV
LPYGIWLLRA DIFALPSLPA LGDLGDRALL GVELFGGLVV AIGGMALLVL LNTSRFDPRP
DDAPVVYRAP VDPLARQFVY FFALAPALLG AIVAGLFGLK HVIGGAGIAL LMVGLAVVIA
TGDLIHLRRQ RLLRAAWAAL VAAPALVVIV ASVVQPWVSQ TELATSLPAK DIARYFGDSF
ERRTGRPLSA VAGDPELAGL IAMGASRPHL FLDATPSRTP WVTPATFNER GGVVVWRAAD
TAGRPPPELA MRFPDIVPEL PRAFERMIAG RQPLLRIGWA IVRPKAAP