Gene Rpal_4685 details


Gene Information

Locus tag: Rpal_4685
Symbol:
ID: 6412371
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 5044067
End bp: 5045428
Gene Length: 1362 bp
Protein Length: 453 aa
Translation table: 11
GC content: 66%
IMG OID: 642714564
Product: polysaccharide biosynthesis protein
Protein accession: YP_001993651
Protein GI: 192293046
COG category: [R] General function prediction only
COG ID: [COG2244] Membrane protein involved in the export of O-antigen and teichoic acid
TIGRFAM ID:
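
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(5044067, 5045428)
print(length, protein_length(length))  # prints: 1362 453
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields of the record.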


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTCTC CACCCGCGGC AACGTCCCTC GCGGCGCTGA GGGCGCGCCT GACGTCGCTG 
TTCGGCGGTT CCAACGAAGC CTCGCTCACC AACCGGCTCG CCGGCACCAT TTTCCTGATC
CGTGTCGTCA GCGCAGGTCT CGCCTATGGG GCGCAGATCC TGCTGGCGCG CTGGATGGGT
GGCTCTGACT ACGGCATCTA CGTCTATGTC TGGACCTGGG TGCTGCTGCT CGGCTCCATG
CTGGATTTCG GCATCTCCGC CTCAGCCCAG AAGATCATTC CGGAATATCG CGCGCGCGGC
GAACTCGACA GGCTGCGCGG TTTTCTGTCC GGCAGCCGTT GGGGCACGCT GGCGGCCTCG
AGCGTGGTAT CGCTGCTGCT CGCCGCACTG GTCTGGGCAC TGTCGCCGCT GATCGGCGAC
GCCACAGTCA TATCGCTGTA CCTCGGCTGC CTGACGCTCC CGGCATTCGT GGTCGCCAAC
ACCCAGGACG GCATTGCCCG CTCGCACGAC TGGATGCGGC TCGGGCTGAT GCCGCAATTC
ATCATCCGGC AGGCGCTGAT CATCGGCTTC ACCGCCGGGC TGTTCGTGCT CGGCTTCGAG
CTCGGTGCGA TCGCCGCGAT GGCGGCGAGC TGTGCGGCGG TGTGGATCGC GATGCTCGGA
CAGATGATAG CGCTCAACCG CAGGCTTGCC GGCCACGTCC CGCCCGGCCC GCGCGCCTAT
GACGTCCGCG GCTGGCTTGC GACCTCGCTG CCGATCCTCC TGGTCGAGAG CTTCTACCTG
TTGCTGTCCT ACACTGACGT CCTGGTGCTG CAGCAATTCA GCACGCCCGA GGAAGTCGGC
ATCTACTATG CGGTGGTGAA GACGCTGGCG CTGGTGTCGT TCATTCACTA CGCGATGTCG
GCCACCACTG CGCATCGCTT CACCGAATAC AACGCGGCCG GCGACAAGGT CCGGCTGGCG
GCGTATGTGC GCCACGCGAT CGTGTGGACG TTCTGGCCGT CGCTGGTGGC GACGCTGGCG
CTGCTGGCAC TCGGCGAGCC GCTGCTGTGG CTGTTCGGGC CGCAATTCAC GTCCGGCTAC
GGCATCATGT TCGTCGCCGC GATCGGCCTG ATGGTCCGTG CCGCGATCGG TCCGGTCGAG
CGCCTGCTCA ACATGCTCGG CCACCAGCAT GTCTGCGCCC TCGCCTATGC GCTGGCGTTC
GCGGTCAATC TGGCGCTGTG CCTGATCCTA GTGCCCCGCT TCGGCGGCTA CGGCGCCGCT
GCCGCCACCT CCGCAGCCCT CACCTTCGAA ACCGTGATGC TGTTCTGGAT CGTCCGCAAA
CGCCTCGGCC TGCACGTGCT GGCGTTCGGC AGCAAAGGCT AG
 
Protein sequence
MDSPPAATSL AALRARLTSL FGGSNEASLT NRLAGTIFLI RVVSAGLAYG AQILLARWMG 
GSDYGIYVYV WTWVLLLGSM LDFGISASAQ KIIPEYRARG ELDRLRGFLS GSRWGTLAAS
SVVSLLLAAL VWALSPLIGD ATVISLYLGC LTLPAFVVAN TQDGIARSHD WMRLGLMPQF
IIRQALIIGF TAGLFVLGFE LGAIAAMAAS CAAVWIAMLG QMIALNRRLA GHVPPGPRAY
DVRGWLATSL PILLVESFYL LLSYTDVLVL QQFSTPEEVG IYYAVVKTLA LVSFIHYAMS
ATTAHRFTEY NAAGDKVRLA AYVRHAIVWT FWPSLVATLA LLALGEPLLW LFGPQFTSGY
GIMFVAAIGL MVRAAIGPVE RLLNMLGHQH VCALAYALAF AVNLALCLIL VPRFGGYGAA
AATSAALTFE TVMLFWIVRK RLGLHVLAFG SKG
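
The gene and protein sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The sketch below shows one way to do that and then compute GC content and a translation. It assumes that translation table 11 assigns amino acids exactly as the standard genetic code does (the two differ in which start codons are permitted), and it reads the sequence in frame from the first base. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order. Assumption: table 11 uses the same
# codon-to-amino-acid assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence above.
print(translate("ATGGATTCTC CACCCGCGGC"))  # prints: MDSPPA
```

Passing the full gene sequence to `translate` should reproduce the listed protein sequence if the assumptions hold, and `gc_content` can be compared against the GC content field of the record.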