Gene Rpal_3782 details

Gene Information

Locus tag: Rpal_3782
Symbol:
ID: 6411460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4065182
End bp: 4066390
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 59%
IMG OID: 642713663
Product: polysaccharide export protein
Protein accession: YP_001992756
Protein GI: 192292151
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1596] Periplasmic protein involved in polysaccharide export
TIGRFAM ID:
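
The coordinate and length fields above are mutually consistent, which a short calculation can confirm. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length, and the function names are hypothetical rather than part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(4065182, 4066390)
print(length)                  # 1209
print(protein_length(length))  # 402
```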


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTACGCA AGACGTTTCA GGTGCTTGCT GGTCTGCTGT TGCTGATTTC GATGTCAGCG 
TGTTCGGTTC TGCCAGGCAC GGGCCCTTCA AGTGAATCAG TCGAGAATTA TGCGACGGCC
GGGGTTCGGT CGACGACTGC ATTGCCTTAC GCGCTGGTAG ACGTCTCCGC CGATACCATC
GGCTTCCTGT CCCAGCCCAA CGTGGTCTCG TTCAAAGGCT CGTTTAAAGA CAGACGGCCG
AAGCCTCAGC AGGTCATCGG CGTCGGTGAC GTCCTGAACA TCTCGATCTT CGAAGCCGCG
CCAGGGGGAC TGTTCACCCC TGGTCAATCT GCCGGCGCTC GCCCCGGTAA CTTCGTCGAT
CTGCCGCCGC AGGCTGTGGA CCAGCGCGGC AATATTTCCG TTCCTTACGC TGGCGAGGTG
CCCGCGGCTG GGCGAACGGT TCCCGAAGTT CAGCAGGCGG TGGTCGCAAG GCTGCGCAAC
AGGGCGATCG AGCCCCAGGT TGTCGTGAGC CTCAACCAGC AACATTCGAG CGTCGTAAGT
GTTCTGGGCG ACGTTAATAC TCCTGGCGTG TTCGCGCTCA ACAGCGTCGG TGAGAAGCTC
CTCGCGCTGA TCGCGCGCGC GGGTGGACCC AAGTATGAAG CGATCGAAAG CTATGTGACG
CTTCAGCGCG ATGGCAAGAA GGTGAAGGTC CTCCTGAGCC GGATCGTTCA CGATCCGTCA
GAGAACATCT TTGTCCGTCC CAACGACGTG ATCTTCCTTA CCCGGGAGGC ACCGACCTTC
ACGGCTCTTG GTGCTCTCAA TCAGAACGTG TTCGGCTATA ATTCTGAGCT GACCTTCGAC
GTCGAAACGC TGACGCTCGC CCAGGCAATC GGCAAGGCCG GCGGTCTGAA CGATCAGCAG
TCGGATCCGG CCGAAGTCTT CGTGTTCCGC TACGAGGATC GACCGCTGCT TGCGAAGCTC
GGCGTCGACA CCAACCGCTT CGTCTACGAC CGCATTCCGA CGATCTATCA CGTCAACCTG
CGGGATCCGG CCGGTATGCT TCTGGCCTCT GGCTTCCAGA TCCGAAGCAA GGACGTCATG
TACGTGGCAA ATGCGCGGGT GGTCGATTAC TACAAGCTCC TGACGCTGAT CAACAACACC
GCCAACACCA CGTCGAATGT GTCCAACGCG GCAATCAATG TGAACGCAGC GACGAAGACG
CGTTGGTGA
 
Protein sequence
MVRKTFQVLA GLLLLISMSA CSVLPGTGPS SESVENYATA GVRSTTALPY ALVDVSADTI 
GFLSQPNVVS FKGSFKDRRP KPQQVIGVGD VLNISIFEAA PGGLFTPGQS AGARPGNFVD
LPPQAVDQRG NISVPYAGEV PAAGRTVPEV QQAVVARLRN RAIEPQVVVS LNQQHSSVVS
VLGDVNTPGV FALNSVGEKL LALIARAGGP KYEAIESYVT LQRDGKKVKV LLSRIVHDPS
ENIFVRPNDV IFLTREAPTF TALGALNQNV FGYNSELTFD VETLTLAQAI GKAGGLNDQQ
SDPAEVFVFR YEDRPLLAKL GVDTNRFVYD RIPTIYHVNL RDPAGMLLAS GFQIRSKDVM
YVANARVVDY YKLLTLINNT ANTTSNVSNA AINVNAATKT RW
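
The protein sequence begins with M although the gene sequence begins with GTG, which normally encodes valine. Under translation table 11 an initiator GTG is read as methionine. The sketch below shows one way to reproduce that translation. It is a minimal illustration, not the pipeline used to generate this record: `translate_cds` is a hypothetical helper, and it treats only ATG, GTG and TTG as initiator codons (table 11 lists further, rarer ones). The internal codon assignments of table 11 are the same as the standard code.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering
# (identical for the standard code and translation table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: only the most common bacterial initiator codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an initiator codon as M."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGTACGCA AGACGTTTCA GGTGCTTGCT"))  # MVRKTFQVLA
```

Applied to the full gene sequence, the same function can be used to compare the result against the listed protein sequence.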