Gene Rpal_2087 details

Gene Information

Locus tag: Rpal_2087
Symbol:
ID: 6409747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 2260566
End bp: 2261966
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 67%
IMG OID: 642711972
Product: hypothetical protein
Protein accession: YP_001991084
Protein GI: 192290479
COG category: [R] General function prediction only
COG ID: [COG2358] TRAP-type uncharacterized transport system, periplasmic component
TIGRFAM ID:
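
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function name check_lengths is hypothetical and not part of any database tool.

```python
def check_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) implied by the coordinates.

    Assumes 1-based inclusive coordinates and a single terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("gene length is not a multiple of 3")
    # The stop codon does not encode an amino acid.
    return gene_length, codons - 1


# Values copied from the record above.
gene_length, protein_length = check_lengths(2260566, 2261966)
print(gene_length, protein_length)  # 1401 466
```

Both values agree with the Gene Length and Protein Length fields of the record.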


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.073016
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATCCC CGGACGGTAG TGGCAGCGAG AGTTTCCGCA CGCGGCGGCG CCGTAATCGC 
GCCCGTTTTG CACTGGCGGC GGTGCTAACC TTCATTCTGG TTGCGGTCGG CGTGCTCGCC
TTGGTGTTCT ACGAGTTACG ACCCGTCAGC CTGAAGATCG CTGTCGGGCC GGGAGGCGGC
GACGATCTGA AGCTGGTGCA AGCATTGGCC CAGAGTTTTG CGCGGGATCG CGCTGCCGTG
CGGCTGGTGC CATTGGTGAC CGGTGGGGCG ACGCCGAGCA TCGAGGCGTT TCGCGACCAC
AAGGCGGACC TCGCGGTCAC CCGGGCCGAT CTTGATCTGC CTGCCGACGC GCAGGCGGTG
GCGGTGCTGC GCAAGAACGT GGTGGTGCTG TGGGCGCCAT CGTCGGCCGG CAAGGGCAAG
AAGTCGATCA AGACCATCAC CGATCTGGCA GGCCGCCGGG TCGGCGTGAT TGGGCGGACG
CCGGCCAACG TCAAACTGTT CGACATCATT CTGGCCGAGT CCGGCGTCGC GCCCGACAAG
GTCCAGAAGG AGCAGTTCGC CACCGGCCAG CTCGCCGAGA TGGCGCGCGA TCCGTCGCTC
GACGCGTTTC TGGCGGTCGG GCCGCTCGAC AGCAAGATCA CCGGCGAAGC GATCGCCGCC
ACCGCGAAGG CGCGGGGCGA GCCGCGGTTC CTACCGGTCG ATGTTGGTGA TGCCATCGCC
AAGAAGTATC CGATCTACGA TTCCGAGGAG ATTCCGGGCA GCATCTTCTC CACCCAGCCG
GCGCGGCCCG AGGACAAGGT CGATACCGTC AGCGTCAACC ATCTGATCAT TGCTCGGCAG
TCGCTGTCCT CCGTGACGGT GACCAAGCTG ACGCGGCAGA TCTTCGCCGC CCGGCAGCAG
ATCGCCCGCG AGATGCCGCT CGCCGGCAAG ATCGAAGCGC CGGACACCGA GAAGGACGCG
GCGCTGCCGG CGCATCGCGG CGCCGCCGCG TTCATCGACG GCACCGACCG CACCTTCATG
GAGCGCAACA GCGATTACAT CTGGGGCTTG GTGCTGCTGC TGTCCGGCCT CGGCTCGGCC
GGCGCCTGGT TTCGCAGCTA TTTGACCCGC GACGAGCGGG AAGCCGGCGC CAAGATGCGC
GACCGCGCGC TCGCCATGGT GTCCAAGGCG CGGAAAGCGG AGACGCTGGA GGCGCTGGAC
GCGTTGCAGC ATGAGATCGA CAAGATCCTT CGTGACACGC TGGATTGCTA CGACGACGGT
GCGATCGAGG ATCTCGAGCC GTTCAGCCTG GTGCTGGAGC AGTTCCACCA CGCGGTGGTG
GACCGCCGTG CCGCGTTGAG CGCTTCCGGC CCGGGGCTGG TTCCGGCTGA TCCGGCGACA
CTGCCGGCCG CCCGCGCCTG A
 
Protein sequence
MASPDGSGSE SFRTRRRRNR ARFALAAVLT FILVAVGVLA LVFYELRPVS LKIAVGPGGG 
DDLKLVQALA QSFARDRAAV RLVPLVTGGA TPSIEAFRDH KADLAVTRAD LDLPADAQAV
AVLRKNVVVL WAPSSAGKGK KSIKTITDLA GRRVGVIGRT PANVKLFDII LAESGVAPDK
VQKEQFATGQ LAEMARDPSL DAFLAVGPLD SKITGEAIAA TAKARGEPRF LPVDVGDAIA
KKYPIYDSEE IPGSIFSTQP ARPEDKVDTV SVNHLIIARQ SLSSVTVTKL TRQIFAARQQ
IAREMPLAGK IEAPDTEKDA ALPAHRGAAA FIDGTDRTFM ERNSDYIWGL VLLLSGLGSA
GAWFRSYLTR DEREAGAKMR DRALAMVSKA RKAETLEALD ALQHEIDKIL RDTLDCYDDG
AIEDLEPFSL VLEQFHHAVV DRRAALSASG PGLVPADPAT LPAARA
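
The sequences above are printed in blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of how the blocked gene sequence could be cleaned, its GC fraction computed, and its codons translated. It assumes the amino acid assignments of translation table 11, which match the standard genetic code; the alternative start codons allowed by table 11 are not handled, which does not matter here because this gene begins with ATG. The helper names clean_sequence, gc_fraction and translate are hypothetical.

```python
# Standard codon table built from the usual TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 bases and the final line of the gene sequence above.
print(translate(clean_sequence("ATGGCATCCC CGGACGGTAG T")))  # MASPDGS
print(translate(clean_sequence("CTGCCGGCCG CCCGCGCCTG A")))  # LPAARA
```

The two fragments translate to the first seven and last six residues of the protein sequence listed above. Applying clean_sequence to all lines of the gene sequence and passing the result to gc_fraction and translate would allow the GC content and Protein Length fields to be checked in the same way.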