Gene RPD_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2049 
Symbol 
ID4022531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2297338 
End bp2298426 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content61% 
IMG OID637962242 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_569185 
Protein GI91976526 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.207546 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTC GTGACTTTCT GAAAGTATCA GCAACCGGCG CCGCGGTCGC GGCGGTGGCC 
TCGCCGGCGA TCGCTCAGTC GTCTCCCGAG GTGAAGTGGC GGTTGACCTC GAGCTTCCCG
AAGTCTCTCG ACACGATCTA TGGCGGTGCG GAATATCTCG CGAAGCAGGT CGCCGAGATG
ACCGACAACA AATTCCAAAT CCAGGTGTTC GCCGCCGGCG AAGTGGTTCC GGGTCTGCAG
GCGCTCGACG CCACGTCGAA CGGCACCGTG GAGATGAGCC ACACGGTTTC CTACTACTAT
GTCGGCAAGG ATCCGACCTT CGCGGTGTTC GCGTCGGTGC CGTTCGGTCT CAACGCGCGG
CAGCAGAACT CGTGGCTGTA TCAAGGCGGC GGCAACGAAC TGGCCAACGA ATTCTACAAG
AAGCACGGCG TGGTCGGCTT CCCCTGCGGC AACACCGGCA CCCAGATGGG CGGTTGGTTC
CGTAAGGAAA TCAAGACCGT CGCAGACATG TCGGGCCTGA AGATGCGGAT CGGCGGCATC
GCCGGTCAGG TGCTGCAGAA GGTCGGCGTT GTGCCGCAGC AGATCGCCGG CGGCGACATC
TATCCGGCGC TGGAAAAGGG CACCATCGAC GCGGCCGAAT GGGTCGGCCC CTATGACGAC
GAGAAGCTCG GCTTCCAGAA GGTCGCGAAG TACTATTACT ATCCGGGCTT CTGGGAAGGC
GGCCCGACCG TGCATGCCTT CGCGAACCTC GAAAAGTTCA ATGCGCTGCC GAAGAACTAT
CAGTCGATCC TGGCCAACGC CGCCGAGTCA ACCAACACCT GGATGGCGGC ACGCTATGAT
ATGCAGAATC CGACCGCGTT GAAGCGACTG GTGGCGAGCG GCACGCAGCT GCGTCCGTTC
TCCAACGAAA TCCTCGATGC CTGCCTCAAG GCCACCAACG AGCTGTGGGG CGAAATCTCG
GCGAAGAACG CCGACTTCAA GAAGGCGATC GACGCGATGC AGGCCTATCG CTCCGACCAG
TATCTGTGGT GGCAGGTCGC CGAATACACT TACGACAGCT TCATGATTCG CTCGCGCACC
CGCGGCTGA
 
Protein sequence
MKRRDFLKVS ATGAAVAAVA SPAIAQSSPE VKWRLTSSFP KSLDTIYGGA EYLAKQVAEM 
TDNKFQIQVF AAGEVVPGLQ ALDATSNGTV EMSHTVSYYY VGKDPTFAVF ASVPFGLNAR
QQNSWLYQGG GNELANEFYK KHGVVGFPCG NTGTQMGGWF RKEIKTVADM SGLKMRIGGI
AGQVLQKVGV VPQQIAGGDI YPALEKGTID AAEWVGPYDD EKLGFQKVAK YYYYPGFWEG
GPTVHAFANL EKFNALPKNY QSILANAAES TNTWMAARYD MQNPTALKRL VASGTQLRPF
SNEILDACLK ATNELWGEIS AKNADFKKAI DAMQAYRSDQ YLWWQVAEYT YDSFMIRSRT
RG