Gene RPC_2067 details

Gene Information

Locus tag: RPC_2067
Symbol:
ID: 3974025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris BisB18
Kingdom: Bacteria
Replicon accession: NC_007925
Strand:
Start bp: 2262994
End bp: 2264082
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 63%
IMG OID: 637925175
Product: twin-arginine translocation pathway signal
Protein accession: YP_531940
Protein GI: 90423570
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
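
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length; both assumptions are consistent with the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are illustrative only.

```python
# Values copied from the record above.
START_BP = 2262994
END_BP = 2264082


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming a terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1089 362
```

Under these assumptions the result matches the Gene Length (1089 bp) and Protein Length (362 aa) fields.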


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.270219
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0802358
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACGTC GTGACTTTTT GAAAGTATCC GCAACCGGCG CTGCGGTCGC CGCGGTGGCG 
TCGCCGGCGA TTGCGCAGTC CTCGCCGGAA ATCAAGTGGC GGCTGACCTC GAGTTTCCCG
AAGTCGCTCG ACACCATCTA TGGCGGCGCG GAATATCTCG CCAAGCAAGT CGCCGAGATG
ACCGACAACA AGTTCCAGAT CCAGGTGTTC GCGGCCGGCG AAATCGTGCC CGGCCTGCAG
GCGCTCGACG CCACTTCGAA CAACACCGTC GAGATGAGCC ACACCGTGTC CTATTACTAC
GTCGGCAAGG ACCCGACCTT CGCGGTCTAT GCGTCGGTGC CGTTCGGGCT GAACGCGCGG
CAGCAGAATT CCTGGTGGTA TCAGGGCGGC GGCATGGAGC TCGGCAACGA GTTCTACAAG
AAATACGGCG TGATCGGATT TCCCTGCGGC AACACCGGGA CCCAGATGGG CGGCTGGTTC
CGCAAGGAGA TCAAGACCGT GGCCGATCTG TCCGGGCTGA AATTCCGCAT CGGCGGCATC
GCCGGCCAGG TGCTGCAGAA GCTCGGCGTG GTGCCGCAGC AGATCGCCGG CGGCGACATC
TATCCGGCGC TGGAAAAGGG CACCATCGAC GCGGCCGAGT GGGTCGGCCC CTATGACGAC
GAGAAGCTCG GCTTCCAGAA GGTCGCGAAG TACTACTACT ATCCGGGCTT CTGGGAAGGC
GGTCCGACCG TGCACGCCTT CGCCAATCTG GAAAAGTACA ACTCGCTGCC GAAGAGCTAT
CAGTCGATCC TGGCCAACGC CGCGCAGGCC ACCAACAGCT GGATGGCGGC GCGCTACGAC
ATGCAGAATC CGCCGGCGCT GAAGCGCCTG GTGGCAGGCG GCACGCAGCT GCGGCCGTTC
TCCAACGAGG TGCTGGACGC CTGCCTGAAG GCGACCAACG AATTGTGGGG CGAAATCTCC
GCCAAGAACG CCGACTTCAA GAAGGGCATC GACGCGATGC AGGCCTACCG CTCCGATCAG
TATCTGTGGT GGCAGGTCGC CGAATACACC TTCGACAGCT TCATGATCCG CTCGCGCACC
CGCGGCTGA
 
Protein sequence
MKRRDFLKVS ATGAAVAAVA SPAIAQSSPE IKWRLTSSFP KSLDTIYGGA EYLAKQVAEM 
TDNKFQIQVF AAGEIVPGLQ ALDATSNNTV EMSHTVSYYY VGKDPTFAVY ASVPFGLNAR
QQNSWWYQGG GMELGNEFYK KYGVIGFPCG NTGTQMGGWF RKEIKTVADL SGLKFRIGGI
AGQVLQKLGV VPQQIAGGDI YPALEKGTID AAEWVGPYDD EKLGFQKVAK YYYYPGFWEG
GPTVHAFANL EKYNSLPKSY QSILANAAQA TNSWMAARYD MQNPPALKRL VAGGTQLRPF
SNEVLDACLK ATNELWGEIS AKNADFKKGI DAMQAYRSDQ YLWWQVAEYT FDSFMIRSRT
RG
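
The sequences above are printed in blocks of 10 characters separated by spaces, so they need to be cleaned before any computation. The sketch below shows one way to strip the whitespace and compute a GC percentage in Python. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample is only the first two blocks of the gene sequence, not the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First two 10-base blocks of the gene sequence above, as printed.
sample = "ATGAAACGTC GTGACTTTTT"
dna = clean_sequence(sample)
print(len(dna), gc_percent(dna))
```

Applying the same two functions to the full gene sequence should give a value comparable to the GC content field in the record, assuming that field was computed over the gene itself.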