Gene RPB_3392 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_3392 
Symbol 
ID3911194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp3877125 
End bp3878213 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content63% 
IMG OID637885295 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_486999 
Protein GI86750503 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG4663] TRAP-type mannitol/chloroaromatic compound transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.354915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACGTC GTGACTTTCT GAAAGTATCA GCAACCGGCG CCGCGGTCGC GGCGGTGGCT 
TCGCCGGCGA TTGCGCAATC GTCCCCAGAG GTGAAGTGGC GGTTGACCTC GAGCTTCCCG
AAGTCGCTCG ACACCATCTA TGGCGGCGCG GAATATCTCG CGAAGCAGGT CGCCGAGATG
ACCGACAACA AGTTTCAGAT CCAGGTGTTC GCCGCCGGCG AAGTGGTCCC CGGCCTGCAG
GCGCTCGACG CGACCTCGAA CGGCACCGTC GAGATGTGCC ACACCGTGTC GTACTACTAT
GTCGGCAAGG ATCCGACCTT CGCGGTGTTC GCCGCGGTTC CGTTCGGCCT CAACGCCCGC
CAGCAGAATT CGTGGCTGTA CCAGGGCGGC GGCAACGAGC TCGCCAACGA GTTCTACAAG
AAGCACAACG TGGTCGGCTT CCCCTGCGGC AACACCGGCA CCCAGATGGG CGGCTGGTTC
CGCAAGGAGA TCAAGACCGT CGCCGACATG AGCGGCCTGA AGATGCGGAT CGGCGGCATC
GCCGGTCAGG TGCTGCAGAA GGTCGGCGTG GTGCCGCAGC AGATCGCCGG CGGCGACATC
TACCCGGCGC TGGAAAAGGG CACCATCGAC GCCGCCGAGT GGGTCGGCCC CTATGACGAC
GAGAAGCTCG GCTTCCAGAA GGTCGCGAAG TACTACTACT ATCCGGGCTT CTGGGAAGGC
GGCCCGACCG TCCACGCCTT CACCAATCTC GAGAAGTTCA ACGCGCTGCC GAAGAACTAT
CAGGCGATCC TCGCCAACGC GGCGGTGCAT ACCAACACCT GGATGAACGC GCGCTACGAC
ATGCTCAACC CGACCGCGCT GAAGCGGCTG GTGGCGAGCG GCACGCAGCT GCGTCCGTTC
TCCAACGAAA TCCTCGACGC CTGCCTCAAA TCGACCAACG AGCTGTGGGG CGAGATCTCG
GCCAAGAACG CCGACTTCAA GAAGGCGATC GACGCGATGC AGGCCTACCG CTCGGATCAG
TATCTGTGGT GGCAGGTCGC CGAATACACC TACGACAGCT TCATGATCCG CTCGCGCACC
CGCGGCTGA
 
Protein sequence
MKRRDFLKVS ATGAAVAAVA SPAIAQSSPE VKWRLTSSFP KSLDTIYGGA EYLAKQVAEM 
TDNKFQIQVF AAGEVVPGLQ ALDATSNGTV EMCHTVSYYY VGKDPTFAVF AAVPFGLNAR
QQNSWLYQGG GNELANEFYK KHNVVGFPCG NTGTQMGGWF RKEIKTVADM SGLKMRIGGI
AGQVLQKVGV VPQQIAGGDI YPALEKGTID AAEWVGPYDD EKLGFQKVAK YYYYPGFWEG
GPTVHAFTNL EKFNALPKNY QAILANAAVH TNTWMNARYD MLNPTALKRL VASGTQLRPF
SNEILDACLK STNELWGEIS AKNADFKKAI DAMQAYRSDQ YLWWQVAEYT YDSFMIRSRT
RG