Gene RPB_0356 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_0356 
Symbol 
ID3908622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp398221 
End bp399282 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content70% 
IMG OID637882242 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_483978 
Protein GI86747482 
COG category[R] General function prediction only 
COG ID[COG2220] Predicted Zn-dependent hydrolases of the beta-lactamase fold 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCCGA TCTCGCGCCG AACGTTGCTG GCGAGCCTCG CCGCCCTCGC CGCCACGGCC 
GGGCTTTCCT CGTTCTGGGT TTCCCGCATG ACCTCCTACA AAGGCCCGAT CACCGATCAT
TTCGACGGCG AGCGCTTCTT CGATCGCGAC GGCGCGGCGC CGAAGGGGTG GCTCGACGTG
CTGCGCTGGC GCTTCACCAC CAAGCCGGCC AAATGGCCGG ACTGGGCGCC GAGCCCGTTC
GCCGACACCC CGCCGCCGCG CGTCGAAGGC GCCAGGGCGC GGCTGAGTTT CGTCGGTCAT
GCGAGCTGGC TGATTCAGAC CGGCGGGCTC AATATCCTGG TCGATCCGGT GTGGTCGGAG
CGGGTGTCGC CGGTCAGCTT CGCCGGCCCC AAGCGGCACA ACGATCCCGG CATCGCGTTC
GACAAGCTGC CCAAGATCGA CATCGTGCTG GTGTCGCACG GCCACTACGA TCACCTCGAC
CTGGCGACGC TGTCGCGGCT CGCCGCGCAG CATGCACCGC GGGTGATCAC GCCGCTCGGC
AACGATCTGA CGATGGCTTC GCACGACAGC GCGATCCGCG CCGAGGCCTA TGACTGGCGC
GACCGCGTCG AGCTCGGGCC CGGCGTCGCC GTGACGCTGG TGCCGACCCG GCACTGGACC
GCGCGCGGCC CGTTCGACCG CAATCGCGCG CTGTGGGCGT CGTTCGTCCT GGAGACGCCG
GCCGGCAGGA TCTACGTCGT CTGCGATTCC GGCTATGGCG ACGGCCGGCA CTTCCGTAAC
GTCCGCGAGG CGCACGCGCC GCTGCGGCTG GCGATCCTGC CGATCGGCGC CTATGCGCCG
CGCTGGTTCA TGAAGGACCA GCACATGAAC CCCGCCGACG CCGTGATGGC GCTGGCGGAT
TGCGGCGCCC GGCAGGCGCT GGCGAACCAT CACGGCACCT TCCAGCTCAC CGACGAGGCG
ATCGATGCGC CGGAGCTGGA ACTGTATGCG GCGCTCGACG CCGCTGCGGT GCCGCGCGAG
CGCTTTCCGG TGCTGAAGCC GGGGCAGGTT TTCGAAATCT GA
 
Protein sequence
MNPISRRTLL ASLAALAATA GLSSFWVSRM TSYKGPITDH FDGERFFDRD GAAPKGWLDV 
LRWRFTTKPA KWPDWAPSPF ADTPPPRVEG ARARLSFVGH ASWLIQTGGL NILVDPVWSE
RVSPVSFAGP KRHNDPGIAF DKLPKIDIVL VSHGHYDHLD LATLSRLAAQ HAPRVITPLG
NDLTMASHDS AIRAEAYDWR DRVELGPGVA VTLVPTRHWT ARGPFDRNRA LWASFVLETP
AGRIYVVCDS GYGDGRHFRN VREAHAPLRL AILPIGAYAP RWFMKDQHMN PADAVMALAD
CGARQALANH HGTFQLTDEA IDAPELELYA ALDAAAVPRE RFPVLKPGQV FEI