Gene Rpal_5059 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_5059 
Symbol 
ID6412753 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5442525 
End bp5443490 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content66% 
IMG OID642714944 
ProductPDZ/DHR/GLGF domain protein 
Protein accessionYP_001994023 
Protein GI192293418 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTCGC TCCCCGAATG GAATGTCCCG GCCGCGATTC GGCCTCGCTC GGCCGATTTC 
GGCTTCGATC TCGATCGCGC GTTGTCTGCG GTGGTCGGGC TGCACGCGAT CATTCCAGCC
GATGCCTACA CCGCCGGTAC GCTCGGCACC GAGCGCGCCG GCAACGGCGT GCTGATCGAC
GACGGTCTGG TACTGACCAT CGGTTATCTG ATCACCGAAG CCGAAACGGT GTGGCTGCAT
CTCGGTGACG GCCGCGTCGT TGAGGGCCAT GCGCTCGGAT TCGATCAGGA GAGCGGCTTC
GGCCTGGTTC AAGCGCTCGG CCCAATCGAT CTGCCGCCGC TACCGCTCGG CCGCTCTGCT
TTCGCCAAGG CGGGCGAGCG CGTCATTATC GGCGGCGTCG GTGGCCGCAC ACGGTCGGTG
GCCGGCCGCA TCGCCACACG TCAGGAATTC GCCGGCTACT GGGAGTATCT GCTCGACGAT
GCGATCTTCA CCGAGCCGTC GCATCCGAAC TGGGGCGGCA CCGCGCTGCT GTCGGCGACC
GGCGAACTGA TCGGCGTCGG CTCGCTGCAG ATCGAACGCA GCGGCTCGAA CGAGCATTAC
AATTTGAGCG TGCCGATCGA TCTGCTGCCA CCTGTGCTGA GCGATCTTCG GAAGTTCGGC
CGGCCGAACA AGCCGCCGCG GCCGTGGCTG GGGCTGTATT CGACCGAGAT CGAAGACAAG
GTCGTGGTGG TCGGAATTGC GCCGAAGGGC CCGGCGGCGC GCGCCGAGCT GAAGACCGGC
GACGTGATCC TCGCAGTCGC GGGCGACAAG GTGACCAGTG AAGCGGAGTT CTATCGCAAG
GTCTGGGCAC TGGGCACTGC AGGCGTAGAG GTGCCACTGA CGCTGTTCAG CGGCGGCGCC
ACCTTCGACG TGGTGCTGCA TTCCTCCGAC CGCGCCAAGT TCCTCAAGGC ACCGCGGCTG
CATTGA
 
Protein sequence
MPSLPEWNVP AAIRPRSADF GFDLDRALSA VVGLHAIIPA DAYTAGTLGT ERAGNGVLID 
DGLVLTIGYL ITEAETVWLH LGDGRVVEGH ALGFDQESGF GLVQALGPID LPPLPLGRSA
FAKAGERVII GGVGGRTRSV AGRIATRQEF AGYWEYLLDD AIFTEPSHPN WGGTALLSAT
GELIGVGSLQ IERSGSNEHY NLSVPIDLLP PVLSDLRKFG RPNKPPRPWL GLYSTEIEDK
VVVVGIAPKG PAARAELKTG DVILAVAGDK VTSEAEFYRK VWALGTAGVE VPLTLFSGGA
TFDVVLHSSD RAKFLKAPRL H