Gene Rpal_4338 details

Gene Information

Locus tag: Rpal_4338
Symbol:
ID: 6412022
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris TIE-1
Kingdom: Bacteria
Replicon accession: NC_011004
Strand:
Start bp: 4664819
End bp: 4665823
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 70%
IMG OID: 642714220
Product: peptidase S58 DmpA
Protein accession: YP_001993309
Protein GI: 192292704
COG category: [E] Amino acid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3191] L-aminopeptidase/D-esterase
TIGRFAM ID:
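
The coordinate and length fields in this record agree with one another. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4664819
end_bp = 4665823

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1005 bp
print(protein_length, "aa")  # 334 aa
```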


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.105129
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGCACAATC TCATCACCGA TGTCGCCGGC GTCCGCGTCG GCCATGCGCA CGACCACAAG 
CTCGCTTCCG GCGTTACCGC GATCCTGTTC GACAAGCCCG CGGTCGCTTC GATCGACGTG
CGCGGTGGCG GTCCTGGGAT TCGCGACGGC GCGCTGCTGG AACCGGTGAA CACCGTCGAG
CAGATCGACG GCTTCACGCT GTCGGGCGGC TCGGCATTCG GCCTCGATTC CGGCGGCGGC
GTGCAGGCCT GGCTCGCCGA GCGCGGTCGC GGTTTTGCGA TCGGCAATGC GACGATTCCG
ATCGTGCCGG GCGCGGTGGT GTTCGACATG ATCAACGGCG GCGACAAGGC CTGGGGTCGG
TTCTCGCCGT ATCGCGACCT CGGCTACGCG GCTGCGGACG CCGCCGGCGA CAGCTTCGCG
CTCGGCAGTG TCGGCGCCGG CCTCGGCGCC ACCACGGCAA CGCTGAAGGG CGGGCTGGGC
TCGGCGTCAG CAACCACGCC CGGCGGCGTC ACGGTGGGCG CCATCGCGGT GGTCAACGCG
ATCGGCAGCG CCACGATCGG CGACGGCCCG TGGTTCTGGT CGGCACCGTT CGAACAGGAC
GGCGAATTCG GCGGGCTCGG GATGCCGGAA AGCTTCACGC CGGACATGCT GAAGGTGCGA
CTGAAGGGCG CGGCGGCAGC GAGCGCGATC GAGAACACCA CGCTGGTCGC GGTGGTGACC
GACGCGAACC TCACCAAGCC GCAGGTGAAG CGGCTGGCGA TGCTGGCGCA GACCGGGTTC
GCCCGCGCGA TCTATCCGGT GCACGCGCCG CTCGATGGCG ACGTGGTGTT TGCCGCGGCG
ACCGGCGTCA AACCGGTCGA GCCGCTCGCA GGTCTCACCG AGCTCGGCAC CATCGCGGCC
AACACGGTGG CGCGGGCAAT CGCTCGCGGC GTCTATGAGG CCACCGCGCT GCCGTTCAAG
GACGCGCAGC CGGCGTGGCG CGATCGGTTC GGCTCGAAGC GATAA
 
Protein sequence
MHNLITDVAG VRVGHAHDHK LASGVTAILF DKPAVASIDV RGGGPGIRDG ALLEPVNTVE 
QIDGFTLSGG SAFGLDSGGG VQAWLAERGR GFAIGNATIP IVPGAVVFDM INGGDKAWGR
FSPYRDLGYA AADAAGDSFA LGSVGAGLGA TTATLKGGLG SASATTPGGV TVGAIAVVNA
IGSATIGDGP WFWSAPFEQD GEFGGLGMPE SFTPDMLKVR LKGAAAASAI ENTTLVAVVT
DANLTKPQVK RLAMLAQTGF ARAIYPVHAP LDGDVVFAAA TGVKPVEPLA GLTELGTIAA
NTVARAIARG VYEATALPFK DAQPAWRDRF GSKR
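
The protein sequence can be related to the gene sequence with a short translation routine. The sketch below is illustrative and not the database's own pipeline: `translate_cds`, `gc_fraction`, and `clean` are hypothetical helper names, the codon table is the standard one (which translation table 11 shares for elongation codons), and the first codon is assumed to be an initiator read as M, which is how the TTG start of this gene corresponds to the leading M of the protein.

```python
# Standard codon table in TCAG order; translation table 11 uses the same
# codon-to-residue assignments and differs in the start codons it permits.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Assumption: the first codon is a start codon and is emitted as M
    regardless of its usual meaning (for example TTG, normally L).
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append("M" if pos == 0 else residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
prefix = "TTGCACAATC TCATCACCGA TGTCGCCGGC"
print(translate_cds(prefix))  # MHNLITDVAG
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` gives values to compare against the Protein sequence block and the GC content field.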