Gene Rpal_4665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4665 
Symbol 
ID6412351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5023582 
End bp5025498 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content65% 
IMG OID642714544 
Productoligoendopeptidase, pepF/M3 family 
Protein accessionYP_001993631 
Protein GI192293026 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1164] Oligoendopeptidase F 
TIGRFAM ID[TIGR02290] oligoendopeptidase, pepF/M3 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAAT CTGCGACCTC ACGTACCGCG AAGACCTCTT CCCGCAAGTC CACCCCCGCC 
AAGCCAGCAC CGCGCAAGGC TGCACCCAGC AAGGCGAAAG CGGCCAAGAC CGCGGCTGCG
GGACGGAAGG CTGCCGCCAA GCCGGCGAAG CTTCCTGAGT GGAATTTGAT GGATCTGTAT
CCGGCGATCG GCTCGCCGGA GGTCGCCGCC GATCTCGACC GGCTCGATGC CGAATGCGCC
TCGTTCGAAG CCGACTTCAA GGGGCGGTTG GCCGAAGAGA CCGCGAAGGA CGGTGGCGCG
CTGTGGCTCG CCGGCGCGGT CAAGCGCTAT GAAGCGATCG AGGATCTGGC CGGTCGGCTC
GGTTCCTATG CTGGGCTGGT GCACGCCGGC GACAGCGTCG ATCCGGTGAA GTCGAAATTC
TATGGCGACG TCTCCGAGCG CCTAACCGCC GCCTCGGTAC ATCTGCTGTT CTTCACCCTC
GAACTCAACC GCGTCGACGA TGCGGTGCTT GAGACCGCGA TGCAGACGCC GGAGCTCGGC
CACTACCGGC CGTGGATCGA GGATCTGCGC AAGGACAAGC CGTATCAGCT CGAAGACCGC
ATCGAGCAGC TGTTCCACGA GAAGTCCCAG ACCGGTTACG GCGCGTTCAA TCGCCTGTTC
GACCAGACCA TCTCGTCGCT GCGCTTCAGG GTTGGCGGCA AGGAATTGGC GATCGAGCCG
ACGCTGAACC TGATGCAGGA TCGTGCGCCG GCGAAACGCA AGGCCGCGGC CGAAGCCTTG
GCCAAGACCT TCAAGGCGAA TGAGCGCACC TTCGCGCTGA TCACCAACAC GCTCGCCAAA
GACAAGGAAA TCTCCGACCG TTGGCGAGGC TTTGAAGACG TCGCCGACAG CCGCCACCTC
GCCAACCGGG TCGAGCGCGA AGTGGTCGAC GCGCTGGTCG CCTCGGTGCG AGCGGCGTAT
CCGAAGCTGT CGCACCGCTA CTACAAGCTC AAGGCCAGTT GGTTCAAAAA GAAGAAGCTG
CCGTATTGGG ATCGCAATGC GCCGCTGCCG TTCGCGGCGA CCGGCTCGAT CGCCTGGCCG
GACGCGCGCA ATATGGTGCT GACCGCCTAC AAGGCGTTCT CGCCGGAGAT GGCGCAGATC
GCCGAGCGGT TCTTCACCGA TCGCTGGATC GATGCCCCGG TCCGGCCGGG CAAGGCGCCC
GGCGCGTTTT CGCATCCGAC CACGCCGTCG GCGCATCCCT ATGTGCTGAT GAACTATCAG
GGCAAGCCGC GGGACGTAAT GACGCTCGCC CACGAGCTCG GCCACGGCGT TCACCAGGTG
CTCGCCGCCA AGAACGGCGC GCTGATGGCG CCGACGCCGC TGACGCTGGC GGAGACTGCC
AGCGTGTTCG GCGAAATGCT CACCTTCCGC CGTCTGCTGA CCCAGACCAA GGACCGCAAG
CAGCGCCAGG CGCTGCTCGC CGGCAAGGTC GAGGACATGA TCAACACCGT GGTCCGGCAG
ATCGCTTTCT ATTCGTTCGA GCGCGCGGTC CACACCGAGC GCCGCAGTGG CGAGCTCACC
GCGCAGCGGA TCGGTGAGAT CTGGCTGTCG GTGCAGGGCG AGAGCCTCGG CCCGGCGATC
GAGATCAAGC CGGGCTACGA GAGCTTCTGG ATGTATATCC CGCACTTCAT CCATTCGCCG
TTCTACGTTT ACGCCTACGC GTTCGGCGAC TGCTTGGTGA ACTCGCTCTA TGCGGTCTAC
GAGCACGCCC AGGAAGGCTT CGCCGAGCGT TATCTGGCGA TGCTGGCGGC CGGCGGGACC
AAGCATTATT CGGAACTGCT GGCGCCGTTC GGGCTCGACG CCAAGAACCC GAGCTTCTGG
GACGGCGGCC TGTCAGTGAT CGCCGGCATG ATCGACGAGC TGGAGGCGAT GGGGTAG
 
Protein sequence
MAKSATSRTA KTSSRKSTPA KPAPRKAAPS KAKAAKTAAA GRKAAAKPAK LPEWNLMDLY 
PAIGSPEVAA DLDRLDAECA SFEADFKGRL AEETAKDGGA LWLAGAVKRY EAIEDLAGRL
GSYAGLVHAG DSVDPVKSKF YGDVSERLTA ASVHLLFFTL ELNRVDDAVL ETAMQTPELG
HYRPWIEDLR KDKPYQLEDR IEQLFHEKSQ TGYGAFNRLF DQTISSLRFR VGGKELAIEP
TLNLMQDRAP AKRKAAAEAL AKTFKANERT FALITNTLAK DKEISDRWRG FEDVADSRHL
ANRVEREVVD ALVASVRAAY PKLSHRYYKL KASWFKKKKL PYWDRNAPLP FAATGSIAWP
DARNMVLTAY KAFSPEMAQI AERFFTDRWI DAPVRPGKAP GAFSHPTTPS AHPYVLMNYQ
GKPRDVMTLA HELGHGVHQV LAAKNGALMA PTPLTLAETA SVFGEMLTFR RLLTQTKDRK
QRQALLAGKV EDMINTVVRQ IAFYSFERAV HTERRSGELT AQRIGEIWLS VQGESLGPAI
EIKPGYESFW MYIPHFIHSP FYVYAYAFGD CLVNSLYAVY EHAQEGFAER YLAMLAAGGT
KHYSELLAPF GLDAKNPSFW DGGLSVIAGM IDELEAMG