Gene Information |
Locus tag | Rpal_4665 |
Symbol | |
ID | 6412351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | - |
Start bp | 5023582 |
End bp | 5025498 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642714544 |
Product | oligoendopeptidase, pepF/M3 family |
Protein accession | YP_001993631 |
Protein GI | 192293026 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1164] Oligoendopeptidase F |
TIGRFAM ID | [TIGR02290] oligoendopeptidase, pepF/M3 family |
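The Start bp, End bp, Gene Length, and Protein Length fields above agree with each other if the coordinates are 1-based and inclusive and the CDS ends in a single stop codon. The sketch below checks that arithmetic. The function names are illustrative only and do not belong to any database API, and the coordinate convention is an assumption inferred from the numbers in this record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the record for Rpal_4665.
length_bp = gene_length(5023582, 5025498)
print(length_bp)                  # 1917
print(protein_length(length_bp))  # 638
```

The strand field does not enter this calculation: a feature on the minus strand spans the same number of bases as one on the plus strand.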
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAAT CTGCGACCTC ACGTACCGCG AAGACCTCTT CCCGCAAGTC CACCCCCGCC AAGCCAGCAC CGCGCAAGGC TGCACCCAGC AAGGCGAAAG CGGCCAAGAC CGCGGCTGCG GGACGGAAGG CTGCCGCCAA GCCGGCGAAG CTTCCTGAGT GGAATTTGAT GGATCTGTAT CCGGCGATCG GCTCGCCGGA GGTCGCCGCC GATCTCGACC GGCTCGATGC CGAATGCGCC TCGTTCGAAG CCGACTTCAA GGGGCGGTTG GCCGAAGAGA CCGCGAAGGA CGGTGGCGCG CTGTGGCTCG CCGGCGCGGT CAAGCGCTAT GAAGCGATCG AGGATCTGGC CGGTCGGCTC GGTTCCTATG CTGGGCTGGT GCACGCCGGC GACAGCGTCG ATCCGGTGAA GTCGAAATTC TATGGCGACG TCTCCGAGCG CCTAACCGCC GCCTCGGTAC ATCTGCTGTT CTTCACCCTC GAACTCAACC GCGTCGACGA TGCGGTGCTT GAGACCGCGA TGCAGACGCC GGAGCTCGGC CACTACCGGC CGTGGATCGA GGATCTGCGC AAGGACAAGC CGTATCAGCT CGAAGACCGC ATCGAGCAGC TGTTCCACGA GAAGTCCCAG ACCGGTTACG GCGCGTTCAA TCGCCTGTTC GACCAGACCA TCTCGTCGCT GCGCTTCAGG GTTGGCGGCA AGGAATTGGC GATCGAGCCG ACGCTGAACC TGATGCAGGA TCGTGCGCCG GCGAAACGCA AGGCCGCGGC CGAAGCCTTG GCCAAGACCT TCAAGGCGAA TGAGCGCACC TTCGCGCTGA TCACCAACAC GCTCGCCAAA GACAAGGAAA TCTCCGACCG TTGGCGAGGC TTTGAAGACG TCGCCGACAG CCGCCACCTC GCCAACCGGG TCGAGCGCGA AGTGGTCGAC GCGCTGGTCG CCTCGGTGCG AGCGGCGTAT CCGAAGCTGT CGCACCGCTA CTACAAGCTC AAGGCCAGTT GGTTCAAAAA GAAGAAGCTG CCGTATTGGG ATCGCAATGC GCCGCTGCCG TTCGCGGCGA CCGGCTCGAT CGCCTGGCCG GACGCGCGCA ATATGGTGCT GACCGCCTAC AAGGCGTTCT CGCCGGAGAT GGCGCAGATC GCCGAGCGGT TCTTCACCGA TCGCTGGATC GATGCCCCGG TCCGGCCGGG CAAGGCGCCC GGCGCGTTTT CGCATCCGAC CACGCCGTCG GCGCATCCCT ATGTGCTGAT GAACTATCAG GGCAAGCCGC GGGACGTAAT GACGCTCGCC CACGAGCTCG GCCACGGCGT TCACCAGGTG CTCGCCGCCA AGAACGGCGC GCTGATGGCG CCGACGCCGC TGACGCTGGC GGAGACTGCC AGCGTGTTCG GCGAAATGCT CACCTTCCGC CGTCTGCTGA CCCAGACCAA GGACCGCAAG CAGCGCCAGG CGCTGCTCGC CGGCAAGGTC GAGGACATGA TCAACACCGT GGTCCGGCAG ATCGCTTTCT ATTCGTTCGA GCGCGCGGTC CACACCGAGC GCCGCAGTGG CGAGCTCACC GCGCAGCGGA TCGGTGAGAT CTGGCTGTCG GTGCAGGGCG AGAGCCTCGG CCCGGCGATC GAGATCAAGC CGGGCTACGA GAGCTTCTGG ATGTATATCC CGCACTTCAT CCATTCGCCG TTCTACGTTT ACGCCTACGC GTTCGGCGAC TGCTTGGTGA ACTCGCTCTA TGCGGTCTAC GAGCACGCCC AGGAAGGCTT CGCCGAGCGT TATCTGGCGA TGCTGGCGGC CGGCGGGACC 
AAGCATTATT CGGAACTGCT GGCGCCGTTC GGGCTCGACG CCAAGAACCC GAGCTTCTGG GACGGCGGCC TGTCAGTGAT CGCCGGCATG ATCGACGAGC TGGAGGCGAT GGGGTAG
|
Protein sequence | MAKSATSRTA KTSSRKSTPA KPAPRKAAPS KAKAAKTAAA GRKAAAKPAK LPEWNLMDLY PAIGSPEVAA DLDRLDAECA SFEADFKGRL AEETAKDGGA LWLAGAVKRY EAIEDLAGRL GSYAGLVHAG DSVDPVKSKF YGDVSERLTA ASVHLLFFTL ELNRVDDAVL ETAMQTPELG HYRPWIEDLR KDKPYQLEDR IEQLFHEKSQ TGYGAFNRLF DQTISSLRFR VGGKELAIEP TLNLMQDRAP AKRKAAAEAL AKTFKANERT FALITNTLAK DKEISDRWRG FEDVADSRHL ANRVEREVVD ALVASVRAAY PKLSHRYYKL KASWFKKKKL PYWDRNAPLP FAATGSIAWP DARNMVLTAY KAFSPEMAQI AERFFTDRWI DAPVRPGKAP GAFSHPTTPS AHPYVLMNYQ GKPRDVMTLA HELGHGVHQV LAAKNGALMA PTPLTLAETA SVFGEMLTFR RLLTQTKDRK QRQALLAGKV EDMINTVVRQ IAFYSFERAV HTERRSGELT AQRIGEIWLS VQGESLGPAI EIKPGYESFW MYIPHFIHSP FYVYAYAFGD CLVNSLYAVY EHAQEGFAER YLAMLAAGGT KHYSELLAPF GLDAKNPSFW DGGLSVIAGM IDELEAMG
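The Translation table, GC content, Gene sequence, and Protein sequence fields can be cross-checked with a few lines of code. The sketch below is one possible way to do so and is not the method used to produce this record. It assumes the sequence is given in coding orientation (as it is here, starting with ATG), that it contains only A, C, G, and T, and that whitespace between 10-base groups should be ignored. The helper names are hypothetical.

```python
# NCBI translation table 11 uses the same codon-to-amino-acid assignments
# as the standard code; it differs in which codons may initiate translation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop whitespace (the record groups bases in blocks of 10) and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Simplification: an alternative start codon (e.g. GTG) is not converted
    to M here, which is harmless for a CDS that starts with ATG.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGGCGAAAT CTGCGACCTC ACGTACCGCG"))  # MAKSATSRTA
```

Passing the full Gene sequence value to `translate` should reproduce the Protein sequence value, and `gc_percent` on the same string can be compared with the GC content field.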