Gene Information |
Locus tag | Rcas_3030 |
Symbol | |
ID | 5540526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 3925498 |
End bp | 3927432 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640895150 |
Product | peptidase S9 prolyl oligopeptidase |
Protein accession | YP_001433103 |
Protein GI | 156742974 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
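The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the protein length excludes the stop codon. The function names `gene_length` and `protein_length` are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table above.
start_bp = 3925498
end_bp = 3927432


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_bp):
    """Residues encoded by an unspliced CDS, not counting the stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1935 644
```

The result matches the listed Gene Length (1935 bp) and Protein Length (644 aa).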
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCACC CGCAGATCGC GCCATACGGC TCGTGGCGCT CACCGATAAC TGCTGCCCTG GTCGCAACAT CGGGTGTTTC TCTTAGCACG ATTGCGCTCG ATGGCGACAA CATCTACTGG CTCGAAGGGC GTCCCGCCGA GGGTGGGCGC GTGGTGGTGG TGCGACGCAC CGCCGATGGC GCCATTGCCG ATGTGACGCC GCAGGGTTTC AATGTGCGCA CCCGCGTTCA CGAATATGGA GGAGCGCCGT ACACGGTTGA CCAGGGTATG GTCTATTTCA GCAACTTTGC CGATCAGCGC CTCTACTGCC AGCGCCCCGG TGCAGCGCCG GAACCGATCA CACCCGAAAC GCCATGGCGC TATGCCGACT TCGAGGTTGA TCGTCGGCGC AACCGGCTGA TCGGCGTGCG TGAGGACCAC TCTGGCAGCG GTGAAGCAGT CAATACGATT GTCGCCATCT CGCTCGATGG CGCTGCCGAA CAGCGCGTGC TGATCAGCGG CGCAGATTTT TATGCGAATC CGCGGCTTAG TCCAGACGGC CAATGGCTGG CATGGCTTTC CTGGAACCAT CCGAACATGC CATGGGACGC CGCTGAGCTG TGGGTCGCGC CGGTGCGCGA AGATGGGATG CCGGGTGCTG CCGAACGGAT CGCCGGCGGT CCTGACGATG CGGCGTTTCA ACCAGCGTGG GGACCGGACG GCGCGCTCTT CTTTGTCGCC GAGCGCACCG GTTGGTGGAA CCTTTACCGC TGGCACAATG GTGTTGTCCA CGCGCTCTGT CCGATGGAAG CCGAATTCGG TCTGCCACTC TGGGTCTTCG GCGCACGCAC CTATGCTGTC GAGTCGGAGG ATCGCCTGGT CTGCACGTAT ATCGAGCGCG GCGAGCACAA AATGGCACTG CTTGATGTCC GAAGTGGGAA CCTGACGCCG CTCGAACTGC CGTTCAGCGA TTTCGGGTTC ACCGGTCCGC GCGCCACTGG CGGCAGAGTC GTCTTCGTTG GCGCCTCACC AGCCGCGCCT GCTGCCCTGG TCATGCTCGA CCTGGCGAGT GGTGCGCTGA CAACCGTTCG CCGCTCGATG GAGATGCAGA TCGACCCTGG CTTTATCTCG ACGCCGCAGG TGATCGAATT TCCCACCGAA GGCGGCGTGA CTGCGTTCGG CTTCTATTAC CCGCCGCGCA ACCGTGATTT TCTGGCGCCG GAAGGCGAAA AGCCGCCGTT GCTCGTCCTG AGCCATGGAG GACCGACCGG CGCAACCTCG GCGTCATTTG ATCCCGGCAT TCAGTTCTGG ACGAGCCGCG GCATTGCAGT GATGGATGTC AACTACGGCG GCAGCACCGG ATTCGGGCGC GCCTACCGCC AGCGCCTCGA CGGTCGGTGG GGCATTGTGG ACGTCGACGA CTGCTGCAAT GCGGCGATGT ACCTGGCAGC GCAGGGGCTG GCAGACCCGG AACGTCTGAT CATCGCCGGC GGCAGTGCCG GCGGGTACAC CACGCTGGCG GCGCTCACCT TCCGCCACGT GTTCAAAGTC GGCGCCAGTT TCTACGGCGT CAGCGACCTG GAGGCGCTGG CGCGCGACAC CCATAAGTTC GAGTCGCGCT ACCTCGACCG GTTGGTAGGA CCATACCCGG AGCGCGTCGA TATCTACCAC GCGCGCTCGC CGATCTATCA TATCGAGCGG CTCAACTGCC CGGTGATCTT CCTGCAAGGG CTGGAAGACA AAGTCGTACC GCCGGATCAA TCCGAGCGGA TGGCGGCGGC GCTGCGCGCG AAGGGCATTC CGGTCGCGTA TCTGGCGTTC 
GAGGGCGAGC AACACGGTTT TCGTAAAGCA GAGACCATCA TTCGTGCGCT GGAAGCCGAG TTATACTTCT ACGCGCGTAT CCTGGGGTTT GAACTCGCCG ATCCGGTCGC GCCGATTGTA ATCGACAATC TGTGA
Protein sequence | MTHPQIAPYG SWRSPITAAL VATSGVSLST IALDGDNIYW LEGRPAEGGR VVVVRRTADG AIADVTPQGF NVRTRVHEYG GAPYTVDQGM VYFSNFADQR LYCQRPGAAP EPITPETPWR YADFEVDRRR NRLIGVREDH SGSGEAVNTI VAISLDGAAE QRVLISGADF YANPRLSPDG QWLAWLSWNH PNMPWDAAEL WVAPVREDGM PGAAERIAGG PDDAAFQPAW GPDGALFFVA ERTGWWNLYR WHNGVVHALC PMEAEFGLPL WVFGARTYAV ESEDRLVCTY IERGEHKMAL LDVRSGNLTP LELPFSDFGF TGPRATGGRV VFVGASPAAP AALVMLDLAS GALTTVRRSM EMQIDPGFIS TPQVIEFPTE GGVTAFGFYY PPRNRDFLAP EGEKPPLLVL SHGGPTGATS ASFDPGIQFW TSRGIAVMDV NYGGSTGFGR AYRQRLDGRW GIVDVDDCCN AAMYLAAQGL ADPERLIIAG GSAGGYTTLA ALTFRHVFKV GASFYGVSDL EALARDTHKF ESRYLDRLVG PYPERVDIYH ARSPIYHIER LNCPVIFLQG LEDKVVPPDQ SERMAAALRA KGIPVAYLAF EGEQHGFRKA ETIIRALEAE LYFYARILGF ELADPVAPIV IDNL
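The listed gene sequence starts with ATG and ends with TGA, so it appears to be given in coding orientation already, even though the gene lies on the minus strand. The sketch below translates it codon by codon. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean` and `translate` are illustrative, not taken from any library.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order; assumed to apply
# to translation table 11 as well.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces that separate the 10-letter blocks in the record."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding-strand DNA sequence up to the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the gene sequence above.
print(translate("ATGACCCACC CGCAGATCGC GCCATACGGC"))  # MTHPQIAPYG
print(translate("ATCGACAATC TGTGA"))                  # IDNL
```

Both fragments agree with the start (MTHPQIAPYG) and end (IDNL) of the listed protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 644 residues.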