Gene Information |
Locus tag | Cagg_1344 |
Symbol | |
ID | 7268636 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chloroflexus aggregans DSM 9485 |
Kingdom | Bacteria |
Replicon accession | NC_011831 |
Strand | + |
Start bp | 1663925 |
End bp | 1665814 |
Gene Length | 1890 bp |
Protein Length | 629 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643566187 |
Product | peptidase S9 prolyl oligopeptidase active site domain protein |
Protein accession | YP_002462687 |
Protein GI | 219848254 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1506] Dipeptidyl aminopeptidases/acylaminoacyl-peptidases |
TIGRFAM ID | |
|
|
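The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for Cagg_1344.
length_bp = gene_length(1663925, 1665814)   # 1890
length_aa = protein_length(length_bp)       # 629
print(length_bp, length_aa)
```

Both results agree with the Gene Length (1890 bp) and Protein Length (629 aa) fields in the record.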
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 0.0000317617 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCCTCAGA TCGCACCCTA CGGCAGTTGG CGCTCGCCGA TCACCGCCGC GATGCTCGTC ACCAAACAGG TAAGCCTCAG TTGGCCGCAA ATTGATGGTT CTGACCTCTA CTGGATCGAA GGCCGACCAC AAGAGGGTGG TCGTAATGTG CTCGTTCGAC GCACTGCTGC CGGCACGATT GACGATGTGA CGCCGACCGG GATGAATGTA CGCACGTTAG TACACGAATA CGGTGGTCGG AGTTATGTCG TTGATCAAGG TGAGATCTTC TTCGTGGAGT TCAGCGATCA GACGCTTTAC CGCCAACACC CCGGCGCCGC ACCGGTTCCG CTTGGCACAC CGCCCGGGCT GCGGTTCGCC GAGCCGATAG TTGATCGCCG ACGTAACCGG TTGATCGCCG TTGGCGAAGA CCACCGGAGC AGTGGTGAAC CGCTCAACAC GCTCGTCGCT ATCAGTCTGG CCGATGGGAC AGTCACGCCA TTACTGAATG GACGCGATTT TGTGGCTGCA CCACGCCTCA GCCCTAACGG TGCGTGGCTG GCCTGGTTAG CGTGGGATCA TCCCAATATG CCGTGGGACG CTGCTGAGTT ATGGGTAGCG CCGGTGCTGA CCGATGGCAG TCTAGGTGCC GCCCGTCGTT TAGCCGGTGG TCCCGGCGAC TCGTGCTTCC AGCCGGAATG GACGCCCGAT GGCGCATTAC TGGTCGTCGC CGAACGTACC GGCTGGTGGA ACCTCTACCG CCTCGACCTC GATGGCAACA CCACCCCACT CTACCCGCTT GATGCCGAGT TTGGTCAACC ACTCTGGCAA CTGGGTATGC GCACGTATGT TCCACTTCCC GACGGACGCA TCGTAGCGAC CTTCAGCCGC GAGAGTCGCC GGCACATGTG CGTGATCGGG CAGCCGGGTC ACGCCGAACC CATCGAGTTA CCGGTTTCGG TAATCAATAT CGTCAACGGT GACGGCGAAC GGATCGTCTT TGTCGGCAGT TCGCCGACGA TGCCGGCAAC GCTCTTCTTG CTTAACCTCG CCGATCGATC GCTGACACCG ATCCGCAGCA GTGGCGATGT GCCGGTCGAT CTATCGTACA TCTCACAGCC AGAGGTGATC AGCTTTCCGA GTGCCGGTGG GCGGATTGCG CACGGCATTT TCTATCCACC TCACAACCCC GATTTCAGCG CACCTGACGG TGAACTACCC CCATTGCTGG TAATGATCCA CGGTGGGCCA ACCGCTGCCA CTTATCCCAC CTTACGCCTC TCGATCCAAT ACTGGACGAG TCGGGGGATC GGTGTACTCG ATGTGAACTA CGGCGGCAGT ACCGGTTTTG GTCGTACTTA TCGTGAACTG CTCGACGGAC AGTGGGGTGT GGTTGATGTA GAAGATTGTG TTGCCGGTGC CCGGTTTCTT GCCGCTGAGG GCAAAGCCGA TCCCAACCGG CTCTTGATTA CCGGCGGCAG TGCCGGCGGA TTCACAACAT TAGCTGCATT AGCTTTCCAC AATACCTTCC GCGCCGGCGC CAGTCATTTT GGTGTTGCCG ATCTCGCAGC CCTAGCGCGT GACACCCACA AATTCGAATC ACGGTATCTC GACCGTCTGA TCGGCCCATA CCCCGCACGA GCTGACCTCT ATCAGGCACG TTCACCGCTC TACCATGCCG ATCGGATCAA CAGCCCGGTT ATCTTCTTCC AAGGGCTAGA AGACAAAGTG GTGCCGCCAG ATCAATCGGA GCGTATGTAC GAAGCGCTTC GCTCACGCGG TATTCGGACC GAATACGTAC CCTTTGCCGG CGAACAGCAC 
GGTTTTCGCA AGGCCGAGAA CATTATCACA GCTCTCGAAC GTGAGCTAGC GTTTTACCAA GAAGTGCTAG GGATCCACGC TGAGCGGTGA
|
Protein sequence | MPQIAPYGSW RSPITAAMLV TKQVSLSWPQ IDGSDLYWIE GRPQEGGRNV LVRRTAAGTI DDVTPTGMNV RTLVHEYGGR SYVVDQGEIF FVEFSDQTLY RQHPGAAPVP LGTPPGLRFA EPIVDRRRNR LIAVGEDHRS SGEPLNTLVA ISLADGTVTP LLNGRDFVAA PRLSPNGAWL AWLAWDHPNM PWDAAELWVA PVLTDGSLGA ARRLAGGPGD SCFQPEWTPD GALLVVAERT GWWNLYRLDL DGNTTPLYPL DAEFGQPLWQ LGMRTYVPLP DGRIVATFSR ESRRHMCVIG QPGHAEPIEL PVSVINIVNG DGERIVFVGS SPTMPATLFL LNLADRSLTP IRSSGDVPVD LSYISQPEVI SFPSAGGRIA HGIFYPPHNP DFSAPDGELP PLLVMIHGGP TAATYPTLRL SIQYWTSRGI GVLDVNYGGS TGFGRTYREL LDGQWGVVDV EDCVAGARFL AAEGKADPNR LLITGGSAGG FTTLAALAFH NTFRAGASHF GVADLAALAR DTHKFESRYL DRLIGPYPAR ADLYQARSPL YHADRINSPV IFFQGLEDKV VPPDQSERMY EALRSRGIRT EYVPFAGEQH GFRKAENIIT ALERELAFYQ EVLGIHAER
|
| |
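The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a coding sequence. The helper names are hypothetical. The codon table holds the standard amino-acid assignments, which translation table 11 shares; table 11 also permits alternative start codons that are read as M at the initiation position, and this sketch does not handle them (this gene starts with ATG, so it is not affected). Only the first three blocks of the gene sequence are used as input.

```python
# Standard amino-acid assignments with codons ordered T, C, A, G at each
# position. Translation table 11 uses the same assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate complete codons from position 0, stopping at a stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = clean_sequence("ATGCCTCAGA TCGCACCCTA CGGCAGTTGG")
peptide = translate(first_blocks)
print(peptide)                       # MPQIAPYGSW
print(round(gc_content(first_blocks), 2))
```

The translated peptide matches the first block of the protein sequence in the record. Running the same functions on the full gene sequence would be the way to check the record's GC content and protein sequence fields.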