Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_2306 |
Symbol | |
ID | 5539787 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 2973974 |
End bp | 2975470 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640894439 |
Product | carboxypeptidase Taq |
Protein accession | YP_001432407 |
Protein GI | 156742278 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2317] Zn-dependent carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.601407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.828222 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATACGC CTCCTCAACT CACCGAACTC AAAGCGCGTC TCCGCGAGAT CGACGATCTG GAGATGGCGG CTGCGCTCCT GAACTGGGAT CAGACGACCT ACATGCCTCC CGGTGGCGCC GCGGCTCGCG GTCGCCAACT GGCGACACTG GGGCGCATTA TTCACGAGAA GCGGATCGAT CCGGCGATTG GGCGTTTGCT CGATGCGCTG CGCTCCTACG AAGAGTCGCT CCCGCCTGAT TCGCCCGATG CTGCGCTCAT TCGGGTGACG CGACGCGACT ATGAGCGCGC GATGCGCGTT CCCGCCGCGT TCACTGCTGA ACTCTACGAG CACACCGCTG CCAGTTACGA TGTCTGGTCG CGTGCGCGTC CGGCGAACGA CTTTATGGCG GTTTTGCCCT ATCTGGAGCG CACCCTTGAT CTCAGCCGCC GCTTTGCGGA GTTCTTTCCC GGCTATGAGC ACATTGCCGA TCCGCTGATC GATATGGCGG ATTATGGCAT GCGCGCAGCA ACGATCAAAC AGGTCTTCGC GGAACTCCGC CAGGGGTTGA TCCCGCTGGT CGAACAGATC ACCGTCCAGC CGCCGGTCGA TGACTCTTGC CTGCGCCAGT TCTTCCCCGA AGCGCAACAG TGGGCGTTTG GCGTTGAAGT CATCACGGCA TTGGGGTACG ACTTCAGCCG TGGAAGGCAG GATAAGACGT TGCACCCGTT TATGACGAAG TTCTCGCTGA ACGATGTGCG CATCACCACG CGAGTCGATG AATATGACCT CGGTTCGGCG CTCTTCAGCA CCATCCACGA AGCCGGGCAC GCAATGTACG AGCAGGGTAT TGCGCAGGCA TTCGAGGGTA CGCCGCTCGC GTCTGGCACA TCCGCCGGCA TGCACGAGAG TCAGTCGCGT TTGTGGGAGA ATATCGTTGG GCGCAGCCTC CCCTTCTGGG AGTATTTCTA TCCGCGCCTC CAGGCGACTT TTCCCGATCA GTTGGGAAAC GTGCCGCTTG AAACGTTCTA TCGGGCGATT AACAAAGTGC AGCGCTCCCT CATCCGCACT GAAGCCGATG AAGTGACCTA CAACCTGCAC GTCATTCTGC GTTTCGACCT GGAACTGGCA TTGCTCGAAG GAACGCTCGC CGTGCGCGAC CTGCCCGAAG CCTGGCGTGA ACGCTATCGC AGCGATCTTG GCGTGGCGCC GCCGGACGAC CGCGACGGTG TGTTGCAAGA TGTTCACTGG TACGGTGGTC TCATCGGCGG CGCGTTTCAG GGATATACAC TGGGGAATAT TATGAGCGTG CAACTGTTCG ATGCCGCGCT GCGCGACCAT CCCGACATCC CGCAGCAGAT TGGCAGCGGC AGGTTCGACA CGCTCCGCGA ATGGATGCGC GAACATGTCT ACCGTCATGG GCGCGCCCTC GACGCCGACG ACATCCTGCG ACGCGCCACC GGCAGATCAC TCGATGTGCA GCCGTATCTG GCATACCTGT GGCGCAAATA CGGATGA
|
Protein sequence | MHTPPQLTEL KARLREIDDL EMAAALLNWD QTTYMPPGGA AARGRQLATL GRIIHEKRID PAIGRLLDAL RSYEESLPPD SPDAALIRVT RRDYERAMRV PAAFTAELYE HTAASYDVWS RARPANDFMA VLPYLERTLD LSRRFAEFFP GYEHIADPLI DMADYGMRAA TIKQVFAELR QGLIPLVEQI TVQPPVDDSC LRQFFPEAQQ WAFGVEVITA LGYDFSRGRQ DKTLHPFMTK FSLNDVRITT RVDEYDLGSA LFSTIHEAGH AMYEQGIAQA FEGTPLASGT SAGMHESQSR LWENIVGRSL PFWEYFYPRL QATFPDQLGN VPLETFYRAI NKVQRSLIRT EADEVTYNLH VILRFDLELA LLEGTLAVRD LPEAWRERYR SDLGVAPPDD RDGVLQDVHW YGGLIGGAFQ GYTLGNIMSV QLFDAALRDH PDIPQQIGSG RFDTLREWMR EHVYRHGRAL DADDILRRAT GRSLDVQPYL AYLWRKYG
|
| |