Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3465 |
Symbol | |
ID | 8334818 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 3846515 |
End bp | 3848704 |
Gene Length | 2190 bp |
Protein Length | 729 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644956609 |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003114212 |
Protein GI | 256392648 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.552807 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00639075 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGCAAGA AGCGCTTCAC GCGTATCAGA GGCCTGGTAC TCATCGGATG TGCCGCCGTG CTCGCGGTGG CCGGCACGGT CTCGACCGGA TCCGGGACGG CGGTCGCGGC TCAGGCGACC GCAGCCAATC CGGCCGTCCG AGTAGATCAG GGTTCGCTCA CGAACCAGGC GACTCCAGCC GCTGACGCCC TGCTGTCGCT GAACAAACCG GCAACCGCCT CCTCCTCCGG CGGCTGTTGC GCCGCTCCCA ATGTCGACGA CGGCGTCTCC ACGACGCGCT GGGCCAGCGG CGCGGGCGTC GACCCCCAAT GGATCTACAT CGACCTCGGC GCCATGGCGC ACGTCAGCCG GGTCCGGTTG CAGTGGGATA CCTCCTGCGC GACCGCCTAT GAGGTCGACG TCTCCGCCGA CCACACGACG TGGACGAAGA TCTACAGCAC CACCGCCGGC AAGGGCGGAG TGGAGGACCT GACCTCGCTG GACGGGAACG GTCGCTACGT GCGCATGTAC GGGACCAAAC GCTGCCGCAG CGACTCCAGC CACGGCTATT CGCTGCAGGA ACTCGGCGTC TACGGCACCG TCGGCAGCGA CACCACGCCG CCGACCCCGC CCGGAACCCC GACGCTGGTC TCCGACACCC CGAACAGCGT CACCATCGGC TGGAGCGCCT CGACCGACAA TGTCGGCGTC AGCGGCTACG ACATCTACCA CGACGGCCAG CTGTGCGCGC AGGTGAACGG CAGTACCCTG ACAGGCACTT GCGGCTCGCT CAATCCCAAT GTCAGCTACG GGTTCTACGT GAACGCCCGC GACGCGGCGG GCAACGTCAG CCAGCCCTCC GGCACGCTGA ACGTCACCAC CCCGCCGTCC AGTGACACCA CGCCGCCGAC CGTGCCCGGC ACCGTGCACT CCACCGCGGT GACCAGCACC AGCGTCACCC TGGGATGGAC CGCCTCGACG GACAATGTCG CGGTCGCCGG CTACCGCGTC TACAACGTGG TGAACGGTAC GCGCACGAAG GTCGGCACAG CGGACGCCAA CGCCACCACC ACCCAGGTGG ACGGCCTGAC GCCCAGCACC GCCTACCACT TCCAGGTCAC CGCCTACGAC GGCAACGGCA ACGAGAGCGC GGGCAGCACG CCGATCCTGG ACGTCAGCAC CTCCGCGTCC AGCAGCTGCA CGCCTTCGCA GGGCGTGTGC ACCGTCACCC AGGTCGGCAC GGACGATGAC GTGGTGTGGG GTCTGGTCAC GCTGCCCGAC GGCACGATCC TGTTCAACGA GCGCGACGCG CACGACATCG TGCACCTGAA TCCCAAGACC GGCGCGAAGA AGACCATCGG TACGGTGCCC AACGTGCAGA GCACCGACGG CGAGGGCGGC CTGACCGGGC TGGAGATCAA CCCGGTGAGC TTCAGCTCGG ACCACTGGCT CTACATCATG CACACCTCGC CGACCGACAA CCGCATCGTG CGGATCAAGT ACGACCCGGC CAGCGACACG CTGCAGACCA GCACCGAGCA GATCCTGCTG ACCGGCATCG CGCGCAACAA GTTCCACAAC GGCGGACGCC TGCGCTTCAG CCCGGACGGC AAGTACCTGT ACGCCGGCAC CGGCGACGCG CAGAACGGCG CCAACGCGCA GAACACCAGC AGCCTCAACG GCAAGGTGCT GCGCATCAAC CCCGACGGAA CCATCCCGAC GGACAACCCG TTCCACAACG CGGTCTGGAG CTACGGGCAC CGGAACGTGC AAGGCCTCGC CTTCGACTCC CAGGGACGGC TGTGGGAGCA GGAGTTCGGC AACAGCGTCA TGGACGAGAC CAACCTCATC GTCAAGGGCG GCAACTACGG CTGGCCGTCG TGCGAGGGCA CGTCGGGCAC CTGCGGCACC GCCGGCTTCA TCGCGCCGAA GCACACCTAT CCGGTGGCCA ACGGCTCGTG CAGCGGGATC ACGATCATCC GCGACTTCCT GTACGTGGCC TGCGAACGCG GGACGCGGCT CTACCGGGAG CAGATCAGCG GCAGCAGCTT GACGAACGTG CAGACGTTCT TCGACGGCAC GTACGGCCGG CTGCGCACCG TCGAGCCGGC GCCGGACGGC GGGATGTGGA TGGCCACGTC CAACGGCGGT GACAAGGACA GCACGCCGCA CAACAGCACC AACCAGATCT TCCATGTGAC AGTGGCTTGA
|
Protein sequence | MRKKRFTRIR GLVLIGCAAV LAVAGTVSTG SGTAVAAQAT AANPAVRVDQ GSLTNQATPA ADALLSLNKP ATASSSGGCC AAPNVDDGVS TTRWASGAGV DPQWIYIDLG AMAHVSRVRL QWDTSCATAY EVDVSADHTT WTKIYSTTAG KGGVEDLTSL DGNGRYVRMY GTKRCRSDSS HGYSLQELGV YGTVGSDTTP PTPPGTPTLV SDTPNSVTIG WSASTDNVGV SGYDIYHDGQ LCAQVNGSTL TGTCGSLNPN VSYGFYVNAR DAAGNVSQPS GTLNVTTPPS SDTTPPTVPG TVHSTAVTST SVTLGWTAST DNVAVAGYRV YNVVNGTRTK VGTADANATT TQVDGLTPST AYHFQVTAYD GNGNESAGST PILDVSTSAS SSCTPSQGVC TVTQVGTDDD VVWGLVTLPD GTILFNERDA HDIVHLNPKT GAKKTIGTVP NVQSTDGEGG LTGLEINPVS FSSDHWLYIM HTSPTDNRIV RIKYDPASDT LQTSTEQILL TGIARNKFHN GGRLRFSPDG KYLYAGTGDA QNGANAQNTS SLNGKVLRIN PDGTIPTDNP FHNAVWSYGH RNVQGLAFDS QGRLWEQEFG NSVMDETNLI VKGGNYGWPS CEGTSGTCGT AGFIAPKHTY PVANGSCSGI TIIRDFLYVA CERGTRLYRE QISGSSLTNV QTFFDGTYGR LRTVEPAPDG GMWMATSNGG DKDSTPHNST NQIFHVTVA
|
| |