Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3300 |
Symbol | |
ID | 8334653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 3641182 |
End bp | 3643170 |
Gene Length | 1989 bp |
Protein Length | 662 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644956445 |
Product | putative secreted protein |
Protein accession | YP_003114048 |
Protein GI | 256392484 |
COG category | [R] General function prediction only |
COG ID | [COG1409] Predicted phosphohydrolases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCACG AGCATGAAGA CCCGCACTGC CAGTGCCCTC GCACCGCCGA CTTCGACTCC GCCGGTCCTG CCGCGACCGG TCGGCGCGCG TTCCTGCGCA ACGCCGGGTT GGTGGGTGCG GGGGCGACCG CGCTGGCTTC CGGCACCACG GTGTTCGGGC CGGCGGCGTC TGCTGCCGCG TCTGCTGCCG CCGCCGATGC CAAGCACGGC GGGGCGAGCA TCTACGACGT CACCGCCATC GACACCGACG CCCCGGGCGC CCGCTGGGAT CCGGACCCGG ACAACCCGGT GTTCACGCTG GTCGTGATGC CGGACACGCA GTACATGTTC GACGAGGACC GGATCCACCC GGCGCCGATG GAGGCTTCGT TCCGCTACAT CCTCAGCGGG GACCAGGACG CGAACGTGGT GTTCATGGCG CACCTCGGCG ACCTGACGCA GAACGGTGAG GCGCGGGAGT TCGCCGCGGC TGGCGCGGTC TTCGAGATGC TGGACCGCAA GAAGGTCGCC TACAGCGTCG TCGCCGGCAA CCACGACGTG TCCGGGGACG ACCAGCGCGG CGCCACGCCG TATTCGGGCA CGTTCACCAA GGCGCGCGCG GCGAAGTCCC CGAGCTTCGG CGGCGCGAGC CCCGACGGGT ACAACACCTT CCACATCTTC CGCGGCGGCG GTCGCGACTG GCTGCTGCTC GGTCTGGACT GGCGGCTCTC GCCGGCCGGG TTCGCCTGGG CCAACCAGGT CATCGCCGAC CATCCGACGC TGCCGGTGAT CGTCACCACG CACGAGGTCG CCTATGCCGA CGACTCCGGG ACCGCGTACC TGTCCGACTA CGGGCAGCAG CTGTGGGACG GGCTGATCAA GAACCACGAC CAGGTGTTCC TGAGCCTGAA CGGGCATTAC TGGCCGCCCG GGTCCACGAC GCTGAGCAAC GCCGCGGGCC ACGATGTGCA CGTCCACATC ACCAACTATC AGGACCGTTA CTACGGCGGC GCGGCGATGA TCCGTACGTA CCGCTTCGAC ATGCGGCGCA ACACCGTGGA CGTGTCGACG TTCTCGCCGT TCATCCGCGG CATGGCCACC AGCCAGGTGA ACGAGCTGGC CGGGCAGGAG ATGGAGCTCA CGTCGGCGGT CGACCGCTTC TCGATGGCGA TCGACTTCGA CCAGCGTTTC GGCGGCTTCG CGCCGGTCGC GGCACGTCCG GCGCGTCCGG CGGCCAAGAT GCTGACCCGC GACACGCTGG CCTACTGGCG CTTCGACGGC GGCGGCGCGG ACGGCTCGGC GCTCGGCGAC AGCCAGATCA TCCGCGACCT GTCGGGCAAG GGCAACGACC TGGTGAAGGA GAACGTCCCC GGCACCACCG GCACGCCGGT GAGCTGGTCC ACCACCGAGT TCCACCCCGA CCAGCCGGCT CACGGCTCGC TGCGCTTCAC CGGCCAGGGC CACCCGACGC GCGGCGCCTG GCTGCAGACC GTCCCGAACG CGCCGCTGAA CGCCGAGACG TTCCAGCACG GCTACACCTT CGAGATGTTC TTCAAGCTGC CCGCGGATTG GGACTCCTCG CAGAGCGCGT GGAGCGGCCT GCTCAGCCGC TGGGGCATGA GCAGCGAGGC GGGCAAGTCC GGCGGCAACA CCGACCCGCA GGAGCCGATC GCCACCCTGA GCCTGTCCGG CGGCTCCGAG TTGCAGTGGA ACGTCTACCC GCTGAACCAG CCCGGCGCGT CGACCGCCTG GAGCCACCTG CTGCCCCTCG GGCAGTGGTG GCACGTCGCG ATCGTCAACG ACGGCAAGGT GAACCGCATG TACGTCAACG GCTGCGAAGA GGGCCGCAAC CCCTCGACCC CCGCCATCGG CCTGACCACC CTGAACCACT CCTGGCTCCT CGGCGGCTAC GAATACGCCG GCGCTATCAA CCAGATCCAC AACGGCTGGA TCGGCGATGT CCGCATCACC GCCCGGCCGC TCCGCATCGA CGAGTTCATG AACGCCTGA
|
Protein sequence | MNHEHEDPHC QCPRTADFDS AGPAATGRRA FLRNAGLVGA GATALASGTT VFGPAASAAA SAAAADAKHG GASIYDVTAI DTDAPGARWD PDPDNPVFTL VVMPDTQYMF DEDRIHPAPM EASFRYILSG DQDANVVFMA HLGDLTQNGE AREFAAAGAV FEMLDRKKVA YSVVAGNHDV SGDDQRGATP YSGTFTKARA AKSPSFGGAS PDGYNTFHIF RGGGRDWLLL GLDWRLSPAG FAWANQVIAD HPTLPVIVTT HEVAYADDSG TAYLSDYGQQ LWDGLIKNHD QVFLSLNGHY WPPGSTTLSN AAGHDVHVHI TNYQDRYYGG AAMIRTYRFD MRRNTVDVST FSPFIRGMAT SQVNELAGQE MELTSAVDRF SMAIDFDQRF GGFAPVAARP ARPAAKMLTR DTLAYWRFDG GGADGSALGD SQIIRDLSGK GNDLVKENVP GTTGTPVSWS TTEFHPDQPA HGSLRFTGQG HPTRGAWLQT VPNAPLNAET FQHGYTFEMF FKLPADWDSS QSAWSGLLSR WGMSSEAGKS GGNTDPQEPI ATLSLSGGSE LQWNVYPLNQ PGASTAWSHL LPLGQWWHVA IVNDGKVNRM YVNGCEEGRN PSTPAIGLTT LNHSWLLGGY EYAGAINQIH NGWIGDVRIT ARPLRIDEFM NA
|
| |