Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_7838 |
Symbol | |
ID | 8339214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 9094157 |
End bp | 9096997 |
Gene Length | 2841 bp |
Protein Length | 946 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644960922 |
Product | peptidase C60 sortase A and B |
Protein accession | YP_003118503 |
Protein GI | 256396939 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0706316 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACC CCCCAGCCAT CGCCATCCCC ATAGGCACCC CAGCCACCGA CCAACAGCAC ACCCCCGCAA CGAGAGCACT GCCGGTAGCG AAGCATCCGA CGGCGAACGA TGGCGCGCAG TCAGGCGAGA CCAACGGTCA CCTCGATCTC GCACGGGCCA GCGCGCACGG CATCCCGCCG AGCCCTCAGC CGACATCCAC GAAAAGCCAG ACGCTCGATC CCAGCGCCGC GCTACCGTGC CCCGATCCCC GTCGCGCAGT TCCGAATCTC AATCCGAGCG ATGCAGTTCC AAGTCCCGAT CCCAGCGGGG CAGTGGCGAA TCTCGATCCC AGCGGCAGGC TGCCAGGTCT CGATCCCAGC GCCACATTGG CGGTTCTCGA CCCCAGCGGC GGGCCGCCAA GTCCTGGTCC CAGCGCCGCA TTACGGAGTC TCGATCCCAG CAGGGCATCG GCGAATCTCA ATCCCAGCGG CACGCTGCCA AGTCCCGATC CCGGCGACGC TCTACCAAGT CTCGATACCA GCGGCGCCTT GCCGAGCGAT GCGTCTTTGC CGCCCGATCC AGCCTCCGCA CATGGGCCGG TCTTTAGCCC GACCGGCAGG CCGGCGACCT CCGAGCGGCC TCCCATCGCA GCACCCGATT CCCACATCGG GCACGATCGC AACAGGGCCA GCACGCACAG CACCTCGCCA AGCCCGCAGC CGACATCTGC GGCCGACCAC TCGCTCGATC CCAACGGGGC GCTGGCGAGC GATGCGTGTC CGCCCGCCAA TCCAGCCTCC GCACGCGGTC CGGTCTTCGG CCTGATCGGC GAGCTGGCGA CCTCCGAGCG GCCTCCCATC GCAACACCTG ATTCCCACAT CGAGCGTGAT CGCAACAGGG CCGACGCACA CGGCACCTCA CCGAGCCCGC AGCCGACATC TGCGGCCGAC CACTCGCTCG ATCCCAACGG GGCGCTGGCG AGCGATGCGT GTCCGCCCGC CAACCCGGCC TCCACACACG GCCCGAACTT CAGCCCGACC CGCGAGCCGG CGGCCTCCAA TCGGCGTCCC GCCACAGCGC CCGGGTCTCA CATCGAGCGT GATCGCAGCA GGGCCGACGC GCGCGGCACC TCGTCGAACC CGCAGCCGAC ATCTGCGGCC GACCACTCGC TCGATCCCAG CGGCGCCCTG CCGAGCGATG CTTGTCCGCC TGCCGACCCA GCCTCCACAC CCGGTCCAAT CTTCAGCCCG ATCGGCGAGC CAGCGAACTC CGAGCGGCCT CCCGCCACAG CGCCCAGGTC TCACACCGAG CGTGATCGCA ACCGGATCGG CACAAGCGGC ACCTCGCCGG GCCCTCAGTC GGCATCCGCG ACCAACCAGT TACTCGATCC CAACACGGCG CTAGCGAGCG ATGCGTCTCT GCCCGCCAGT CCAGCCTCCG CACTGGGTCC GGTCAGCCCG GTCGACGAGC TGGCGACCTC CGAGCGGCTT GCCATCGCAG CACCCGGGTC TCACATCGAG CGTGATCGCA GCAGGGACGG CACGCGCGGC ATCTTGCCGA ACGCTCAGCC AGCGTCCGCG ACCGACCAGT TGCTCGATCC CAACGGGGCG ATGCCGAGCG GTGCGTCTCC GCCTGCTGAC CGAAACTCCG CGCACGGTCC GGTCTTCAGC CCGATCGGCG AGCCAGCAGC CTCCGAGCGG CCTCCCGCCA CAGCGCCTCA GTCTCACGTC GAGCGTGATC GCAGCTGGGC CGACGCGCGT GGCATCTTGC CGAGTGCTCA GTCGGCCGCG ACCAGCCGGT CGTTCGATCA CAGCGATCCA GTGCCGCGCG ATGCGCCATT GCCGTCCGAT TTAGCCTCCG CATCTGGCCC GGAGTTCCAG CTCAGCAGCG AGTTGGCGAC TTTTGGGCGG CGTCGCACTG CAGCGTTCGA TTCCCATGTT GGGCTAGGTG AAAGCGGGGG CGGCGCGCGC GGGGTGTTGC CGAGCGGTCA GGCGGCGTCC GTGGGCGGCC GGTTGCTCGA TGCGCGTGGC GCGTTGCCGA GTGATGCGGC TGGGGTCGTC GACAGAGCCT CCGTGCTCGG TCCGGTGTTC AATTTGATCG GCGAGCGCGT GGCCTTTGAG CGGCCTCGCG TCACGACTTC GGCCGATTCA CATGTCGAGG TGGGTGTTCG GGGGAAGGCT CGTAGCGTCC CGAAGCCTGC GCCGAGTCCG GTCGTGCCGC AGAGTGAGGT GCGGCAGCCG GGTCGGGTGC TGGCTTTCGC TTCGGTGGTG GCTGTTGTTG TCGGGTCGGG GTTTCTTGTA CGGGCGCTGA TGCTCACTGA TCCGGTGCCG CCGCCGTTGC CGCCTGCATG GGCGGGGCGG GTTGTGGGGG CGGTTGGGCA TCGGGCTGGG CCGCTGGGGT ATGCGCGGCC GGTGCGCGTC AGCGTTGCTC GGGTTGGGAT TCATGCTGAT GTGCTTGCGC TTGGTTTGAC TCAGGAGGGC GGTGTTGGGG TGCCGCCCGC CAAGGAGCCT TTGAAGGCCG CTTGGTATGA CCGGGGGCCG GCGCCGGGGG AAGCTGGACC TGCGGTCATT ACGGGGCATG TTGATTCGCG GTTCGCGCCG GGGAATCGTG CCGCGTTTTA TGAGTTGGGG GCGGTTCGGC CTGGCGATGC TGTCGATGTG GTGCGCGCTG ACCATCGGGT CGCGGTGTTC CGTGTCGACT CGGTCGCGTT GGTGCCGAAG GCCGGCTTTC CGACGCGGCA GGTCTACGGG CCGACGGGGT ACGCCGCGCT GCGGCTCATC ACCTGCGGTG GCCACTACGA CCGGCGGACC GGGTACGCCG ACAACGTCAT CGTCTACGCG CATCTCGTCG GGTCCCGCTG A
|
Protein sequence | MTDPPAIAIP IGTPATDQQH TPATRALPVA KHPTANDGAQ SGETNGHLDL ARASAHGIPP SPQPTSTKSQ TLDPSAALPC PDPRRAVPNL NPSDAVPSPD PSGAVANLDP SGRLPGLDPS ATLAVLDPSG GPPSPGPSAA LRSLDPSRAS ANLNPSGTLP SPDPGDALPS LDTSGALPSD ASLPPDPASA HGPVFSPTGR PATSERPPIA APDSHIGHDR NRASTHSTSP SPQPTSAADH SLDPNGALAS DACPPANPAS ARGPVFGLIG ELATSERPPI ATPDSHIERD RNRADAHGTS PSPQPTSAAD HSLDPNGALA SDACPPANPA STHGPNFSPT REPAASNRRP ATAPGSHIER DRSRADARGT SSNPQPTSAA DHSLDPSGAL PSDACPPADP ASTPGPIFSP IGEPANSERP PATAPRSHTE RDRNRIGTSG TSPGPQSASA TNQLLDPNTA LASDASLPAS PASALGPVSP VDELATSERL AIAAPGSHIE RDRSRDGTRG ILPNAQPASA TDQLLDPNGA MPSGASPPAD RNSAHGPVFS PIGEPAASER PPATAPQSHV ERDRSWADAR GILPSAQSAA TSRSFDHSDP VPRDAPLPSD LASASGPEFQ LSSELATFGR RRTAAFDSHV GLGESGGGAR GVLPSGQAAS VGGRLLDARG ALPSDAAGVV DRASVLGPVF NLIGERVAFE RPRVTTSADS HVEVGVRGKA RSVPKPAPSP VVPQSEVRQP GRVLAFASVV AVVVGSGFLV RALMLTDPVP PPLPPAWAGR VVGAVGHRAG PLGYARPVRV SVARVGIHAD VLALGLTQEG GVGVPPAKEP LKAAWYDRGP APGEAGPAVI TGHVDSRFAP GNRAAFYELG AVRPGDAVDV VRADHRVAVF RVDSVALVPK AGFPTRQVYG PTGYAALRLI TCGGHYDRRT GYADNVIVYA HLVGSR
|
| |