Gene Information |
Locus tag | Caci_1052 |
Symbol | |
ID | 8332387 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 1193497 |
End bp | 1197585 |
Gene Length | 4089 bp |
Protein Length | 1362 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644954200 |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003111819 |
Protein GI | 256390255 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
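The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. Both assumptions fit the values in this record but are not stated by it. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(1193497, 1197585)
print(length)                     # 4089
print(protein_length_aa(length))  # 1362
```

Under these assumptions the results match the Gene Length (4089 bp) and Protein Length (1362 aa) rows.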
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.311777 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCACAGG AATCGAATCA GCCGTTCCGC GCGTCGCGGC GGACCGTCGT CGCGGCGGCG TCGACTTTGT TGGCCGGCTT CGCGGTGGAC ACCGCTTTCC CGAGTGTCGG CTTCGCCGCC GAGCCGGGGA AGGCTGGAGT CCCGGGTCAT GCGCCGGCGC CGGGGGAGCT GGCGATGTAC CGGCCGATCG AGGTCTCCTC GACCGATTAC GCGCCCACGC CTGGCGCGTT CGCCGTCGAC CGGGTGACCT CGGCCGGGGT CAGGGGTACC GGGTGGCGCG CCGCGGCCGG TGATCCGCAG TGGATCTCCG TGGACTTGCA GGCTGTCTGC CAGGTCACCT CTGTCCATCT GACCTTCGAG GCGGCCGCGG GCGATCCGGT CTTCGTCAAG CCGACCTCCG GCAACTGGGC CGATGGGACG ACGGGCAAGG AGCTGCTCTC CAGCTACGCG TCCGCCTTCG TCGTCGAGGT GTCGACGGAC AAGAAGTCGT GGACGAGCGT GTATCAGACG GCGTCGGGCA CCGGCGGCGC GGTCGTGATC GACCTGGCGG CTCCGGTCTC GGCGCGCTGG GTGCGGCTGA CGGCGTCCAA GCGCTCGGAT GCGAATCCCT TGGGACTCAA CGGTTTCCAG GTGTACGGCA GCGCGAGCGG GCACCGGCCC GCCGCCACCG GGTGGACCGA CTGGGGTACG CACGACAACC ACGCGCCCGG ACTGGTCGTG GCCGCCGACG GCACCGTGCC GCTGGAGTCC GGCTGGGTGC TGACGATGGA CGACTGGGCT CCCGGCGACG GGACCGCGCT GTCCGTGCCG ACCGTGGACA CCAGCGGGTG GTTGCCCGCC ACGGTTCCCG GGACCGTCCT GGCCTCGCTC GTCGAGCAGG GGCATCTGCC CGATCCGGTC GCGGGCTTCA ACAACCTGCG GATTCCCGAG GCGCTGTCCC GCCATTCCTG GTGGTACAAG CGCGACTTCG CGCTGCCCGC CGCGCTGGGC GCCGGGCACG GACGCCGGAT CTGGCTGGAG TTCGACGGAG TCAACCACCA GGCCGCGCTG TTCCTCAACG GAGCGCAGAT CGGCAGCCTC ACCTACCCGT TCGCGCGCGC CGCGATCGAG GTGACCGAGC ACTTGGTCGC CGGCGAGCAG TCCCTGGCCG TGAAGATCGA CCCGATGCCG ATTCCCGGCA GCCCGGGCGA CAAGGGACCG GCCGGTCAGT CCTGGGTCGA TGCCGGCGCG CAGATCATGA ACATGAACTC CCCGACATAC CTGGCCGCCT CGGGCTGGGA CTGGATGCCC GCGGTCCGCG ACCGGGTCAG CGGTATCTGG AACCACGTGC GGCTGCGCTC CACCGGGGAC GTCGTGATCG GCGACCCTCG GGTGGACACG GTGCTGCCGG CGCTGCCCGA CACCACCAGC GCGTCGGTGA CCATCGTCGT TCCCGTGCGC AACGCCGGGT CCGCCGATGT CAGCGCGACC GTGACAGCGT CCTTCGACAC CGTGCGCCTC TCGCAGACGG TGACCGTCCC GGCCGGTAAG AACCTGGACG TCACCTTCGC CCCGGCGAAG TTCGCGCAGC TGAACCTGCG CGATCCGAAG CTGTGGTGGC CCAACGGCTA CGGCGATGCG AACCTGCACA CCTTGAACCT GGCGGTCACG GTCGCGGGTC AGCGCAGCGA CCAGCGCACC ACCCGCTTCG GCATCCGGCA GTTCGACTAC GAGTACAAGA CGCCGCTGCC CTTCGTCGCA TCAGCCGACG CGTACACCCA GACCACGGAC CTCGGTGCGC GGCAGGCGCG CTATGTGCGC 
ATCAACTGCC AGACGCGCGC GACCGGATGG GGCTTCTCGC TCTGGACGCT GTCCGTCCTG AACAGCGCCA CGCCCGGTAC CGACCTGGCA CTGCACCAGC CGACGACCGC ATCGACGCAG GACCCCTCGA ACCCGGTGAC CAACGCCACC GACGGGGACG CGAACACCCG CTGGTCCTCC GACTTCGCCG ACGACCAGTG GATCGAGGTG GACCTCGGGG CGTCAGTGTC CTTCGATCAG GTCGCCATCA CCTGGGAGCA GGCGTACGCG CGCACGTACA CCGTGCAGGT CTCGACGGAC GGCTCGGTGT GGACGGACGC GAAGAGCGTG GACAACACGG CGATCCCGCT TCCGTTCAAC GGCGGCGACG CGAGCCTGGA CGTCGAGTCG TTCGCCGCGG CCTCCGGCCG CTACGTGCGG ATCAGCGGCG GCGTGCGCGA GACCAGCTGG GGTAACTCGC TGTGGTCGCT GTCGGTCCTG AACAGCGCCA CGCCCGGAGT CGACCTGGCC TTGCACAAGA CGGCCACCGC CTCGACCGAA GACCCCTCGA ACCCGGCGGC CAACGCCACC GACGGCAACA GCGGCACCCG ATGGTCCTCG GACTATGCGG ACAACCAATG GATCCAGGTG GATCTCGGCT CGTCGCAGAC CTTTGACGGA GTCGGCATCC TGTGGGAGCA GGCGTATCCG AAGACCTACG TGATCCAGGT GTCCGACGAC GGCTCGTCGT GGACCGACGT GAAGACAGTC GGCCTCGCGC CGGAGTCGCT GAAGATCAGC GTGAACGGTG TCAGAGTCCT GTGCCGGGGC GGCAACTGGG GCTGGGACGA ACTGCTGCGC CGGATGCCCT CCGACCGGAT GGACGCGGCG ATCCGCATGC ATCGGGACAT GAACTTCACG ATGGTCCGCA ACTGGGTCGG CGCCAGCAAC CGTGAGGAGT TCTACGCCGC GTGCGACGAG TTCGGGCTGC TGGTGTGGAA CGACTTCCCG AACGCCTGGG GCATGGACCC GCCGGACCAC GACGCGTACA ACTCGATCGC CGCGGACACC GTCCTGCGCT ACCGGATCCA CCCGAGCGTC GTCGTGTGGT GCGGCGCGAA CGAGGGGAAT CCGCCGCAGG CGATCGACGA GGGCATGCGC AACGCGGTGA CGAACGGCGC GCCCGGCATC CTGTACCAGA GCAACTCCGC CGGCGGGAAC ATCACCGGCG GCGGTCCCTA CTACTGGGTC GAGCCGGAGA CCTACTACGA CCCGGCGACG TACGGCAGCC ACAGCTTCGG TTTCCACACC GAGATCGGGA TGCCGGTGGT CTCCACGGCC GACAGCCTGC GGAACATGGC CGGCGAACAG CCGGCGTGGC CCATCGGCGG TCCGTGGTAC TACCACGACT GGAGCCAGTA CGGTAACCAG TCGCCGCTGC AGTACCAGGC CGCGATCGAA GCCCGGCTCC AGACCTCGAA CACCCTCGAG GACTTCGCCC GCAAGGCGCA GTTCGTCAAC TATGAGAACG CGCGCGCGAT GTTCGAGGCG TGGAACGCGA ACCTGTGGGC CGATGCCAGC GGTCTGATGC TCTGGATGTC GCACCCGGCG TGGCACAGCA CGGTGTGGCA GACCTACGAC TATGACTTCG ACGTCAACGG CATGTACTAC GGCGCGCGCA AGGCGTGCGA GCCCGTGCAC GTGCAAGCCG ATCCGGTCCA CTGGCAGGTC GTCGCGGTGA ACCACACGCC GCACGCGGTG AGCGGCGCGA CGGTCTCGGC GCGGCTGTTC GACCTGTCGG GCAGGCAGCT CGGCACGACG CAGAGCGCTG 
CGATCAACGT CGCGGTGGCG GACAGTGCGA AGGCCTTCGC GGTGGCATGG ACCGACGCGC TTCCCGATCT GCACCTGTTG CGCCTTACGC TCCAGGACGC CTCGGGCAAG ACACTGTCGG AGAACACGTA CTGGCGGTAC CGCGCTCCGT CGGCGATGCA GGCGCTGAAC AAGGCGCAGC AGACCCGGAT CACGGCCTCG ATCACCGGGG CGACCAGCGG CGGTGACGGG CGCCGCCAGC TGACGGCGAC GGTTCGCAAC CAGGGCTCGA GCGTCGCGGC GATGGTGCGG CTATCACTGC AGAACCGCGC TTCGGGGCAG CGGGTCTTGC CCACGCTCTA CGGAGAGAAC TACGTGTGGC TGCTCCCCGG CGAGACGCGC ACGATCATCG TGTCGTTCCC GTCCAGCGCG CTGCCCAAGG AGCAGCCGGA GCTGCACGTC GAGGGCTACA ACACGAGCGC GGTCATCGCT CGCGCATAG
Protein sequence | MAQESNQPFR ASRRTVVAAA STLLAGFAVD TAFPSVGFAA EPGKAGVPGH APAPGELAMY RPIEVSSTDY APTPGAFAVD RVTSAGVRGT GWRAAAGDPQ WISVDLQAVC QVTSVHLTFE AAAGDPVFVK PTSGNWADGT TGKELLSSYA SAFVVEVSTD KKSWTSVYQT ASGTGGAVVI DLAAPVSARW VRLTASKRSD ANPLGLNGFQ VYGSASGHRP AATGWTDWGT HDNHAPGLVV AADGTVPLES GWVLTMDDWA PGDGTALSVP TVDTSGWLPA TVPGTVLASL VEQGHLPDPV AGFNNLRIPE ALSRHSWWYK RDFALPAALG AGHGRRIWLE FDGVNHQAAL FLNGAQIGSL TYPFARAAIE VTEHLVAGEQ SLAVKIDPMP IPGSPGDKGP AGQSWVDAGA QIMNMNSPTY LAASGWDWMP AVRDRVSGIW NHVRLRSTGD VVIGDPRVDT VLPALPDTTS ASVTIVVPVR NAGSADVSAT VTASFDTVRL SQTVTVPAGK NLDVTFAPAK FAQLNLRDPK LWWPNGYGDA NLHTLNLAVT VAGQRSDQRT TRFGIRQFDY EYKTPLPFVA SADAYTQTTD LGARQARYVR INCQTRATGW GFSLWTLSVL NSATPGTDLA LHQPTTASTQ DPSNPVTNAT DGDANTRWSS DFADDQWIEV DLGASVSFDQ VAITWEQAYA RTYTVQVSTD GSVWTDAKSV DNTAIPLPFN GGDASLDVES FAAASGRYVR ISGGVRETSW GNSLWSLSVL NSATPGVDLA LHKTATASTE DPSNPAANAT DGNSGTRWSS DYADNQWIQV DLGSSQTFDG VGILWEQAYP KTYVIQVSDD GSSWTDVKTV GLAPESLKIS VNGVRVLCRG GNWGWDELLR RMPSDRMDAA IRMHRDMNFT MVRNWVGASN REEFYAACDE FGLLVWNDFP NAWGMDPPDH DAYNSIAADT VLRYRIHPSV VVWCGANEGN PPQAIDEGMR NAVTNGAPGI LYQSNSAGGN ITGGGPYYWV EPETYYDPAT YGSHSFGFHT EIGMPVVSTA DSLRNMAGEQ PAWPIGGPWY YHDWSQYGNQ SPLQYQAAIE ARLQTSNTLE DFARKAQFVN YENARAMFEA WNANLWADAS GLMLWMSHPA WHSTVWQTYD YDFDVNGMYY GARKACEPVH VQADPVHWQV VAVNHTPHAV SGATVSARLF DLSGRQLGTT QSAAINVAVA DSAKAFAVAW TDALPDLHLL RLTLQDASGK TLSENTYWRY RAPSAMQALN KAQQTRITAS ITGATSGGDG RRQLTATVRN QGSSVAAMVR LSLQNRASGQ RVLPTLYGEN YVWLLPGETR TIIVSFPSSA LPKEQPELHV EGYNTSAVIA RA
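The gene sequence above is printed in space-separated blocks of ten bases, so any calculation on it has to strip whitespace first. The sketch below shows one way a GC fraction like the one in the GC content row could be computed from such a string. It is an illustration only, not a description of how the database derives its value, and `gc_fraction` is a hypothetical helper name.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two ten-base blocks of the gene sequence in this record.
print(round(gc_fraction("ATGGCACAGG AATCGAATCA"), 2))  # 0.45
```

Applying the same function to the full gene sequence would give the fraction for the whole CDS.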