Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4312 |
Symbol | |
ID | 8335666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4892640 |
End bp | 4895663 |
Gene Length | 3024 bp |
Protein Length | 1007 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644957415 |
Product | coagulation factor 5/8 type domain protein |
Protein accession | YP_003115017 |
Protein GI | 256393453 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.382948 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.000989115 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGATTCCCG CACCAAGATT TTCACGGAGA TCCAGATCAT CGGCACTGTT GCTGGTCCTG GCCATGTTGA TGGCCGTTCT GTCGGTACCG TCGCTCGGGG CGGGGCGGGC CAAGGCGGCC AGCTGTGACA CCGGCACGAA CCTGGCACTG AACAAGAACG CCACCGCTTC CTCGATCGAG GGCGCCGGAA CCCCGGCCAC GGCTGCGGTG GACGGCAACA CCGGCACGCG CTGGTCGTCG CAGTTCTCTG ACCCGCAGTG GCTCCAAGTG GATCTCGGTA GCTCGCAGAG CATCTGCCAG GTGACGCTGA ACTGGGAGAC CGCCTCGGGC AAGGCGTTCC AGATCCAGAC CTCCAACGAT GCCGCTACCT GGACCTCGAT CTACTCCACG ACCACCAGTG CTGGCGGGAC GCAGACGCTC AGCGTGTCCG GCACCGGTCG TTACATCCGG ATGTACGGCA CCGCGCGTAA CACCGGTTAC GGCTACTCGC TGTGGGAGTT CGGCGTCTAC GGCAGCTCCA ACGGCGGAGG CGGCACCCCG CCGCCGCCGA ACTGGAACCT GGTCTGGCAG GACACCTTCG GCGGCAACTC CGGAACCGCT CCGTCCTCCG CCAACTGGAT CGAGGACACC GGCCACAACT CCGCGGGCGG ACCCGCGGAC TGGGGTACCG GTGAGGTCGA ATCCGCCTCC TCGTCGACTG CGAACGTTTC CGTGGACGGC AACGGCCACC TGAACATCAC TGCCCTCAAG GACGGTGCGG GGAACTGGAC CTCCGGCCGC ATCGAGACTC AGCGCTCTGA CTTCGCCGCC CCGGCCGGCG GCATGCTGGA GGTCACCGCG ACCATCAAGC AGCCGAACGT GGCCAACCCC GCCGGCTACT GGCCCGCTTT CTGGGCGCTG GGCAACGGGT CGCGCACCGG CTCGGGTACC TGGCCGGCCA TCGGCGAGAC CGACATCATG GAGAACGTCA ACGGTCACCA GACCACCTCC TCCGGTCTGC ACTGCGGGAC CGCGCCGAAC GGTCCGTGCA ACGAGTCCAG CGGGCGTGGC AGCGGTCTGC GAACCTGCGC CGGCTGCCTG AGCGCGTACC ACACCTACGC CGAGGTCATC GACCGGACGC AGTCCGACGA GCAGATCCGC TTCCTGGTGG ACGGCCAGGT CACCTGGACG GTCAGCGAGA GCCAGGTCGG TGTGACCACC TGGCAGAACG CGGTGGACCA CGGCTTCTTC CTGATCCTGG ACCTGGCGGT CGGCGGCTCC TGGCCGAACG CCGACTGCGG CTGCACGTCC CCGACCGCGG CGACCACCTC CGGCGGCACG CTGAGCGTCG GGCCGATCGC CGTCTACAGC ACCACCGGCT CGGCGCCGAC CCCGTTGCAG CCCCCGGCAC CGGCCACCGG GTCCAGCACC GTGAAGGTCA CCGGCAGCCA GGGGAACTGG CAGCTGAATG TGAACGGCGC GCCGTACCAG GTCAAGGGTG TGACCTGGGG TCCTGGCAAC CAGGCCGGGG ACGGCTACCT CGCCGACGCC GCGTCCATGG GGGTCAACAC CATCCGGACC TGGGGTACGG ATGCGTCCTC CCAACCGCTG CTCGACGCGG CGGCGGCGCG CGGGATCAAG GTCATCAACG GCTTCTGGCT GAATCAGGGA GCTGACTACG TCAACGACAC CGCGTACAAG ACCAACACCC TGAACAGCAT CAAACAGTTC GTCACGCAGT ACAAGAGCCA TCCGGCGACC CTGATGTGGG ACGTCGGGAA CGAGGTGATC CTCACCTCGC AGAACTACAC CTACCCCAAC GGCGCCACCG TGGAGCAGGA GCGCGTCGCC TACGCGCAGT ACGTCGAGCA GATCACCCAG GCGATCCACG CCATCGACCC GAACCACCCG GTCACCTCGA CCGACGCCTG GACCGGCGCC TGGCCGTACT ACAAGCAGTA CACGCCGAGC CTGGACCTGC TCGCCGTGAA CTCCTACGGC TCGGTGTGCA ACGTGAACAC CGACTGGGTC AACGGCGGCT ACACCAAGCC CTACATCGTC ACCGAGGCCG GCGACGCCGG CGAGTGGGAG GTCCCCAACG ACGCCAACGG CGTGCCCACC GAGCCCACCG ACCAGCAGCA GCGCGACGGC TACACCAGCG CCTGGAACTG CATCGCGGGC CACCCCGGCA TCTCCTTCGG CGGGACCCTG TTCAACTACG GCGTCGAGAA CGACTTCGGC GGCGTCTGGT TCAACCTGCT CACCGGCGGC TGGCGCCGGC TGTCCTACTA CGCGGTCAAG CAGGCCTTCA CCGGCCAGGC GCAGACCAAC ACGCCGCCGG CGATCACCTC GATGACGCTC AGCAACACCG CGAGCGTCCC GGCGGGCGGG CAGTTCACCG TGAACGTCGC CTCGACCAAC CCGACCGGCG ACGCGCTGAG CTACAACGTC GCCCTGTCGA GCAAGTACGT CAACAGCGCC ACCCCGTTGC AGTCGCCGTC CAGCTACACC CAGACCGGTC CCGGCGCCTT CACCGTCACC GCACCCCAGA CGCTCGGCGT GTGGAAGGTG TACGTGTACG TCTACGACCA ACACGGCGGC GTGGGCATCC AGTCCGTGTC CTTCCGCGTG GTCGCACCCC CGGTCTCCGG CACCAACGTG GCACTGGGCA AGGCCGTCAC GGCCTCGTCC TTCCAACCCG CCAGCAACGG CCAGACCTTC GTCCCCGCCA ACGTCACCGA CAACAACTGG ACCACCCGCT GGGCCAGCGA CTGGAGCGAC CCCCAGTGGA TCCAGGTCGA CCTCGGCCAG TCCACCGCCA TCAAGCACAT CCAACTCGGA TGGGAGTCCG CCTACGCCAA GGCGTACCAG ATCCAGGTAT CCAACGACGG CACCAACTGG ACCACCGTCC ACACCACCAC AACCGGCGCC GGCGGAGTCG AGACCTTCGA CGTCACCGGC ACCGGCCGCT ACGTCCGGAT GTACGGCACG CAGCGGGGGA CGGCGTACGG CTACTCGCTT TATGAGTTCG GGATCTATGC CTGA
|
Protein sequence | MIPAPRFSRR SRSSALLLVL AMLMAVLSVP SLGAGRAKAA SCDTGTNLAL NKNATASSIE GAGTPATAAV DGNTGTRWSS QFSDPQWLQV DLGSSQSICQ VTLNWETASG KAFQIQTSND AATWTSIYST TTSAGGTQTL SVSGTGRYIR MYGTARNTGY GYSLWEFGVY GSSNGGGGTP PPPNWNLVWQ DTFGGNSGTA PSSANWIEDT GHNSAGGPAD WGTGEVESAS SSTANVSVDG NGHLNITALK DGAGNWTSGR IETQRSDFAA PAGGMLEVTA TIKQPNVANP AGYWPAFWAL GNGSRTGSGT WPAIGETDIM ENVNGHQTTS SGLHCGTAPN GPCNESSGRG SGLRTCAGCL SAYHTYAEVI DRTQSDEQIR FLVDGQVTWT VSESQVGVTT WQNAVDHGFF LILDLAVGGS WPNADCGCTS PTAATTSGGT LSVGPIAVYS TTGSAPTPLQ PPAPATGSST VKVTGSQGNW QLNVNGAPYQ VKGVTWGPGN QAGDGYLADA ASMGVNTIRT WGTDASSQPL LDAAAARGIK VINGFWLNQG ADYVNDTAYK TNTLNSIKQF VTQYKSHPAT LMWDVGNEVI LTSQNYTYPN GATVEQERVA YAQYVEQITQ AIHAIDPNHP VTSTDAWTGA WPYYKQYTPS LDLLAVNSYG SVCNVNTDWV NGGYTKPYIV TEAGDAGEWE VPNDANGVPT EPTDQQQRDG YTSAWNCIAG HPGISFGGTL FNYGVENDFG GVWFNLLTGG WRRLSYYAVK QAFTGQAQTN TPPAITSMTL SNTASVPAGG QFTVNVASTN PTGDALSYNV ALSSKYVNSA TPLQSPSSYT QTGPGAFTVT APQTLGVWKV YVYVYDQHGG VGIQSVSFRV VAPPVSGTNV ALGKAVTASS FQPASNGQTF VPANVTDNNW TTRWASDWSD PQWIQVDLGQ STAIKHIQLG WESAYAKAYQ IQVSNDGTNW TTVHTTTTGA GGVETFDVTG TGRYVRMYGT QRGTAYGYSL YEFGIYA
|
| |