Gene Information |
Locus tag | Caci_2837 |
Symbol | |
ID | 8334186 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 3242886 |
End bp | 3246335 |
Gene Length | 3450 bp |
Protein Length | 1149 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644955981 |
Product | hypothetical protein |
Protein accession | YP_003113587 |
Protein GI | 256392023 |
COG category | |
COG ID | |
TIGRFAM ID | |
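The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts one untranslated stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3242886
end_bp = 3246335
gene_length_bp = 3450
protein_length_aa = 1149

# Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon (3 bp) that is not translated.
expected_protein_aa = gene_length_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions both checks agree with the table: the span is 3450 bp and 3450 / 3 - 1 gives 1149 residues.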
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00780437 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.000194456 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
Sequence |
Gene sequence | ATGGCCGCGC TGGAAGCCAG AGCGAAGGCG CTGGCCGACG CCGCCAACTT CGTCGCGGGA AGCGGCGCCA CCTTCAACAC CGACCTCGAT GACGCCGGCC TGCGCGACAA AGTCATCGAG GCCGTTCGGA CGGCCGAGCG GAACATTGCC GTCATCGTCG GCATCGACCT GGACGACCCG GATCTCCGCG AAAAGGTCAT CGAGGCCGTC AGCGCGGCCG AAAAGGGCAT CAACATCATC GTCGGGATGG ATGTCGATGC CGACGGCTTG AAGGAGCGAG TCAAGGCCGA GGCGGATGCT GCCGGCGCCG GCGAGAAGAT CAAGGTCCGG GTCGAGTCCG ACGGAACCAG CCTGGAGCAG GACGTTGCAT CCAAGGCGGG ACGCGTAAAG CCCCAGCCCA TCAAGGTGCC GATCCAGTCG GACGCCGATA AGTTCGAGGC CGAACTGCGC GCGTCGTTCG CGGAGGGCGA GAAGAACGCT GCGGCCGCCG AAAAGGCGAT GAACCAGTCC TTCACTGCGA TGCAGACCGG GGTGCGCGCG CTCCGGTCGG CCATGGCCGA GCTTGAGCCG GCAACGCAGG ATGCTGAAGA TTTTGAAACG TCCTTCCGCA AAGCCATGGA CGAGGGCGAA AGGGTTTCAC AAGAGGCCGA CCGCGTCCTG CGCCAGTCGT TCACCTCCAT GGAATCCGGC TCGCGGACTT TGCGTGCGGC GATGACCGAG CTCCAGCCGG CCGCCGAGGG CGCTGGTCAG GCCGCAGCGA ATGCGGGCAG CGGTTTCAAT ACGAGCGCCC TCAGGATGTC CTCGCTGATC GGGGCGGCCT TGGCGCTCGG CCCAGCGCTG GCAGCGATCC CGGCTGTAGT GGGCGCCGCG GGCGCGGGCT TCGCAACGCT GGGCCTGGGG ATGGCCGGGC CGATTGCCGC GCTGCGGGAC TACGGGGCGC AGAGTCAAGC CACGGGCCAG TCCTCGGCGC AGCTCGCGGC AACGGCCTTC AGTAACGCTG TTTCGATTCG CAATGCCGAG CAGGCAATCG CCGACGCGAA GCGGCAGGCC GCGATTTCGG CGATTAACTC GGCCCAGTCG ATCGAGTCTG CGGAACAGGG CGTTACCGAC GCCGAGCGGC AGGCGGCGAT CTCGGCTCAA TCGGCAGCAG ATGCCGTGGC TTCGGCCGAT CAGCGCCTCG CGAACGCGCA GGAGTCGTTG ACGCAGGCAC AGGAGTCGTT GACGCAGGCT CAGAAGGACG GCGTCAACGT CCTCAAGGAC TTGAATCTGG CTTCGGCCGA TGCGGCGAAC TCTGTCGCGG ACGCGCAGAA CGCGGTCATT GACGCGCAGG CGGCCTACGA CAAGGCCAAG GGCAACAGCC TGCTGACTGA TCAGCAGAAG AAGGAAGCGC AGCAGCAGCT GATCGACGCC CAGCAACACC TGACGGACGC GCAGCAGAAG GCGCTTGAGG CGCAGCAGGC CGCGAACGAC GCCAACCAGA AGGGCGTGGA CGGCAGCACG GCGGTCGTCG CGGCGCAGCG GCAGGTTGTC TCGGCCACGC AGGGCGTAGC CGACGCTCAG CTTGCGGCAA CTCGCGCCCG GGAGGCGCAG GCCAACCAGG AGATCTCAAG CAACCAGTCG GTCGCGAAGG CTCAGCAGTC GCTGGCTACC GCGATCCGCG ATGCGGCGGA ACAGCAGATC TCGTCGAATG AATCGGTGTC CAAGGCGGTT CAGGCACTCA AGGACATGCA GGAGCAGCAG GCTCTGTCTG CCGCGGCTGC GGCATCTTCG GGGTCTGCGG CTGCGAACAA ATTCGCGCAG 
GATATGGCGA AGTTGACCCC GGCCGGCCGG GATTTCGTCA ACCAGCTGAT TTCGATGCGG GGCGGTCTGC ACGATCTCGA GGCCACTGCG CAGACGACGC TGCTGCCCGG CTTCACCACG CTCCTGAAGG ACGTGGGCGG TTCGAACGGC CTCGGCTCGC TGTTCAACAA GGCCGTCGGC GACATGGGCA CGATCATCGG CGGCACGGCG ATCCAGTTCG GGAACCTGAT GACGTCGCCC GCGTTCAAGG GCCAGCTGAC GCAGGTGCTG AAGGACGGCG CGGGGTTCGC GAAGGATCTC GGCGACGGAC TTGTGGCTTT GACGGGCGGC CTGACTAAGG CGGCGTCGCA GGCAGGCCCG ATAGTGTCCG GGCTCGGCGG CGGAATCAAA ACCTTGATGT CGTCCGGGAT TCCCGACTTC TTCAGCGGTC TGGTCACCAA TGCGGGCGGC GCCGGCCAGT CGATACAGGC CATCTTCACG ATCGTTTCCA ACCTTGCCGG TCCGCTGGGC ACGATAGCCG GCGCGTTCTC TGCGGCACTT GCTCCGGCGC TGCAGGTTCT GGACTCCCCG CAGGTCCAGC AGTCACTACA GTCGATCGCG ACTTCAATTG CGCAGATCCT GATCGTCCTG TCGCCGGTGG TCACAATGCT CGCGCAGGGT CTGGCAGGGG CGCTGCGGAT CGTGGCGCCG CTGATGCAGT CGCTGGCGAA GTTCATCCAG GACAACCAGC AGTGGGTGGT GCCGCTGGCC AAGGGGATCG CGATTGCCAC GATCGCTTTT GTCGCTTTCA ACGCAGTGCT CGCTGCGAAC CCCGTCCTGC TGGTGGTAGC CGCAATCGCG GCCCTGGTTC TCGGTGTGGT CTACGCATAT GAGCACTTCA AGATATTCCG CGATGTCATC CACGATGTGT GGGTCGTTAC GAAGGCCGAG TTCGACTTCT TCCTGGGCTT CATAAAGCGG TGGTGGCCGG AGCTGCTGGC ACCGTTCACC GGCGGCGTGT CAGAGATCAT CGCCCACTGG GACGCGGTCG TCGACTTCGT GAAGAAGCTA CCGGGCCGAC TGGTCTCCGC GGGCGCGCAC ATGTGGGACT GGATCTCTCA GAAGTGGGAC GACGACGTAG CCGCGCCGGT CAGCAAGGCC TTCGACGGCT TCATTCACAC AGTGACCGGG CTGCCGGGCA AGTTGGCCAG GGCCGGCGCC GGCATGTGGG ACTGGATCAA GGAAGAGTTC GTCGGCGCCC TCAATGCAAT TGCCAACCTG TGGAACCAGT TGCACTTCAG CACGCCGAGC TTCCACATCC CGATTCCCTT CAGCAGCGGC ATCAACGTCG ACTCGATAAC CGTCGGGGTA CCGCCCATCG GCCCTTTCAA GGCCGCCGGC GGCCCCATCT GGGGCGGCCT GTCCGCGATC ATCGGCGAAG CAGGGACCGA ACTTCTGAAA CTGCCGACCG GCACCCAGGT CATGCCCCAT GCCAACACTC AATCAATGAT CGCCCAGGGC GGCCTTGGAT CGTCCGGCGG CGTGCTTCAG ATCGAGTGGG TCGGCGGCAA CGGCGGCGAC GAGCTCATGA CGTGGATCCG CAAGAACATC CGCATCCGCC ACGGGTCGGA TCCCAACAGC GTCCAGAAGG CCCTCGGGCA GAGCTTTTGA
|
Protein sequence | MAALEARAKA LADAANFVAG SGATFNTDLD DAGLRDKVIE AVRTAERNIA VIVGIDLDDP DLREKVIEAV SAAEKGINII VGMDVDADGL KERVKAEADA AGAGEKIKVR VESDGTSLEQ DVASKAGRVK PQPIKVPIQS DADKFEAELR ASFAEGEKNA AAAEKAMNQS FTAMQTGVRA LRSAMAELEP ATQDAEDFET SFRKAMDEGE RVSQEADRVL RQSFTSMESG SRTLRAAMTE LQPAAEGAGQ AAANAGSGFN TSALRMSSLI GAALALGPAL AAIPAVVGAA GAGFATLGLG MAGPIAALRD YGAQSQATGQ SSAQLAATAF SNAVSIRNAE QAIADAKRQA AISAINSAQS IESAEQGVTD AERQAAISAQ SAADAVASAD QRLANAQESL TQAQESLTQA QKDGVNVLKD LNLASADAAN SVADAQNAVI DAQAAYDKAK GNSLLTDQQK KEAQQQLIDA QQHLTDAQQK ALEAQQAAND ANQKGVDGST AVVAAQRQVV SATQGVADAQ LAATRAREAQ ANQEISSNQS VAKAQQSLAT AIRDAAEQQI SSNESVSKAV QALKDMQEQQ ALSAAAAASS GSAAANKFAQ DMAKLTPAGR DFVNQLISMR GGLHDLEATA QTTLLPGFTT LLKDVGGSNG LGSLFNKAVG DMGTIIGGTA IQFGNLMTSP AFKGQLTQVL KDGAGFAKDL GDGLVALTGG LTKAASQAGP IVSGLGGGIK TLMSSGIPDF FSGLVTNAGG AGQSIQAIFT IVSNLAGPLG TIAGAFSAAL APALQVLDSP QVQQSLQSIA TSIAQILIVL SPVVTMLAQG LAGALRIVAP LMQSLAKFIQ DNQQWVVPLA KGIAIATIAF VAFNAVLAAN PVLLVVAAIA ALVLGVVYAY EHFKIFRDVI HDVWVVTKAE FDFFLGFIKR WWPELLAPFT GGVSEIIAHW DAVVDFVKKL PGRLVSAGAH MWDWISQKWD DDVAAPVSKA FDGFIHTVTG LPGKLARAGA GMWDWIKEEF VGALNAIANL WNQLHFSTPS FHIPIPFSSG INVDSITVGV PPIGPFKAAG GPIWGGLSAI IGEAGTELLK LPTGTQVMPH ANTQSMIAQG GLGSSGGVLQ IEWVGGNGGD ELMTWIRKNI RIRHGSDPNS VQKALGQSF
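The sequences above are printed in 10-character blocks separated by spaces, so the spacing has to be stripped before any computation. As a minimal sketch, the hypothetical helper `gc_fraction` below removes whitespace and computes the G+C fraction of a DNA string; it is applied only to the first three blocks of the gene sequence as a short sample, not to the full 3450 bp, and is not necessarily how the database derived its GC content figure.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the G+C fraction of a DNA string, ignoring whitespace."""
    # Join the space-separated blocks into one continuous sequence.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First three 10-base blocks of the Gene sequence field, used as a sample only.
sample = "ATGGCCGCGC TGGAAGCCAG AGCGAAGGCG"
sample_gc = gc_fraction(sample)
print(f"GC fraction of sample: {sample_gc:.2f}")
```

Passing the complete gene sequence to the same function would give a value to compare with the GC content reported in the Gene Information table.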