Gene Information |
Locus tag | Caci_8841 |
Symbol | |
ID | 8340234 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 10247698 |
End bp | 10250592 |
Gene Length | 2895 bp |
Protein Length | 964 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644961931 |
Product | hypothetical protein |
Protein accession | YP_003119495 |
Protein GI | 256397931 |
COG category | [S] Function unknown |
COG ID | [COG4485] Predicted membrane protein |
TIGRFAM ID | |
|
|
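The coordinate and length fields above are consistent with one another, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 10247698
END_BP = 10250592


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Number of residues encoded by a CDS whose length includes the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 2895, matching "Gene Length"
print(protein_length(length))  # 964, matching "Protein Length"
```

Because the gene lies on the minus strand, the coding sequence starts at the End bp position and reads toward Start bp on the reverse complement; the length calculation is the same for either strand.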
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.920014 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACCAGAA CCGCCTGGCC GGCGATCCGC CGCAATCTGT ACCTGATCCT GGCCTTCGTC GTCCCGGTGG CGGTCTACGC GGCGATCCTG GCCGACCGGC AGGTCTACCC GTTCGACCCG CACGGCTCGT ACACGGTGTT CATGATCGAC CTGAACAACC AGTACGCGCA GTTCTACTCC TATTTCGACC AGGCGCTCAC CGGCCACGGC TCGCTCCTGT TCTCCTGGCG CGCCGACGGC GGCATGAACT TCTGGCCGAT CGTCGCCTAC TACCTGACCA GTCCGTTCGG CCTGCTGACG CTGTTCGGGT CGGACCACCA GCTGCCGGTG CTGATCGCCT TCGCCACGGT GCTCAAGCTC GGCGCGGCGG GTCTGGGCAT GGCGCTGTTC CTGCGCAAGT TCCGGGGCGA GCCCGGCCAC GGGAGCGGCT CGGCGGTGAA CAAGGCCGTG ATCGTGGCGC TGTCCACCGC CTACGCCCTC GGCGCCTGGT CGCTGATCTA CGCCTTCAAC ATCATGTGGC TCGACGCGCT GTACCTGCTG CCGTGGGCGC TGCTCGGCGT GGAGCGCTTG CTGGCCAAGG GAAAGATCAC CTCGCTGGCC GTCGCCATCG GCCTGAACCT GATCATCGAC TTCTACACCG GCGCGATGAT CTGCGTGTTC GTGTGCATGT ACGGCCTGGC GCGCTACGCC GGAGTGCGGG AGCGCTTCGA GCGCGCCGAC TTCCTGCGCA CGGCTGCCAA ATTCGCCGTG GCCGGCGCGA TCGGCGGACT GCTCGCCGCC GCGTTCATCC TGCCGACCTA TCTCGGCGGG CTGACCCAGA AGACCAAGCT CCACGCCGAC GCCTCGGTGA ACCCGCCGAT CCCGGTGCTG TCGCTGCTGG TGCGCTTCTT CGGCGGCACC GTGGACGCCG GGCAGCTGAC TCCGAACATC GGCGCCGCGA CCCTGATCCT GGTGCTGGTT CCGGTGTTCT TCCTGGTGAA GACCATCAAG CGCGCCGAGC GGATAGCCTT CGGCGCCGTC TTGGCGTTCC TCGTCCTGGC GATGGAGATC AAGCCGCTCT ACGTGATCAT GCACGGCGGC CAGATGCCGA ACGCCTTCCC GTTCCGGTTC GCGTTCCTCA TCACCGGGCT GCTGACCTTC CTGGCGTTCC GCGCCTGGGT CGGCATCGAC TCGGTGAAGC AGATCAAGTG GCTCGGGGTC AGTGCCGCGG TGTGGTTCGT GATCCTGTAT TGGGGACGGC AGCAGTACCC GCGGATCATC CGGCCGCAGG TGGTGCAGTT CGACGCGCTG ATCCTGGTCG TCGGGACCGC GCTCCTTTCC TTCGCGCTGT ATCTGCGGGT CCGTCCGGCC GGCGAGCCGC TGTCGAAGCG ACTGGCCTGG ATGCCCAAGC GGCTGGCCAC TCCGAAGGTC GCCGCCGTGG CCGCGATCGC GGTGCTCGCC GTGGACGCCT CGGGCTCGGC CGCGCTGGTC GGGCAGAAGG CGATCGGCAA GCAGCCCTCC GGGAACGGCT CGGTGACCAA GTCCAACTGG ACCAGCTCGC CGACCACTTC CTACGGGACG GCGCTGTCCT CCCTGCAGCC GAGCAACGAC GAGTTCTACC GCGCCGAGGG CGACGACCAG AACCTGCGCA GCACCAACGA CAGCCTGCGC TACGGCAACT TCGGCTTCAC CCACTTCTCC TCGCTGTCCT CCGGCAAGCT GCACAGCGCG ATGCAGAACC TGGGCTTCGC GCACCACTCG GCCGACGTGT GGTCCTCGCA CACCGGCGCC ACGCTGCTCA CCGACGCTCT GCTCGGCTAT 
CAGTACCTGG TCGGCACCAC GCGCGAGACA GCCGACGGCA CGATCGACCG GCTCGGCGCG ACGCTGCAGA AGACGTACGA CAACACCGCG CCGACCGTCC CGGGCAAGAA GCCGCCGACG CCGGACGTCA CCACGGTCTA CAAGATCGAC GACACGCTGC CGGTCGGCTT CCGGCTCGCC GGCTCCGACC TCGCGGACTT CACCGCGCCG GTGCCGGCGA ACTCCCCGTT CGCCGCACAG GAGCAGGCTT TCGGTCTGCC GGGCGCCTTC GCATCGATGT GCGGCAGCCC GACGGTGACC GCCAGCGACG GGGTGACCGT CACGCCCGGC AGCGACGGCT CGGTGTCGAT CAAGGTTCCG GCGAAGGCGC CGACCGGCGC GGAGTACTAC AGCGACAAGA TCGTGTGGCA GTGCCAGTCT TCCGGCGCGC GCCAGGTCTA CCTGTACGCG CCCAGCGCGA TGCCGACCGG CTTGTCCTAC GTCCGGCTCG ACGGCCAGAA CCGCCCGGCG CCGAAGGCGG GCGCGGTCGC GGAGGACAAG ACGAACATCC TCTACCCCTC GGGCTTCGCC AACGGCGTGC AGGACCTCGG CAGCGTCCAG TCCGGTTCGT TCACAGTGAC AATGTCCACG CAGAAGCTGC CGTTGAAGAA CACGTACACG GTCCCGGCCA ACCCGGTCCG CGGGCTGGAC CCGACCGCCG TGAACGCGAA GCTGGCGCAG CTGCGCGGCG GCGGCGTGAG CGACGTGCAC TGGACTGACC GGGGTCTGAC GGCCACGACG ACCGGGGACA GCGCGGCGAC CGTCTTCCTG TCGATCCCGA CGATCCCCGG CTGGTCGGTC ACCGTCGACG GCAAGAGCGT GAAGACCACC GAGCTGCTCG GTTCCTTCAC CGGCGTCCCG GTCCCGGCCG GCACGCACCG GATCTCGATG TCCTTCACCC CGCCCGGGCT CACCGCCGGG TTCGGCGGCA GCGCCGTCGG CCTGGTCGCG CTCGGCGGGG TCTGGTGGTT CCAGCGGCGG CGCGCCGCCG CCGGTTCGGC GCCCGCGGCC CCGGCCGCCG GCGAGGAGCA GGTGTCCGAG GACGTGATGT CGTGA
|
Protein sequence | MTRTAWPAIR RNLYLILAFV VPVAVYAAIL ADRQVYPFDP HGSYTVFMID LNNQYAQFYS YFDQALTGHG SLLFSWRADG GMNFWPIVAY YLTSPFGLLT LFGSDHQLPV LIAFATVLKL GAAGLGMALF LRKFRGEPGH GSGSAVNKAV IVALSTAYAL GAWSLIYAFN IMWLDALYLL PWALLGVERL LAKGKITSLA VAIGLNLIID FYTGAMICVF VCMYGLARYA GVRERFERAD FLRTAAKFAV AGAIGGLLAA AFILPTYLGG LTQKTKLHAD ASVNPPIPVL SLLVRFFGGT VDAGQLTPNI GAATLILVLV PVFFLVKTIK RAERIAFGAV LAFLVLAMEI KPLYVIMHGG QMPNAFPFRF AFLITGLLTF LAFRAWVGID SVKQIKWLGV SAAVWFVILY WGRQQYPRII RPQVVQFDAL ILVVGTALLS FALYLRVRPA GEPLSKRLAW MPKRLATPKV AAVAAIAVLA VDASGSAALV GQKAIGKQPS GNGSVTKSNW TSSPTTSYGT ALSSLQPSND EFYRAEGDDQ NLRSTNDSLR YGNFGFTHFS SLSSGKLHSA MQNLGFAHHS ADVWSSHTGA TLLTDALLGY QYLVGTTRET ADGTIDRLGA TLQKTYDNTA PTVPGKKPPT PDVTTVYKID DTLPVGFRLA GSDLADFTAP VPANSPFAAQ EQAFGLPGAF ASMCGSPTVT ASDGVTVTPG SDGSVSIKVP AKAPTGAEYY SDKIVWQCQS SGARQVYLYA PSAMPTGLSY VRLDGQNRPA PKAGAVAEDK TNILYPSGFA NGVQDLGSVQ SGSFTVTMST QKLPLKNTYT VPANPVRGLD PTAVNAKLAQ LRGGGVSDVH WTDRGLTATT TGDSAATVFL SIPTIPGWSV TVDGKSVKTT ELLGSFTGVP VPAGTHRISM SFTPPGLTAG FGGSAVGLVA LGGVWWFQRR RAAAGSAPAA PAAGEEQVSE DVMS
|
| |
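The gene and protein sequences above can be cross-checked against each other. The following is a minimal sketch, not the database's own method: it builds the standard codon table (translation table 11 uses the same amino acid assignments and differs mainly in which codons may initiate translation) and reads a listed start codon as methionine when it opens the CDS. The `translate` helper and the reduced `START_CODONS` set are assumptions made for illustration; table 11 lists further, rarer initiators.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Common bacterial initiator codons (a subset of those allowed by table 11).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str, is_cds_start: bool = False) -> str:
    """Translate an in-frame DNA fragment; spaces between 10-base blocks are ignored."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("fragment length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[c] for c in codons]
    # An initiator codon at the very start of the CDS is read as methionine.
    if is_cds_start and codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues)


# First 30 bases and last 15 bases of the gene sequence in the record.
print(translate("GTGACCAGAA CCGCCTGGCC GGCGATCCGC", is_cds_start=True))  # MTRTAWPAIR
print(translate("GACGTGATGT CGTGA"))  # DVMS*
```

The first result matches the opening block of the protein sequence, and the second matches its final four residues followed by the TGA stop codon. Note that the internal GTG in the last fragment is read as valine, whereas the initial GTG is read as methionine.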