Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_3843 |
Symbol | |
ID | 8335196 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4351190 |
End bp | 4354330 |
Gene Length | 3141 bp |
Protein Length | 1046 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 644956979 |
Product | NHL repeat containing protein |
Protein accession | YP_003114582 |
Protein GI | 256393018 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.0246016 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTC CTATCAAGCT CCGCTGGGCG GCCGTGGCGG CGAGCGCCGT GGTCGGGTCG GCGGCGATCG GCCTGGGGGT GGCCGCTGCC GCGGGCGGTC CCGCCGCGTC GACCGGCGGC GGGCAGAGCT ACTCCGCGGT CTGTCCGGCC GCGGACGTGG TCACGCCCGC GTTGAACAGC AACCTGATCG AGAACCCCGG GGCCGAGGAC TACACGGCCC TCACCACGCT CGGCGAACCC GCTACCGACC TGCAGTACGC GCCCGACTGC TGGGTGAGCA CCTCCCCGAT GGGCGGCCAG GGCGCCGTCC TGGAGTCGGC GGCGTCCTCC GCGGTGCCCG GCCAGACCGG CAGCCGCACC TTCTACGGCG GCTACGACTA CGACTCCCCT CAGGTCTCGA TCGTGGGCGT GACGACCACG GCCACCCAGC TGATCGACGT GAGCTCGCTC GGCGCGGGCA CCCGCGCGTA CACGCTGACC GGCGAGATCG GCGGCTACAC GACCCAGACC GACTACGCGA CGGTCACCGC CCGGTTCGAA GACGCCGCCG GCGCCCCGCT CGGCTCGGCC GTCCTCGGCC CGGTCGACCC GGCCCAGCGC ACCAACGTCA CGAGCCTGAT TCCGCAGGCG GCGACAGGCA CCGTGCCGGC CGGCACCGCC CAGATCCTGA TCACCGTCGC CTCGACCGGC GTCAGCGCCG GGTACGGCAT CGACGGCCGC GCCGACAACC TGAACCTGAC GATCTCCTCC GGCGACTCCG GCCAGAGCTA CACCGTGCCG TGCCCGGCCT CCGACACGGT CACCCCGGCG CTGAACAGCA ACCTGATCGA GAACCCGGGC GCCGAGGACT CCACCGCGGC GACCGCCCTC GGGGCACCCG CCGGCGACGA CCAGACCGTC GCTGACTGCT GGACGAGCGC CTCCCCGCTC GCTGCGCCCG ACGGGACGCA GGAGTCGAGT CCCTCGACCA ACCCGGGCGT CACCGGCAGC CGCGTGTTCT ACGGCGGCAC CAACCCGAGC ACCGTCGCGG TGGCCGGCGT GGTCACCACC GGCAGCCAGG TCATCGACGT CAGCTCCCTG AAGGCGGCCG GCCAGCCGTT CAAGCTGACC GGCAAGGTGG GCGGCTACTC GACCCAGAGC GACTACGCCG AAGTCGTCGC CACTTTCGAG AACGCTTCCG GCGCCAGCCT CGGCACCGCC CAGATCGGTC CGGCGACCCC GGCCGACCGC GCCAACGTCT CGGAACTGCT CCCCGACGGC GCCTACGGCA CCGTCCCGGA CGGCACCGCG AAGATCCTCG TCACGATCAT CACGGGGGGC GTCAACGCGG GCGCGAACAG CGACGGCACG GCCGACGACC TGAACCTGAC GATCGGTCAG AACGCGGTCG GCCAGAGCTA CACCGTGCCG TGCCCGGCCT CCGACACGGT CACCCCGGCC CTGAACAGCA ACCTGATCGA GAACCCGAGC GCCGAGGACT CCACGACGGC GACCGCACTC GGCGCGCCGC TCGGCGACGA CCAGACCGTC GCGGACTGCT GGACGAGCGC GTCCCCGCTC AGCCCGCCCG ACGGCACGCA GGAGTCGCGC ACCTCGACCT ACCCCGGCGT CACCGGCAGC CGCGTGTTCT ACGGCGGCAC CAACCCGAGC ACCGTCTCGA TCCCCGGCAT CAGCACCACC GGCACCCAGT CCGTCGACCT GAGCGCGCTC GCCGTCGGCG GCCAGCCGTT CAAACTGTCC GCCGACCTCG GCGGCTACTC GACCCAGGGT GACTACGCGA CCGTGACGGC GGCGTTCCAG GACACGAAGG GCGCCACGCT CAGCACCGCC AAGATCGGCC CGGTCACCGC GGCGCAGCGG GGCAACGCCT CCAGCCTGAT CCCGCAGGCC TGGTACGGCG ATGTCCCCGC CGGCACGCAG AAGGTCGTCG TCACGATCGC GACGGTCGGC GTGAGCAGCG GCGCGAACAG CGACGGCACG GCCGACAACC TGAACCTGAC GATCGGTCAG AGCGCCGTGC CGAGCGGCCC GGTGCTGCAG ACCATGCCGT ACGCCTCGGT GGGCGACGAC ACGGGCACGC ACGTCGACCC GGGCAGCGGC GCGGTCGTCC CGAACGTCGT GCCGGGCGCC CTGGACCGGC CGGCCGGTGT GTCGGCGTTC AAGGGCACCG TGGACGTGTC GAACACCGGT GACAACGTCG TCTCCGCGCT GCAGAACGGC TCCACCTCGG TCATCGCCGG CTCGCTGGAG GCGTACGGCG AGCACGGCGA CGGCGGCAAG GCCACGTCCG CATCGCTGTA TCAGCCCTCT GGCAGCGCCA CGGACGCCGC CGGCGATCTG TTCGTCGCCG ACGCCGGCGA CAACGTGGTG CGCGAGATCG CCGCGAACGG GACGATCAGC CGCTTCGCCG GCACCGTCCC CGGCGGCTCG TGGTCCGGCG CCGGCCTCGG CGGCCTGGTC CCGCTCGACC ATCCCGAGGC CGTCGCCGTG AACGCCGCCG GCGACGTGTT CATCGCGGAC ACCTACGCCG ACCGGGTGGT CGAGCTCACG CCGCGGGGCC TGCTGCTGCG CCTGATCGGC ACCGGCCGCG CCGGCTACTC CGGCGACGGG CGACCGAGCC CGCTCGCGCA GCTGAACCAG CCGATCGGGC TCGCGCTCGA CGCCCAGGGC GACCTCTACA TCGCGGACTC GGCCAACAAC GTGATCCGCC GCGTGGACGC GCGCACCGGG ATCATCACGA CCGTCGCGGG CGACCACGCG GCCGGCAAGG CGGCCGGCGG CCTCGGCGGA TTCTCCGGCG ACGGCGGGCC CGCGACCTCG GCGCAGCTCA ACGACCCGCA GGGGGTGGCG GTGGACGGCG CCGGCGACCT GTTCGTCGCG GACACCTTCG ACAACGCGAT CCGCGAGGTC ACCCCGGACG GGACCATCAG CACCGTGGTG AACTCCTCCG CCGCTCCCGG CGGGGAGAGC AGCGGCGCGG CCCCGACCGC CTCGCACCTG AACACCCCGT ACGCCGTCAC AGTGGATCCG TCCACGGACC TGCTGTACAT CGCCGACACC CGCAACAGCG TCATCGCCCA GGTGATCGGC CTGGCCCGCG CCGGCCACGC GCCGGGCCCG GTGGCCCCGC CCGCGTCCTG A
|
Protein sequence | MKLPIKLRWA AVAASAVVGS AAIGLGVAAA AGGPAASTGG GQSYSAVCPA ADVVTPALNS NLIENPGAED YTALTTLGEP ATDLQYAPDC WVSTSPMGGQ GAVLESAASS AVPGQTGSRT FYGGYDYDSP QVSIVGVTTT ATQLIDVSSL GAGTRAYTLT GEIGGYTTQT DYATVTARFE DAAGAPLGSA VLGPVDPAQR TNVTSLIPQA ATGTVPAGTA QILITVASTG VSAGYGIDGR ADNLNLTISS GDSGQSYTVP CPASDTVTPA LNSNLIENPG AEDSTAATAL GAPAGDDQTV ADCWTSASPL AAPDGTQESS PSTNPGVTGS RVFYGGTNPS TVAVAGVVTT GSQVIDVSSL KAAGQPFKLT GKVGGYSTQS DYAEVVATFE NASGASLGTA QIGPATPADR ANVSELLPDG AYGTVPDGTA KILVTIITGG VNAGANSDGT ADDLNLTIGQ NAVGQSYTVP CPASDTVTPA LNSNLIENPS AEDSTTATAL GAPLGDDQTV ADCWTSASPL SPPDGTQESR TSTYPGVTGS RVFYGGTNPS TVSIPGISTT GTQSVDLSAL AVGGQPFKLS ADLGGYSTQG DYATVTAAFQ DTKGATLSTA KIGPVTAAQR GNASSLIPQA WYGDVPAGTQ KVVVTIATVG VSSGANSDGT ADNLNLTIGQ SAVPSGPVLQ TMPYASVGDD TGTHVDPGSG AVVPNVVPGA LDRPAGVSAF KGTVDVSNTG DNVVSALQNG STSVIAGSLE AYGEHGDGGK ATSASLYQPS GSATDAAGDL FVADAGDNVV REIAANGTIS RFAGTVPGGS WSGAGLGGLV PLDHPEAVAV NAAGDVFIAD TYADRVVELT PRGLLLRLIG TGRAGYSGDG RPSPLAQLNQ PIGLALDAQG DLYIADSANN VIRRVDARTG IITTVAGDHA AGKAAGGLGG FSGDGGPATS AQLNDPQGVA VDGAGDLFVA DTFDNAIREV TPDGTISTVV NSSAAPGGES SGAAPTASHL NTPYAVTVDP STDLLYIADT RNSVIAQVIG LARAGHAPGP VAPPAS
|
| |