Gene Caci_3843 details


Gene Information

Locus tag: Caci_3843
Symbol:
ID: 8335196
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 4351190
End bp: 4354330
Gene Length: 3141 bp
Protein Length: 1046 aa
Translation table: 11
GC content: 73%
IMG OID: 644956979
Product: NHL repeat containing protein
Protein accession: YP_003114582
Protein GI: 256393018
COG category:
COG ID:
TIGRFAM ID:
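
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the terminal stop codon; neither is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 4351190
END_BP = 4354330
GENE_LENGTH_BP = 3141
PROTEIN_LENGTH_AA = 1046


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # 4354330 - 4351190 + 1 = 3141 bp
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    # 3141 / 3 = 1047 codons, minus the stop codon = 1046 aa
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the reported Gene Length and Protein Length.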


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0246016
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTTC CTATCAAGCT CCGCTGGGCG GCCGTGGCGG CGAGCGCCGT GGTCGGGTCG 
GCGGCGATCG GCCTGGGGGT GGCCGCTGCC GCGGGCGGTC CCGCCGCGTC GACCGGCGGC
GGGCAGAGCT ACTCCGCGGT CTGTCCGGCC GCGGACGTGG TCACGCCCGC GTTGAACAGC
AACCTGATCG AGAACCCCGG GGCCGAGGAC TACACGGCCC TCACCACGCT CGGCGAACCC
GCTACCGACC TGCAGTACGC GCCCGACTGC TGGGTGAGCA CCTCCCCGAT GGGCGGCCAG
GGCGCCGTCC TGGAGTCGGC GGCGTCCTCC GCGGTGCCCG GCCAGACCGG CAGCCGCACC
TTCTACGGCG GCTACGACTA CGACTCCCCT CAGGTCTCGA TCGTGGGCGT GACGACCACG
GCCACCCAGC TGATCGACGT GAGCTCGCTC GGCGCGGGCA CCCGCGCGTA CACGCTGACC
GGCGAGATCG GCGGCTACAC GACCCAGACC GACTACGCGA CGGTCACCGC CCGGTTCGAA
GACGCCGCCG GCGCCCCGCT CGGCTCGGCC GTCCTCGGCC CGGTCGACCC GGCCCAGCGC
ACCAACGTCA CGAGCCTGAT TCCGCAGGCG GCGACAGGCA CCGTGCCGGC CGGCACCGCC
CAGATCCTGA TCACCGTCGC CTCGACCGGC GTCAGCGCCG GGTACGGCAT CGACGGCCGC
GCCGACAACC TGAACCTGAC GATCTCCTCC GGCGACTCCG GCCAGAGCTA CACCGTGCCG
TGCCCGGCCT CCGACACGGT CACCCCGGCG CTGAACAGCA ACCTGATCGA GAACCCGGGC
GCCGAGGACT CCACCGCGGC GACCGCCCTC GGGGCACCCG CCGGCGACGA CCAGACCGTC
GCTGACTGCT GGACGAGCGC CTCCCCGCTC GCTGCGCCCG ACGGGACGCA GGAGTCGAGT
CCCTCGACCA ACCCGGGCGT CACCGGCAGC CGCGTGTTCT ACGGCGGCAC CAACCCGAGC
ACCGTCGCGG TGGCCGGCGT GGTCACCACC GGCAGCCAGG TCATCGACGT CAGCTCCCTG
AAGGCGGCCG GCCAGCCGTT CAAGCTGACC GGCAAGGTGG GCGGCTACTC GACCCAGAGC
GACTACGCCG AAGTCGTCGC CACTTTCGAG AACGCTTCCG GCGCCAGCCT CGGCACCGCC
CAGATCGGTC CGGCGACCCC GGCCGACCGC GCCAACGTCT CGGAACTGCT CCCCGACGGC
GCCTACGGCA CCGTCCCGGA CGGCACCGCG AAGATCCTCG TCACGATCAT CACGGGGGGC
GTCAACGCGG GCGCGAACAG CGACGGCACG GCCGACGACC TGAACCTGAC GATCGGTCAG
AACGCGGTCG GCCAGAGCTA CACCGTGCCG TGCCCGGCCT CCGACACGGT CACCCCGGCC
CTGAACAGCA ACCTGATCGA GAACCCGAGC GCCGAGGACT CCACGACGGC GACCGCACTC
GGCGCGCCGC TCGGCGACGA CCAGACCGTC GCGGACTGCT GGACGAGCGC GTCCCCGCTC
AGCCCGCCCG ACGGCACGCA GGAGTCGCGC ACCTCGACCT ACCCCGGCGT CACCGGCAGC
CGCGTGTTCT ACGGCGGCAC CAACCCGAGC ACCGTCTCGA TCCCCGGCAT CAGCACCACC
GGCACCCAGT CCGTCGACCT GAGCGCGCTC GCCGTCGGCG GCCAGCCGTT CAAACTGTCC
GCCGACCTCG GCGGCTACTC GACCCAGGGT GACTACGCGA CCGTGACGGC GGCGTTCCAG
GACACGAAGG GCGCCACGCT CAGCACCGCC AAGATCGGCC CGGTCACCGC GGCGCAGCGG
GGCAACGCCT CCAGCCTGAT CCCGCAGGCC TGGTACGGCG ATGTCCCCGC CGGCACGCAG
AAGGTCGTCG TCACGATCGC GACGGTCGGC GTGAGCAGCG GCGCGAACAG CGACGGCACG
GCCGACAACC TGAACCTGAC GATCGGTCAG AGCGCCGTGC CGAGCGGCCC GGTGCTGCAG
ACCATGCCGT ACGCCTCGGT GGGCGACGAC ACGGGCACGC ACGTCGACCC GGGCAGCGGC
GCGGTCGTCC CGAACGTCGT GCCGGGCGCC CTGGACCGGC CGGCCGGTGT GTCGGCGTTC
AAGGGCACCG TGGACGTGTC GAACACCGGT GACAACGTCG TCTCCGCGCT GCAGAACGGC
TCCACCTCGG TCATCGCCGG CTCGCTGGAG GCGTACGGCG AGCACGGCGA CGGCGGCAAG
GCCACGTCCG CATCGCTGTA TCAGCCCTCT GGCAGCGCCA CGGACGCCGC CGGCGATCTG
TTCGTCGCCG ACGCCGGCGA CAACGTGGTG CGCGAGATCG CCGCGAACGG GACGATCAGC
CGCTTCGCCG GCACCGTCCC CGGCGGCTCG TGGTCCGGCG CCGGCCTCGG CGGCCTGGTC
CCGCTCGACC ATCCCGAGGC CGTCGCCGTG AACGCCGCCG GCGACGTGTT CATCGCGGAC
ACCTACGCCG ACCGGGTGGT CGAGCTCACG CCGCGGGGCC TGCTGCTGCG CCTGATCGGC
ACCGGCCGCG CCGGCTACTC CGGCGACGGG CGACCGAGCC CGCTCGCGCA GCTGAACCAG
CCGATCGGGC TCGCGCTCGA CGCCCAGGGC GACCTCTACA TCGCGGACTC GGCCAACAAC
GTGATCCGCC GCGTGGACGC GCGCACCGGG ATCATCACGA CCGTCGCGGG CGACCACGCG
GCCGGCAAGG CGGCCGGCGG CCTCGGCGGA TTCTCCGGCG ACGGCGGGCC CGCGACCTCG
GCGCAGCTCA ACGACCCGCA GGGGGTGGCG GTGGACGGCG CCGGCGACCT GTTCGTCGCG
GACACCTTCG ACAACGCGAT CCGCGAGGTC ACCCCGGACG GGACCATCAG CACCGTGGTG
AACTCCTCCG CCGCTCCCGG CGGGGAGAGC AGCGGCGCGG CCCCGACCGC CTCGCACCTG
AACACCCCGT ACGCCGTCAC AGTGGATCCG TCCACGGACC TGCTGTACAT CGCCGACACC
CGCAACAGCG TCATCGCCCA GGTGATCGGC CTGGCCCGCG CCGGCCACGC GCCGGGCCCG
GTGGCCCCGC CCGCGTCCTG A
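
The GC content field can be recomputed from the gene sequence once the 10-base block spacing is removed. This is a minimal sketch, not necessarily how the database derives its figure; `clean_sequence` and `gc_percent` are hypothetical helper names, and only the first sequence line is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above; paste the full sequence
# to compare the result against the reported GC content.
first_line = (
    "ATGAAACTTC CTATCAAGCT CCGCTGGGCG GCCGTGGCGG CGAGCGCCGT GGTCGGGTCG"
)
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Running it on the whole gene sequence, rather than one line, gives the value to compare with the GC content reported in the Gene Information section.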
 
Protein sequence
MKLPIKLRWA AVAASAVVGS AAIGLGVAAA AGGPAASTGG GQSYSAVCPA ADVVTPALNS 
NLIENPGAED YTALTTLGEP ATDLQYAPDC WVSTSPMGGQ GAVLESAASS AVPGQTGSRT
FYGGYDYDSP QVSIVGVTTT ATQLIDVSSL GAGTRAYTLT GEIGGYTTQT DYATVTARFE
DAAGAPLGSA VLGPVDPAQR TNVTSLIPQA ATGTVPAGTA QILITVASTG VSAGYGIDGR
ADNLNLTISS GDSGQSYTVP CPASDTVTPA LNSNLIENPG AEDSTAATAL GAPAGDDQTV
ADCWTSASPL AAPDGTQESS PSTNPGVTGS RVFYGGTNPS TVAVAGVVTT GSQVIDVSSL
KAAGQPFKLT GKVGGYSTQS DYAEVVATFE NASGASLGTA QIGPATPADR ANVSELLPDG
AYGTVPDGTA KILVTIITGG VNAGANSDGT ADDLNLTIGQ NAVGQSYTVP CPASDTVTPA
LNSNLIENPS AEDSTTATAL GAPLGDDQTV ADCWTSASPL SPPDGTQESR TSTYPGVTGS
RVFYGGTNPS TVSIPGISTT GTQSVDLSAL AVGGQPFKLS ADLGGYSTQG DYATVTAAFQ
DTKGATLSTA KIGPVTAAQR GNASSLIPQA WYGDVPAGTQ KVVVTIATVG VSSGANSDGT
ADNLNLTIGQ SAVPSGPVLQ TMPYASVGDD TGTHVDPGSG AVVPNVVPGA LDRPAGVSAF
KGTVDVSNTG DNVVSALQNG STSVIAGSLE AYGEHGDGGK ATSASLYQPS GSATDAAGDL
FVADAGDNVV REIAANGTIS RFAGTVPGGS WSGAGLGGLV PLDHPEAVAV NAAGDVFIAD
TYADRVVELT PRGLLLRLIG TGRAGYSGDG RPSPLAQLNQ PIGLALDAQG DLYIADSANN
VIRRVDARTG IITTVAGDHA AGKAAGGLGG FSGDGGPATS AQLNDPQGVA VDGAGDLFVA
DTFDNAIREV TPDGTISTVV NSSAAPGGES SGAAPTASHL NTPYAVTVDP STDLLYIADT
RNSVIAQVIG LARAGHAPGP VAPPAS
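
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below uses the standard codon-to-amino-acid assignments, which translation table 11 shares; table 11 differs in the set of permitted start codons, which this sketch does not model (this gene starts with ATG, so it does not matter here). `translate` is an illustrative helper, not part of any library.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGAAACTTC CTATCAAGCT CCGCTGGGCG"))
print(translate("GTGGCCCCGC CCGCGTCCTG A"))
```

The two fragments correspond to the start (MKLPIKLRWA) and end (VAPPAS) of the protein sequence listed above.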