Gene Caci_3433 details

Gene Information

Locus tag: Caci_3433
Symbol:
ID: 8334786
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3798835
End bp: 3802119
Gene Length: 3285 bp
Protein Length: 1094 aa
Translation table: 11
GC content: 73%
IMG OID: 644956577
Product: condensation domain protein
Protein accession: YP_003114180
Protein GI: 256392616
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID:
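
The start, end, and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(3798835, 3802119)
print(length, protein_length_aa(length))  # prints "3285 1094"
```

Both results agree with the Gene Length and Protein Length fields of the record.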


Plasmid Coverage information

Number of covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid hitchhiking: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCCACG TTTCGCAGGC TCACGACCAA GCCGGAGCGG CCGGCGCGGT ACCGCCGCGC 
ACACCGTACG AGGAGGCCGT CGCCGCCATC TGGCGCGACA TCCTGGGCCG CCCCGACGTC
GGCGCGCTCG ACGACTTCTT CAGCCTGGAC GCCACCTCTT TGCAGGCGAT CCAGGTCGTC
TCCCGCATCC GCAAGACGCT CGGCGTGGAC ATCCCGGTCA AGGACGTCTT CCAGGAGCCG
ACCGTCGCCG CCCTGGCCGC CCGCGTCCAG GCCGAGTCCG CCTCGCGGCG CAGTGCCCTG
ACGCGCCGCC CCGCCGATGC CGAGCCGGTG CTGTCCTTCG ACCAGGAGCG GTTGTGGCTG
GAGAGCCAGC TGCTGCCGCC GACCGCCTAC AACGTGCACG GCCGCCGGCG CCTGGCCGGA
CCGGTGGACG TCGCGGCGCT GGAAGCCGGC GTCGCGGCGA TCGTGGCGCG GCACGACGCG
CTGCGGGCCC GGTTCCCGGT GATCGACGGG CAGCCGGTGC AGATCGTCGA CCCGCCGGAC
GCGCAGTGGC GCCTGGAGAC GGCGCGCACC GACAGCCTGG CGGCGGCGCT GCGGCTCGCG
GACCGGCAGG CTGAGACGGC GTTCGACCTG GCGCTCGGGC CGCTGTTCCG CTGCCTGCTG
GTCGCGGTGG ACGGTAGTGA CGGCAGCGGC GGCGGCACCA ATGGCGCGAC CGACGCAGAG
CCCGAATACG TCCTCAGCGT CACGGTGCAC CACATCGTCG CCGACGCCTG GTCGATCGGT
CTGTTCGTCC GGGAACTGCT CGCGCTGTAC GCCGCGGGCG GCGACCCCGA GCGCGCCGGA
TTGCCGGAGC TGACGGTCCA GTACCCGGAC TTCGCGGTCT GGCAGCGCGC GCATATCGCC
GGCGAGGAAC TCACCGGGCA GCTCGCCTAC TGGCGCGACC ACCTCGCCGG TGCGCCGCCG
GTACTGGCGA TGCCGGTGTC ACGGCGGCTG GCGGTCCCCG GCGGCGGCAT CGAGCGGGCC
CGCGCCGAGC TGACCGGCGC GGAGTCGGCG GCGTTGGGCG ACCTGCGGCG CAAGCACGGC
GTGACCACGT TCATGGTGCT GCTGGCGGCC CTCGGCGCGA CGCTGGGACG CTGGTCCGGA
CGGCGCGACG TGGTGGTCGG CGTGCCGATC GCCGGGCGCG CGGACGCCGG CACGCACGCG
CTGATCGGCT TCTTCGTCAA CACCCTGCCG GTCCGCGTGG ACCTGCGCGG CGACCCGTCG
TTCGGCGACC TGCTGGCTCG GGTGCGCCAG GCCGCGCTGG ACGGCTACGC CAACGCCGAC
GCGCCCTTCG ACGTGCTGGT CAAGGAGCTC CAGGCGCCGC GCGACCCGCG CCAGACCCCG
CTGTTCCAGG CGCTGCTGAA CGTGATCGGG CCGCCGGAGG CCGAGACGGT CGCCGGGATC
GCGGTCGAGC CGCTGGAGCT GCCGGCGTTG CCGAGCAAGT TCGACCTCGC GCTCACCGCG
CAGGAGCGCG GCGGACGGCT GGGCTTCGAC CTGGCGTTCA ACGCCGACCG GTATCACGGC
TCGATGATGC GTGAGCTGCT GGCGCAGGTG GTCGCGCTGG TGCGGGACGC GCTCGAGGAT
CCGGGGCGGG CGGTGTCGGA GCTCGGCGGG GCTCCGGGAG CTGCCGCCGA CTCCTCCGAT
GCTTCCGGCT CTGCCGATGG AAACGCCACC GCAGCCGCCT GGACCCCGCA TCTCGCCGTG
GACCGCTTCG CTCAGCAGGC TGATCGCGTC GCGGTCGTCG GCGCCGACGG CGAACACGGC
TACCGGTGGC TGGCGCGCGC CGCCGACCGG GTCGCCGCGT TCCTCGGCGC GCGGGAGGCC
GAGCCGGGAC GCGTGGGCAT CGCCCGGCAT CCCACCGCCG CGTTCGTCGC GACGGTCTTG
GGCTGCCTGA AGGCAGGTCT GGAATTCACC GTGACTGAGC CTGCGGCCGG CGTCCCTGCG
AGCTTCCTCG GACTATCGCA GCTCCTTGAC ACGGAGGAGA CCGAGGGTCT CAGCACCCTG
TTCGAGGACC TCAAGGAACC ACTACCTGCG GCAGAATCTG AAGCCACCAC TCACGAACGC
GACGACTGGG CCGTGGCACG CTTCGGCTTC AGCCGTGACG ATCGCTTCGC CGCTCCGGCA
ACGAGCCCCG GTCTGCTCGT CTCGGCGCTG TCCAGCGCAC TGAGCGCGGG CGCGACGCTG
GTCATGACCG AGCTGACTCC GGCGGGCGGC GTCGCAGAAC TCGGCGACTG GCTTCGCTCG
CAGGCGGTGA GCGTTCTCTA CACCGCGCCG CCACTGATCC GCGCGCTGGC CGCTGCCGAC
CTGCGCCTGC CGACGCTGCG ATTCGCGCTG GTCGACAACG CCGGCGACTT CCTGCCGCAC
GATGTCGAAG CAGCGGCTTT GCTGTCACCG GACTGCCGCT GCGTGAGCCT GTACCGGGTC
GGGCAGGACG GTCGACCGGT CGCGGTCTAC GCGGTCCCCG CAGACTTCAC CGTCGCCTCG
GCGCCGCTGC GCGTCCCGCT CGGGACCGGC GCTGTCGGGC TGCCGCATCC CTCGGGGCGT
CCGGCCGCGA TCGGCGAGAT CGCCGAGATC CGCGCCGACG GACGGCGCAC CGGCGACCTG
GGCCGTTGGC GCGCCGACGG CGTCCTGGAG TACACCGGCC TGGCCGGCGC GGACCCGGGG
CAGGACCTCG CCGAGGCGGC GTCGGCGTTG CGCGACGTCG CGGAGGTCCG CGACGCACTG
GTCACCGAGC AGGTCGGCGA CGAGGGCGAC GCGATCGTGG TCGCCTACCT GGTCGGCCCG
GATCCGGACG CGGGTACCTC GGGAATCCGC CGCTACCTCA TCAGCAGGCT GCCGGAGTGG
CTGATCCCCG GCGCGCTGGT GGTGGTCGGC GCACTGCCGC TGACCGCCGA GGGTGACCAC
GACGTGGCGC TGCTGCCGCG CACCGACCCC GGCGCCGCGA CCGAGGTGTA CGTCGCGCCC
CGCACACCGA TGGAGCAGCA GCTGGTGGAC GTGATGGCGG CGCTGCTGGC GGTGGACCGG
ATCGGCGTCC ACGACACCTT CTTCGAACTC GGCGGCTTCT CGCTGCTGGC GACCCGCCTG
ACTTCTCGCA TCCGCGACCT GTTCGACGTG GAGCTGTCGC TGCGCGACGT GTTCGAGGCG
CCGACCGTGG AAGGGCTCGC ACAGCTCATC CTGCGAGCGC AGAGCGAGGC CTTCGGCGGC
GAGGATTTGG AGGGACTGCT GGCGGAGATC ACCGCTGCGG ACTAA
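
The gene sequence is printed in blocks of 10 bases, so the whitespace has to be stripped before any computation such as the GC content reported in the record. A minimal sketch, with hypothetical helper names and only the first line of the sequence pasted in for brevity:

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste the full block to
# compare against the GC content field of the record.
first_line = "ATGGCCCACG TTTCGCAGGC TCACGACCAA GCCGGAGCGG CCGGCGCGGT ACCGCCGCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```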
 
Protein sequence
MAHVSQAHDQ AGAAGAVPPR TPYEEAVAAI WRDILGRPDV GALDDFFSLD ATSLQAIQVV 
SRIRKTLGVD IPVKDVFQEP TVAALAARVQ AESASRRSAL TRRPADAEPV LSFDQERLWL
ESQLLPPTAY NVHGRRRLAG PVDVAALEAG VAAIVARHDA LRARFPVIDG QPVQIVDPPD
AQWRLETART DSLAAALRLA DRQAETAFDL ALGPLFRCLL VAVDGSDGSG GGTNGATDAE
PEYVLSVTVH HIVADAWSIG LFVRELLALY AAGGDPERAG LPELTVQYPD FAVWQRAHIA
GEELTGQLAY WRDHLAGAPP VLAMPVSRRL AVPGGGIERA RAELTGAESA ALGDLRRKHG
VTTFMVLLAA LGATLGRWSG RRDVVVGVPI AGRADAGTHA LIGFFVNTLP VRVDLRGDPS
FGDLLARVRQ AALDGYANAD APFDVLVKEL QAPRDPRQTP LFQALLNVIG PPEAETVAGI
AVEPLELPAL PSKFDLALTA QERGGRLGFD LAFNADRYHG SMMRELLAQV VALVRDALED
PGRAVSELGG APGAAADSSD ASGSADGNAT AAAWTPHLAV DRFAQQADRV AVVGADGEHG
YRWLARAADR VAAFLGAREA EPGRVGIARH PTAAFVATVL GCLKAGLEFT VTEPAAGVPA
SFLGLSQLLD TEETEGLSTL FEDLKEPLPA AESEATTHER DDWAVARFGF SRDDRFAAPA
TSPGLLVSAL SSALSAGATL VMTELTPAGG VAELGDWLRS QAVSVLYTAP PLIRALAAAD
LRLPTLRFAL VDNAGDFLPH DVEAAALLSP DCRCVSLYRV GQDGRPVAVY AVPADFTVAS
APLRVPLGTG AVGLPHPSGR PAAIGEIAEI RADGRRTGDL GRWRADGVLE YTGLAGADPG
QDLAEAASAL RDVAEVRDAL VTEQVGDEGD AIVVAYLVGP DPDAGTSGIR RYLISRLPEW
LIPGALVVVG ALPLTAEGDH DVALLPRTDP GAATEVYVAP RTPMEQQLVD VMAALLAVDR
IGVHDTFFEL GGFSLLATRL TSRIRDLFDV ELSLRDVFEA PTVEGLAQLI LRAQSEAFGG
EDLEGLLAEI TAAD
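
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is illustrative rather than the database's own method: it builds the codon table from the standard compact amino-acid string, which to my knowledge NCBI translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). Names such as `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
print(translate("ATGGCCCACG TTTCGCAGGC TCACGACCAA GCCGGAGCGG CCGGCGCGGT ACCGCCGCGC"))
# prints "MAHVSQAHDQAGAAGAVPPR"
```

The output matches the first 20 residues of the protein sequence listed above.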