Gene Caci_3423 details

Gene Information

Locus tag: Caci_3423
Symbol:
ID: 8334776
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3782539
End bp: 3784335
Gene Length: 1797 bp
Protein Length: 598 aa
Translation table: 11
GC content: 71%
IMG OID: 644956567
Product: amino acid adenylation domain protein
Protein accession: YP_003114170
Protein GI: 256392606
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1020] Non-ribosomal peptide synthetase modules and related proteins
TIGRFAM ID: [TIGR01733] amino acid adenylation domain
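
The start and end coordinates, gene length, and protein length listed above agree with one another, which can be checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information section.
start_bp = 3782539
end_bp = 3784335

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that encodes no residue.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1797
print(protein_length)  # 598
```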


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACTGC TTGCCTCCTC CGCTGCCGAG CACGCCGACG CGGCTGGCAT GCCCGGTGCG 
GCGGGATGGT CGGTTGTCGA CCGGTTCGAG CGGCATGCGG CTGCGCATCC GTCGCAGACC
GCGCTGGTGT GCGATGGCGA GGTGGTGAGT TTCGGGGAGC TGGCGGCGCG GACGGGGGCG
ATCGCCGCGG GGTTGGTGGC GCGGGGTGTC GGCGCCGAGG ATCTGGTGGG GTTGTTGCTG
CCGCGCGGGG TCGATCTGGT GGCGGCGTTG GTGGGCGTGC TTCGGTGCGG TGCGGGGTAT
CTGCCGCTGG ATCCTGCGCT GCCGGATCAG CGGCTGCGGT ACATGGCCGA GCATGCGGCG
CCTGCGGCGG TCTTGTGTGA CCCCACGTTG CGGAAGCGGC TGCCTCCGAC GTTCGCGCGT
CGTACGTTGA CCGTTGCGAC GTGCCAGCGG ATCGGTGGCG ATCATCCGCC GTTTCGTGAG
CCGATCAGGG CCGGTCAGCG CGCCTACGCC ATCTTCACCT CCGGCTCGAC CGGGCTCCCC
AAAGCAGTCG AGGTCGCCCA TGCCTCGCTC GCCGCGCTTC TCGATGCCCT TACCTCGACG
GTGGCCGGGC CGCCTGGCGC CCGCGTCGCC TGGAACGCCG GCGCCTCCTT CGACGCCTCG
GTCCAGCAGT GGGTACGGCT TTGCCACGGC GACACGCTCG TATTGGTCAG CGAAGCCGTA
CGTGCCGACC CTCAAGCACT CGCGGCACTC CTGCGCACCG AGCACGTCAC AGACCTCGAC
ATCACTCCCT CGCACGCGAT CGCCCTGCTC GACCACCTCC CCACCGACCG ACCGCTCAGG
CTGCTCGTCG GCGGGGAAGC GGTGTCTGGC AGCTTGTGGG ACCGCCTTGC GGCATTGCGG
GCCGAGGGAG TCCTGGAGCC CTACAACCTG TACGGGCCAA CGGAATGCAC CGTCGATGCG
ACCGTCGCAG CCATCGAGGA CACGAGCGAT CCGCACCTCG GCGCACCGCT CCCAGGTGTC
CGTGTCACAG TGCTCGATGA CGCCCTTCAG CCGGTCCAGC CCGGCGATAT CGGCGAGATC
TACCTCGCCG GCTCGGGTGT CGCGCGCGGC TACCTCGGAC AACCGGCGCT GACCGCCCAG
CGATTCGTCG CAGACCCCGA CGTACCCGGC GCGCGCATGT ACCGCACCGG CGATCGCGGC
GCCATCACCG CCGACGGCCG CCTGGAATAC GCCGGACGCG CCGACGACCA GGTCAAGCTG
CGTGGCTTCC GGATCGAGCC CGGCGAGGTG GAAGCGGTAC TCGGCGGCTG CCCAGGCGTG
GCGCAGGCGG CGGTGGTGGT CCGCGACGAC GTTCCCGGCG GTCCCGCTCT CGTCGGCTAC
TGCCTGCCCT CAGCGGCGGC CTTCGATGCC GAGGCGATCC GCCGCGCCGC CGCTGCCCGC
CTGCCCGACT ACATGGTTCC GGCGCTGCTC GTAGCGATAG ATGGCTTTCC GTTGACCAGC
AACAGGAAAC TCGATCGCGC AGCTCTGCCG TCCCCGATCC CGACCCGCCC CACCGAGCAC
GCCTTCGAAC CGCCGCAGGG CGCCACCGAG GAACTGCTCG CCGCCGCATG GTGCGAAGTC
CTCGGGCTGG AGCGCGTCGG CGCCGGCGAG GACTTCTTCG AACTCGGCGG GCAGTCGCTG
CTGGCGATCC GGCTGGTCGC GAACGTGCGC CGACGTACCG GACGGCCGGT CCCGATGGTG
GCTGTGTTCG AACACCCGGT GCTGCGGGAC CTGGCCGCGT TCCTGGATTC CCGGTGA
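
The GC content given in the Gene Information section (71%) is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows. `gc_content` is a hypothetical helper name, and the sketch assumes that the whitespace used to group bases in blocks of ten should be ignored and that the percentage is taken over all remaining bases.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string."""
    # Drop spaces and line breaks used for display grouping.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First ten bases of the gene sequence above, as a small demonstration.
first_block = "ATGTCACTGC"
print(gc_content(first_block))  # 50.0
```

Applied to the full 1797 bp sequence, the result can be compared with the 71% listed in the record.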
 
Protein sequence
MSLLASSAAE HADAAGMPGA AGWSVVDRFE RHAAAHPSQT ALVCDGEVVS FGELAARTGA 
IAAGLVARGV GAEDLVGLLL PRGVDLVAAL VGVLRCGAGY LPLDPALPDQ RLRYMAEHAA
PAAVLCDPTL RKRLPPTFAR RTLTVATCQR IGGDHPPFRE PIRAGQRAYA IFTSGSTGLP
KAVEVAHASL AALLDALTST VAGPPGARVA WNAGASFDAS VQQWVRLCHG DTLVLVSEAV
RADPQALAAL LRTEHVTDLD ITPSHAIALL DHLPTDRPLR LLVGGEAVSG SLWDRLAALR
AEGVLEPYNL YGPTECTVDA TVAAIEDTSD PHLGAPLPGV RVTVLDDALQ PVQPGDIGEI
YLAGSGVARG YLGQPALTAQ RFVADPDVPG ARMYRTGDRG AITADGRLEY AGRADDQVKL
RGFRIEPGEV EAVLGGCPGV AQAAVVVRDD VPGGPALVGY CLPSAAAFDA EAIRRAAAAR
LPDYMVPALL VAIDGFPLTS NRKLDRAALP SPIPTRPTEH AFEPPQGATE ELLAAAWCEV
LGLERVGAGE DFFELGGQSL LAIRLVANVR RRTGRPVPMV AVFEHPVLRD LAAFLDSR
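
The protein sequence corresponds to the gene sequence translated with translation table 11. As a rough sketch, the snippet below builds a codon lookup from the NCBI standard layout (table 11 assigns the same amino acids as the standard table and differs in the permitted start codons) and translates the first 30 and last 12 bases of the gene sequence. The function name `translate` is illustrative, and alternative start codons are not handled.

```python
from itertools import product

# NCBI codon layout: bases ordered T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30 = "ATGTCACTGC TTGCCTCCTC CGCTGCCGAG"  # start of the gene sequence
last_12 = "GATTCCCGGTGA"                      # end of the gene sequence

print(translate(first_30))  # MSLLASSAAE
print(translate(last_12))   # DSR
```

Both results match the corresponding ends of the protein sequence listed above.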