Gene Caci_3439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3439 
Symbol 
ID8334792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3809505 
End bp3812342 
Gene Length2838 bp 
Protein Length945 aa 
Translation table11 
GC content73% 
IMG OID644956583 
Productamino acid adenylation domain protein 
Protein accessionYP_003114186 
Protein GI256392622 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID[TIGR01733] amino acid adenylation domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.571319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.459804 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGAGAC AGAACCAGAC CCCTTACGAC CTCGTCCGCG CCGTGGCGCG CGCCGACGGC 
GACCGCGCCG CGCTTGTCGA CCCGGCCGGC GCCGGGGCGC AGCTGTCGTA CGCGGAGCTG
ATCGCCCGCA CCGAGACGCT GGCGCGCCGG CTGCGGGAGC ACGGCGTGGG CGCCGAGCAG
CCGGTCGCCG TCGTGCTGGA GCGCGGGGCG GACACTGTGG TGGCGATGCT GGCGGTGCTC
GCCGCCGGCG GCGTCTACTG CCCGCTGGAC GTCTCGGCGC CGGACGCGCG GCTGACCGCG
GTCGTGGAAC TGCTCGGGGC GAAGGTCGCG CTGACCGACG CGGCGCACGC CGGCCGGCTG
CCCGCCGGGG TGGACGCGCT GCGGCTGGAG GCCGTATCAG CAGCCTCAGT TACTGCCTCA
GGCGCCGCCT CAGACTCAGG CACCGCCTTG GACTCAGCCT CGGGCTCAAC CCCAGCCTCA
GCCGCGGACT CAGGCACCGC CTCGGACTCG AACTCAGCCT CAGCCTCGGA CTCAGGCACC
GGCTCAGCCT CAGCCTCGAA CTCGGCCTCG GACACCGGAG CCCTCGACCC CGACTCAGCG
TTCGAGCCCG CCGCACCCAC CCCGGACTCC CTCGCCTACG TCCTCTACAC CTCCGGCTCC
ACCGGCGTCC CCAAGGGCGT GGCGATGACC CACCGCGGTC TGTCCCGCCT CATCAGCTGG
CAGACCGCCT CCGGCGCTCC GGGACTGCGC ACGCTCCAGT TCACCGCGAC CTCCTTCGAC
GTCACCTTCC AGGAAGTGCT CTCCACCCTG GCCACCGGCG GCTGCCTCGT CGTCGCCGGC
GAGCAGGTCC GGCGCGATCC GGCGCTGCTG CTGGAGACGA TCGTCAAGGA GCGGATCCAG
CGCCTCTTCC TGCCCTACGT CGCCCTGCAG CTCATGGCGG TCACCGCCGC GCGGCTGGCG
ATCGTCCCGG AGAGCCTGGA GCACGTCGTC ACCGCCGGCG AGCGCCTGGT CGTCACCCCC
GCGATCCGCG ACCTGTTCAC CACGCTGCCG CACTGCCGTC TGGACAACCA CTACGGACCC
ACCGAAGCGC ACCTGGTCAC CAGCCAGACC CTGCCGCAGG ACCGCGCCGC CTGGCCGGAC
GTCCCCGGCA TCGGCGCCCC GGTCGCCGGC GTCGCCTGCC ATGTCCTCGA CGAGCGGCTG
CACGCAGTGC CCGATGGCGA GGTCGGCGAA CTCTACGTCT CAGGACCCTG TCTGGCGCGC
GGCTACCTCG CCGACCCGGC ACGCACCGCT GAGCGCTTCG TTGCCGACCC CGGCGCCGAC
GCACCGGGGG AGCGTTGGTA CCGGACCGGT GACCTGGTGC GCCGCATCGC CGATGGGACC
TATGAATTCC TCGGCCGCGC CGACGGTCAG CTGAAGGTGC GCGGGTTCCG CGTCGAGCCC
GGCGAGGTCG AGACCGCGCT GACGAGCCAC CCGCGGGTCC AGGCCGCCGC CGTCGGACTG
CGGCAGATCG AGGACGGGAT CTCGATCCTG GTCGGATACC TCCAGACCGA CGGCGCCGTC
TCGCAGCGGG AGATCGGCGA TCATGCCAGG GCACTGCTAC CCGCCTACAT GGTGCCCTCG
CGCTACCTGA CCGTGCCCGC ACTGCCGCGC ACCGGAACCG GCAAGGTCGA CACCCGGGCG
CTCGCCGAGA TCGCGCTGCC CGACGCCGCC GATTCCTCCG ACGCCGCCGA CGCTGCCGAC
CCCGACCAGC TCCCCTTGTC CGACCTCATC ACCGCTCTAT GGATCCGCGT CCTCGGCCAC
GACGAATTCG ACCCCGACGA CGACTTCTTC GACGTCGGCG GCGACTCCCT GCTCGCCACC
TGGGTAGCCG CCGAGCTGGG GCAGATGCTC GGCCGCCCGG TCGAGTTGTC GCTGTTCCTG
GAATACAGCA CCGTCGAAGA CCTCGCCGAA GCTCTCGGCT CGCAGGGCTC TGCGACCGCG
ACAGGAGCGG CACTGATCGC GGGATCCTCA CGCAGCTCAG CTTCGCAGAT CGTCACCCTG
CGCCCCGGAC CGTCCGGCCG CAGTCTGTAC CTGTTCCACC CGCTCGGCGG CGAGCTGATC
TGCTACCGCG AGCTGGCCCG CGCCAGCCGT GCCCCGGTCC GCGTCCTCGG CGTCGGCTGG
AGTGGCGCGC CGCCGGAGTA CGGCGCCACG CTGGAGGACA TCGCCCGCGT GCACGTCGAG
CAGCTGCTGG TCATCCAGCC CGACGCGCCG TTTCTATTGG CCGGGTGGTC CTTCGGCGGC
GTGCTCGCCT TCGAAGTCGC CCGGCAGCTC ACCGCGGCCG GCGCGAGCGT GGACTTCCTC
GGGCTGATCG ACGCCAACCC GGTGATCGAC CCCATCACCG GGCTGCCGCT GGCGGACACG
CCGTTCCTGG GCGTGCTGGA CGAGGTGGTG ACGCTGCTCG ACGCACCCGG GACCACCTCC
GCCGATCTCA CGGCTCTGAC ATCCGGCGAC ACCTGGCTCC AGCTCATGGG CGCGCCGATC
GCCCCCGGCG CCTCAAGTAC GTATCTGCGG ACCGCGCTGG ACACCGCCCG AGCGTGTATG
TGGGCTGCGA TGCGCTACCA GGCGCGCCGC CACGACGGCC CGATCGACGT GTTCCAGGCC
TCCGGGTCCG GGGCGGATCG GCAGGAAGCG CTGGCCGGGG CGATCCGCAG CCTGGCCGGC
GGCGCGTTCC GGACGGTCGC CGTCCCCGGC GGCCACTGGG CGTGCATCAG GGCGGAGGAC
GGGGCCGAGA CGGCCAGGGC ACTGGATGCC GCGCTCGAGC GCGTCGGCGC GGCGGGGAGT
GGGACGCATG GATCTTGA
 
Protein sequence
MVRQNQTPYD LVRAVARADG DRAALVDPAG AGAQLSYAEL IARTETLARR LREHGVGAEQ 
PVAVVLERGA DTVVAMLAVL AAGGVYCPLD VSAPDARLTA VVELLGAKVA LTDAAHAGRL
PAGVDALRLE AVSAASVTAS GAASDSGTAL DSASGSTPAS AADSGTASDS NSASASDSGT
GSASASNSAS DTGALDPDSA FEPAAPTPDS LAYVLYTSGS TGVPKGVAMT HRGLSRLISW
QTASGAPGLR TLQFTATSFD VTFQEVLSTL ATGGCLVVAG EQVRRDPALL LETIVKERIQ
RLFLPYVALQ LMAVTAARLA IVPESLEHVV TAGERLVVTP AIRDLFTTLP HCRLDNHYGP
TEAHLVTSQT LPQDRAAWPD VPGIGAPVAG VACHVLDERL HAVPDGEVGE LYVSGPCLAR
GYLADPARTA ERFVADPGAD APGERWYRTG DLVRRIADGT YEFLGRADGQ LKVRGFRVEP
GEVETALTSH PRVQAAAVGL RQIEDGISIL VGYLQTDGAV SQREIGDHAR ALLPAYMVPS
RYLTVPALPR TGTGKVDTRA LAEIALPDAA DSSDAADAAD PDQLPLSDLI TALWIRVLGH
DEFDPDDDFF DVGGDSLLAT WVAAELGQML GRPVELSLFL EYSTVEDLAE ALGSQGSATA
TGAALIAGSS RSSASQIVTL RPGPSGRSLY LFHPLGGELI CYRELARASR APVRVLGVGW
SGAPPEYGAT LEDIARVHVE QLLVIQPDAP FLLAGWSFGG VLAFEVARQL TAAGASVDFL
GLIDANPVID PITGLPLADT PFLGVLDEVV TLLDAPGTTS ADLTALTSGD TWLQLMGAPI
APGASSTYLR TALDTARACM WAAMRYQARR HDGPIDVFQA SGSGADRQEA LAGAIRSLAG
GAFRTVAVPG GHWACIRAED GAETARALDA ALERVGAAGS GTHGS