Gene Caci_3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_3103 
Symbol 
ID8334455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp3410568 
End bp3412220 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content69% 
IMG OID644956250 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003113853 
Protein GI256392289 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.8985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGCA ACGGCGGCGC TGAGCGTCAC GATGAGGCCC ATCGCAAAGG TCCCGACTCC 
CCGGCCCTGC CCGAGGAGAC CATCGGCGGG AACCTGGCAA GGATCGCGGC GGCGTACCCC
GACCGCGACG CGTTGATCGA ATGCGCGACC GGACGGCGCT GGACCTACGC GCAGTTCGAC
GCGGCGGCGG CGGATCTGGC GCGCGGAATG CTCGCCGCCG GCGTCGCCAA GGGCGACCGC
GTGGGCATCT GGTCCCCCAA TTGCGCCGAA TGGATCCTGG TCCAATACGC GGCCGCGAAG
GTCGGCGCGA TCCTGGTGAA CCTCAACCCG GCCTACCGGG ACCACGAGAT CCGCTTCACC
CTCAAGCAGT CCGGCACCAG CCTGCTGTTC GCCGCCACCC AGGTGAAGAC CAGCGATTAC
GTCGCGATGG TGAGCGCCGT CCGCGAGGAC TGCCCGGACC TGCACGACGT GGTCTTCATC
GGCACGCCCG GCTGGACCGC CTTCGTCGAA CGCGGCGCGA CGGTCCCCGC GTCAACACTC
GCCGAACGCG AAAGCCGCCT GCACCCCGAC GACCCGATGG ACATCCAGTA CACATCGGGT
ACTACGGGAT TCCCTAAGGG CGCCACACTG TCGCATCGCA ACGTCCTGGG GAACGGCTAC
ATGGTCGCCG AAGTCCAGGG CTGGACCCAT GAGGACCGCG TCTGCCTCCC GGTACCGCTC
TACCACTGCT TCGGCATGGT GATGGGCAAC CTCGGAGCCA CCAGCCACGG CTCCTGCATG
GTCCTGCCCG GACCGCTGTT CGACCCCGCC GACACACTGC GCGCCGTCTC TGAGGAACAC
TGCACGGTTC TCTATGGGGT TCCCACCATG TTCATCGCAG AGTTGGCCCT CCTAGAGAAA
ACCCCAGACA CCTACGACCT CAGCTCCCTG CGCACCGGCG TCATGGCCGG CTCGCCCTGC
CCGGTCGAGG TGATGAAGCG GGTCATCGGC GAGATGGGCA TGGCGGACGT GACCATCGCC
TACGGCATGA CCGAGACCTC CCCGGTCTCC ACGCAGACCC GCCGCGACGA CAGCCTGGAG
CGCCGCGTCG CCACCGTCGG CCGGGTCCAC CCGCACGTCG AGATCAAGAT CGTCGACCCC
GACACGGGCG CCACGCTCGG CGCCGACGAG CCCGGCGAGC TGTGCACCCG CGGCTACAGC
GTCATGCTCG GCTACTGGGA CGAGCCGCAG CGCACCGCCG AGGCGGTCGA CGGCGACGGC
TGGATGCACA CCGGCGACCT CGCGCAGATG GACGCCGACG GCTACGTCGC CATCGTCGGC
CGCATCAAGG ACATGGTGAT CCGCGGCGGG GAGAACGTGT ACCCGCGCGA GGTGGAGGAG
TTCCTGTACT CCCACCCCGA CGTGGAGGAC GTCCAGGTGA TCGGCGTCCC CGACCAGAAG
TACGGCGAGG AGCTGATGGC GTGGGTCCGG CTGCGGCCCG GCGCGCAGCC GCTGACCCCC
GAGGCCGTGC GAACCTTCTG CGAAGGACGC CTGGCGCACT ACAAGATCCC GCGCTACGTG
CACATCGTGG ACGGGTTCCC CATGACGGTC ACCGGCAAGG TGCGCAAGGT GGAGATGCGG
GAGCAGGCGA TGGAGATCCT CGGGCTGCGG TGA
 
Protein sequence
MSGNGGAERH DEAHRKGPDS PALPEETIGG NLARIAAAYP DRDALIECAT GRRWTYAQFD 
AAAADLARGM LAAGVAKGDR VGIWSPNCAE WILVQYAAAK VGAILVNLNP AYRDHEIRFT
LKQSGTSLLF AATQVKTSDY VAMVSAVRED CPDLHDVVFI GTPGWTAFVE RGATVPASTL
AERESRLHPD DPMDIQYTSG TTGFPKGATL SHRNVLGNGY MVAEVQGWTH EDRVCLPVPL
YHCFGMVMGN LGATSHGSCM VLPGPLFDPA DTLRAVSEEH CTVLYGVPTM FIAELALLEK
TPDTYDLSSL RTGVMAGSPC PVEVMKRVIG EMGMADVTIA YGMTETSPVS TQTRRDDSLE
RRVATVGRVH PHVEIKIVDP DTGATLGADE PGELCTRGYS VMLGYWDEPQ RTAEAVDGDG
WMHTGDLAQM DADGYVAIVG RIKDMVIRGG ENVYPREVEE FLYSHPDVED VQVIGVPDQK
YGEELMAWVR LRPGAQPLTP EAVRTFCEGR LAHYKIPRYV HIVDGFPMTV TGKVRKVEMR
EQAMEILGLR