Gene Caci_3420 details

Gene Information

Locus tag: Caci_3420
Symbol:
ID: 8334773
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 3778491
End bp: 3779711
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 70%
IMG OID: 644956564
Product: cytochrome P450
Protein accession: YP_003114167
Protein GI: 256392603
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2124] Cytochrome P450
TIGRFAM ID:
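
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The Python sketch below is illustrative only and is not part of the database record. It assumes that the start and end coordinates are inclusive and that the reported gene length counts the stop codon. All variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3778491
end_bp = 3779711
gene_length_bp = 1221
protein_length_aa = 406

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the gene length includes one stop codon after the coding codons.
expected_gene_length = (protein_length_aa + 1) * 3

coords_consistent = span == gene_length_bp
codons_consistent = expected_gene_length == gene_length_bp
print(coords_consistent, codons_consistent)
```

Under these assumptions both checks agree with the reported 1221 bp and 406 aa.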


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00557634
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGACAG CCGTCGTCGA TCTCGACGAT CTGGACCTGT ACACCTCCGG CGACCCGCAC 
GCGGTGTGGG CCCGTCTGCG GCGCGAGGCG CCGGTGTACT TCAACGACAC CCCGAACGGC
GGCTACTGGG CACTGACCCG GTACGCCGAC GTGAACGCGG TGTATGTGGA CCCTGCGACG
TACTCCTCCA AGAACGGCAC GGTCCTCGGC GGCTCCTACC GCAGCGACCG TGACACCGCC
TCGGGACAGA TGCTGATCTG CTCGGACCCG CCGGCGCACC GGCAGCTGCG GCAGCACGTC
CACCAGGCGT TCGGGCAGCG GATGATGGAT GTCGCCGCCG GATACGTCGC TGATTACCTC
GGCGCCGCGC TGGACCGGAT GACCGCCGAC GGCGGCGGCG ACTTCGCCAC CGACATCGCG
CCCCAACTCC CCGCCGGGCT GCTGGCAGCG ATGTTCACCA TCGGCCACGC CGACGCCCTC
CACCTCCTGC GCCTGACCCG CACCATGATC GGCTTCCGCG ACCCCGAATA CACCGCGCCG
GAAGCCCCCG AGGCGATGAT CCTGGCCGGC GCCCAGGTGG AGATCTTCGA CTTCATGACC
GATCTCCTGG CCGCCCGCCG CCGCGAACCC GCCGACGACC TGATCAGCAT CCTGCTGGCA
GCCCGCACCA ACGGCCGCCC CCTGACGGAC AGCCAGATCC TCTACAACGC CTTGAACGTG
GCGGTCGGCG GCGACGAGAC CACTCCCTTT ACCGCCTCGG CGATCGTGGA GACGCTGATG
GCGCACGAGC GGGAGGCCTC CCGCTTGCAC GCCGATCACG CGCTGCTCGG CACGGCGGTC
GACGAGTTCT TCCGCTGGAC GTCCACGAAC GCCTACGTCT GCCGCACGAC CACGCGCGAG
GTCGAGATCC GCGGCGTCCC GATCCCGGCG GGGGCGACGC TGACGTTGTG GAACGCCTCG
GCGAACCGCG ACGAGGACGA GTTCCCGCAC GCCGACCGGC TGGACGTCGG GCGCACGCCG
AACCACCACC TGGCGTTCGG CGTCGCCAAC CACCGCTGCG TCGGGATGCC GGCGGCGCGG
ATGGAGATCA CGCTGCTGGT GCGCGAGTTC CTGCGGCGCG GGCTGCGGTT CGCCCCGGCC
GGACCGGTGG AGCGGCTGCG GTCGAACTTC ATGCTGGGGA TTCGGCATCT GCCGGTGACG
GTCTCTCAAT CAGAGGTCTG A
 
Protein sequence
MSTAVVDLDD LDLYTSGDPH AVWARLRREA PVYFNDTPNG GYWALTRYAD VNAVYVDPAT 
YSSKNGTVLG GSYRSDRDTA SGQMLICSDP PAHRQLRQHV HQAFGQRMMD VAAGYVADYL
GAALDRMTAD GGGDFATDIA PQLPAGLLAA MFTIGHADAL HLLRLTRTMI GFRDPEYTAP
EAPEAMILAG AQVEIFDFMT DLLAARRREP ADDLISILLA ARTNGRPLTD SQILYNALNV
AVGGDETTPF TASAIVETLM AHEREASRLH ADHALLGTAV DEFFRWTSTN AYVCRTTTRE
VEIRGVPIPA GATLTLWNAS ANRDEDEFPH ADRLDVGRTP NHHLAFGVAN HRCVGMPAAR
MEITLLVREF LRRGLRFAPA GPVERLRSNF MLGIRHLPVT VSQSEV