Gene Caci_2226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_2226 
Symbol 
ID8333575 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp2526552 
End bp2527712 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content70% 
IMG OID644955380 
Productacyl-CoA dehydrogenase domain protein 
Protein accessionYP_003112986 
Protein GI256391422 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.190658 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0000495631 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCTTCG TGGAGCCCGA GGAGCGCCTG GCGCTCCGCC AGGCGGTGCG CGACCTGGCC 
AAGCGGTACG GACCGGACTA CGTGCGCAAG CAGGCCAAGG CCGGCCTGAA GTCGACCGAG
CTGTGGGCCG AGGTGGGCCG GGCCGGCTAC CTGGGCGTGA GCCTGCCGGT GGAGTACGGC
GGCGGGGGCG GCGGCATCGC CGACCTGGCG GCGGTCGGCG AGGAACTTGC CCGGGCCGGC
TGCCCGCTGC TGCTGATCGT GGTCTCCCCC GCCATCTGCG GCACCATCAT CGGCCGCTTC
GGCACCGAGG AACAGAAGCA GAAGTGGCTC CCGGGCATCT CCGACGGCTC CTTCAAGATG
GCCTTCGGCA TCACCGAGCC CGACGCCGGC TCCAACTCCC ACAAGATCAC CAGCACGGTG
CGCCGCGATG GCGACGAGTG GGTTCTTTCC GGTCAGAAGG TCTTCATCTC CGGCGTCGAC
GAGTCGGACG CGGTCCTGGT GGTGTCCCGC ACTAAGGACG CGGCCACCGG CGACCTGAAG
CCGGCGCTCA TGGTGGTCCC CACTGACGCG CCGGGCTTCA CCAAGACCAT GATCGAGATG
GAGATCGCCG GCCCGGAGAA GCAGTTCCAG CTCTTCTTCG ACGACGTCCG TCTCCCGGCC
GACGCGCTGA TCGGGGACGA GGACGCCGGG CTGCTCCAGC TGTTCGCGGG CCTGAACCCC
GAGCGCATCA TGGCCGCCGC CTTCGCCATC GGCACCGCGA AGTACGCCCT GGACAAGGCC
GCCGAGTACG CCAAGACCCG GACGGTCTGG CGCGACCGCC CCATCGGCAC CCACCAGGGC
GTCGCGCATC CCCTCGCCCA GGCCGCGATC CACGTCGAGC TGGCCAAGGT GATGACGCAG
AAGGCTGCCT GGCTCTACGA CAACGGCGAC GACTTCGGCG CCGGCGAAGC CGCGAACATG
GCCAAGTTCG CCGCCGCCGA CGCCGCGGTG GAGGCCGTCG ACCAGGCGAT CCAGACACAC
GGCGGCAACG GCCTGGCGAC CGAGTACGGC TTGGCGCCGC TGCTCGCGGC GACCCGCGTG
ACGCGGATCG CGCCGGTCAG CCGGGAGATG ATCCTGAACT TCGTCGCGCA GCACACGCTG
GGGCTTCCGA AGTCGTACTA G
 
Protein sequence
MSFVEPEERL ALRQAVRDLA KRYGPDYVRK QAKAGLKSTE LWAEVGRAGY LGVSLPVEYG 
GGGGGIADLA AVGEELARAG CPLLLIVVSP AICGTIIGRF GTEEQKQKWL PGISDGSFKM
AFGITEPDAG SNSHKITSTV RRDGDEWVLS GQKVFISGVD ESDAVLVVSR TKDAATGDLK
PALMVVPTDA PGFTKTMIEM EIAGPEKQFQ LFFDDVRLPA DALIGDEDAG LLQLFAGLNP
ERIMAAAFAI GTAKYALDKA AEYAKTRTVW RDRPIGTHQG VAHPLAQAAI HVELAKVMTQ
KAAWLYDNGD DFGAGEAANM AKFAAADAAV EAVDQAIQTH GGNGLATEYG LAPLLAATRV
TRIAPVSREM ILNFVAQHTL GLPKSY