Gene Caci_7047 details

Gene Information

Locus tag: Caci_7047
Symbol:
ID: 8338414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 8190637
End bp: 8191914
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 71%
IMG OID: 644960128
Product: CBS domain containing protein
Protein accession: YP_003117718
Protein GI: 256396154
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
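
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the terminal stop codon; the record itself does not state either convention, and the function names are illustrative only.

```python
# Hypothetical helpers for sanity-checking a gene record.
# Assumption: coordinates are 1-based and inclusive.
# Assumption: the gene length counts the stop codon.

def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature covering start_bp..end_bp inclusive."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS of this length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon and encodes no amino acid.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(8190637, 8191914)
print(gene_length)                           # 1278
print(expected_protein_length(gene_length))  # 425
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields listed in the record.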


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTCC TGCTGTTCTG GGAGATCGTC GCCGTCGTGG TCCTGGTCGC GGTCGCGGGG 
TTCTCCGCCT GCGCCGACGC CGCGCTGTCC CGGGTGTCCC GGGTGCGGGC CGCCGAGCTC
GTCGCCGACG GCGTGCCGCG CGCGGCCCGG CTCAAGCAGC TGGTCGACGA CCCGGCCCGG
GTGCTGAACC TGGTCCTTTT ACTGCGCGTG GCCTGCGAGA TCGCGGCGAC CGTGCTGGTC
ACCGTGCTGT TCATGCGCCG GATCGACGGC GCCTGGGGCG CCGGGTTCGC CGCGATCGGC
GTGATGATCG CGGTGTCCTA CATCTTCATC GGCGTGATGC CGCGCACCAT CGGCCGGCAG
CACTCGGTGC GGGTGGCGCT GCACTTCGCC GGCCCGCTGG CGCTGCTGAC CACGGTCCTG
GGCCCGCTGG CGCAGCTGCT GATCGCGGTC GGCAACGCGG TGACGCCCGG CCGGGGCTTC
CGCGAGGGCC CGTTCGCCTC CGAGGCCGAG CTGCGCGCCC TGGTGGACCT CGCCGAGGCG
AACAGCGTCA TCGAGGACCA GGAGCGCCGC ATGGTGCACT CGGTCTTCGA GCTCGGCGAC
ACGCTGGTGC GCGAGGTGAT GGTCCCGCGC ACCGACATGG TGTTCATCGA GCGGCACAAG
ACCCTGCGCC AGGCGCTGTC GCTGGCACTG CGCAGCGGCT TCTCCCGCAT CCCGGTGGTC
GGCGAGAACG CCGACGACGT GGTGGGCATC GTGTATCTCA AGGACCTGGT CCGGCGGATC
CACGAGCATC CGAGCGGGGA GACCACCGAG CTGGTCGAGT CCGCGATGCG CGACCCGGTC
TGCATCCCGG ACAGCAAGCC CGCCGACGAG CTCCTGCGCG ACATGCAGGC CGGCCACATC
CACCTGGCCG TGGTGATCGA CGAGTACGGC GGCACCGCCG GACTGGTCAC CATCGAGGAC
ATCCTGGAGG AGATCGTCGG GGAGATCGCC GACGAGTACG ACGTGGAGCG CCCCTCGGTC
GAGCACCTGT CCCCGGACGC GGCCCGCGTC ACCGCGCGCC TCGGCGTGGA CGAGCTCGGC
GACCTGTTCG GCGTGGACCT GGAGGACGAC GACGTGGAGA CGGTCGGCGG CCTGATGGCC
AAACGGCTCG GCCGCGTCCC GATCCCCGGA GCCCAGATCG AAGTGGAGGG ACTACGCCTC
ACCGCCGAGT CCCCAGAGGG CCGCCGACGC CGGATCGGGA CCGTGTTGGT GAAGCGCGTT
CCGCAGGACG ACGCCTGA
 
Protein sequence
MSVLLFWEIV AVVVLVAVAG FSACADAALS RVSRVRAAEL VADGVPRAAR LKQLVDDPAR 
VLNLVLLLRV ACEIAATVLV TVLFMRRIDG AWGAGFAAIG VMIAVSYIFI GVMPRTIGRQ
HSVRVALHFA GPLALLTTVL GPLAQLLIAV GNAVTPGRGF REGPFASEAE LRALVDLAEA
NSVIEDQERR MVHSVFELGD TLVREVMVPR TDMVFIERHK TLRQALSLAL RSGFSRIPVV
GENADDVVGI VYLKDLVRRI HEHPSGETTE LVESAMRDPV CIPDSKPADE LLRDMQAGHI
HLAVVIDEYG GTAGLVTIED ILEEIVGEIA DEYDVERPSV EHLSPDAARV TARLGVDELG
DLFGVDLEDD DVETVGGLMA KRLGRVPIPG AQIEVEGLRL TAESPEGRRR RIGTVLVKRV
PQDDA