Gene Caci_5780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5780 
Symbol 
ID8337141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp6679079 
End bp6680140 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content70% 
IMG OID644958884 
Productprotein of unknown function DUF21 
Protein accessionYP_003116479 
Protein GI256394915 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACAGCG CCTGGGCCCT GGTCGTCTCC GCCCTTCTCC TGGCCGCCAA CGCCTTCTTC 
GTCGCCGCCG AATTCGCCCT CGTCACCAGC AAACGACACC GCCTGGAGGC GGCCGCCGCC
GAGGGCAGTC GCGCGGCGCG CGTCGCGGTG GCCGGCACGC GCGAGCTGTC CCTGATGCTG
GCGGGCACCC AACTGGGCAT CACGTTGTGC ACGCTGGGTT TGGGCGCCCT GGCCGAGCCC
GCGGTCGCGC ACCTGCTGGA CCCGGTGCTC TCCGCGACCG GGCTGCCCGA GGGCGTGTCG
TACGGGATCG CGTTCGCCGC GAGCCTGGCG CTGGTCGTGT TCCTGCACAT GGTCGTCGGC
GAGATGGCGC CGAAGTCCTG GTCGATCACC CACCCGGAAC GGTCCGCGGC CCTGGTCGCG
CTGCCGTTCC GGGCTTTCAC GCAGCTGGTG CGCTGGCCGC TGGTCGCCCT CAACGGCATG
ACCAACGGCC TGTTGCGCCT GCTGAAGGTG GAGCCGCAAA GCGAACTGGC CGAAGCGCAC
AGTCCCGAAG ACCTGCGGAT GCTGGTCCGA CAGTCCGCCG AGCACGGACT GATCCCCGCA
GTGCAGCAAA GGCTTCTGGC CCAAGCGCTG CGCCTACAGA ACACGCCGCT GTCCGAGGTC
ATGATCGCCT GGACGGACGC CGTCACCGTT CCCTGCGATT CCACCGCGAG CGCCGTGGAG
GACCTGAGCC GCGCCACCGG CCACTCCCGC TTCCCGGTCA CCGGCGCCGA CGGCAACCCG
GTCGGCCTGG TCCACGTCCG CGACGCGGTG CGCGCGACCA CCGCCGGTCT CGACCCGGAC
GTGTCCGACC TGCGCAGCAC CGCGCTGACG CTGCGCGCGG ACCAGACCGG AGCCGAGGCG
GTCAGCGTGA TGCGGCGGTA TCGCTCGCAA CTGGCTCTGG TGAAGAGCGG CGCCGGGGGC
GACGAGGGAG CTCAGGACAC TGATGCGGTG GTCGGTGTCG TGGCACTGGA GGATCTGTTG
GAAGAGCTCA TCGGCGAATT CCAGGACGAG ACAGATATCT GA
 
Protein sequence
MNSAWALVVS ALLLAANAFF VAAEFALVTS KRHRLEAAAA EGSRAARVAV AGTRELSLML 
AGTQLGITLC TLGLGALAEP AVAHLLDPVL SATGLPEGVS YGIAFAASLA LVVFLHMVVG
EMAPKSWSIT HPERSAALVA LPFRAFTQLV RWPLVALNGM TNGLLRLLKV EPQSELAEAH
SPEDLRMLVR QSAEHGLIPA VQQRLLAQAL RLQNTPLSEV MIAWTDAVTV PCDSTASAVE
DLSRATGHSR FPVTGADGNP VGLVHVRDAV RATTAGLDPD VSDLRSTALT LRADQTGAEA
VSVMRRYRSQ LALVKSGAGG DEGAQDTDAV VGVVALEDLL EELIGEFQDE TDI