Gene Caci_4474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4474 
Symbol 
ID8335828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5099153 
End bp5100862 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content68% 
IMG OID644957576 
Producthypothetical protein 
Protein accessionYP_003115178 
Protein GI256393614 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.805357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.305115 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGGCT ATTTTGGCAT GCCCGGAAAC GGGGAGGACC GGGGTTTCGG GCGGGTTATA 
CGCGCGGGTT TGGCTGTGAT TGCGGTCATG GCGTGTGTCA TGGCTCTGAT GTCGTGTCAT
CACGCGGCGG CGGTGTCCGG GGCGCAGAAC GGTGAGGTGT CTCGTGGCGG CAGCAGCACT
ACCAGCATCG ACACCAGTTC CAGTTCCAGT TCCAGCACCA GCAGCCCTGA TGGCGACAAC
TGCCCGGCGG GCAAGCTCAC GTCTGGATGG CTGCGTGCGG AGAACGCGCT CGGCGGGACG
ACGGCGTGGC AGGACGCGAA GCACACCGTG TCCGGGACGG TGAACGGCTA CCTGAACCAT
GCCAGCGCGC GGTGCGGGGA CACGGTCACC GCGTACCTGA GTGCGCCCGA GCGGGTGTCA
GGAGCCTCAT TGTCCGCCTA CCGGATGGGG TACTACGGCG GCGCGGGTGG GCGGCTGGTC
TGGGAGTCGC GGAATCTCGG CCTCGGGCCT CAGGTGAGCG CCACGGTGAG CGATCCGAGT
CTGCTGACCG AGGCGCCGTG GGCGGCGACG CTGACCTTCC CCGTCACCGG GCGGTGGGTC
CCCGGGTACT ACCTGCTGGT GGTGCGGGCG CCGGGACAGA CGGCGTCGTC GATTCCGCTG
GTGATTCGGG CCGACGGCGA TGACGCGCCG TTGGGCTACC AAGCCAGCGT GCTGACGTAT
CAGGCCTACA ACACCTTCGG CGGGCACTCG GCGTACGGCA ACCTCCCCGC CGCACCGAGC
ACGGAGCTGA GCTTCGACCG TCCGTATCAG GACGGCGGGT ACTACTCGGC GTACCAGTAC
GAGCTGCCGA TCGTCCGGGA GATCGAAAAG CTCGGCATCG ACACCGACTA CTTCACCGAC
GTCGACGCCG ATGCCGACCC CGCACAGCTG AAGCAGCACC ACGGCATCAT CATCGGCGGC
CACTCCGAGT ACTGGACCAA GCGCATGTAC GACGGCGCCC TCGCCGCACG CCAGGCCGGC
GTGAACATCG CGTTCTTCGG CGCCAACTCG GTCTACACCG CGGTCCGGCT GACCAGCTCG
CCGCTGGGCT CGGACCGCCG CATGGTCCTG CGCCGCACGG CGGCGGGCGA CCCGGTGGCC
GCCAAGGACC CGTCGCAGGC CACGGTGAAC TGGGCCGACC CACCGCTGAA CAGACCCGAG
GCCGCTCTGG TCGGCGAGGG GTACGGCTAC CTCGGCGCGA CCGGATCACT CCGCGTCCTC
CACGCCCAGT CCTGGATCTT CGCCGGCACC GGCGTGACAC CCGGCGAGGT CCTCCGCAAC
ACCATCGGCG GCGAGTACGA CCAAGTCGAC GTGAACAATC CGACGACGCC ACGCGACGTA
GACGTCCTGG CCGCAATGCC GCTACGCACG AAGAGCGGCG CGGCCGAGAT GGCCACGACG
ACGTACTACG TGGCGCCATC GCAAGCCGGC GTCTTCAACG CGGGCACGAC CTACTGGCCG
TGCGTGCTGA ACGGGGACTG CCTGCACCTA GCGCCGACCC CACCCGCAGC CCAGCAGGTG
ATCACACGCA TGACAGACAA CATCCTGACC ACCTTCGCCG CCCAACCCGC AGGCCGGCAA
CATCCCTCGA CGCCATCCTG GCCACCGACA CCGGAAGGGC TGGTCTCCTC GGCGAAGCAG
CCCGGGGACG TGTCGCAGCG CACCGATTGA
 
Protein sequence
MRGYFGMPGN GEDRGFGRVI RAGLAVIAVM ACVMALMSCH HAAAVSGAQN GEVSRGGSST 
TSIDTSSSSS SSTSSPDGDN CPAGKLTSGW LRAENALGGT TAWQDAKHTV SGTVNGYLNH
ASARCGDTVT AYLSAPERVS GASLSAYRMG YYGGAGGRLV WESRNLGLGP QVSATVSDPS
LLTEAPWAAT LTFPVTGRWV PGYYLLVVRA PGQTASSIPL VIRADGDDAP LGYQASVLTY
QAYNTFGGHS AYGNLPAAPS TELSFDRPYQ DGGYYSAYQY ELPIVREIEK LGIDTDYFTD
VDADADPAQL KQHHGIIIGG HSEYWTKRMY DGALAARQAG VNIAFFGANS VYTAVRLTSS
PLGSDRRMVL RRTAAGDPVA AKDPSQATVN WADPPLNRPE AALVGEGYGY LGATGSLRVL
HAQSWIFAGT GVTPGEVLRN TIGGEYDQVD VNNPTTPRDV DVLAAMPLRT KSGAAEMATT
TYYVAPSQAG VFNAGTTYWP CVLNGDCLHL APTPPAAQQV ITRMTDNILT TFAAQPAGRQ
HPSTPSWPPT PEGLVSSAKQ PGDVSQRTD