Gene Caci_5150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_5150 
Symbol 
ID8336504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5914582 
End bp5916486 
Gene Length1905 bp 
Protein Length634 aa 
Translation table11 
GC content68% 
IMG OID644958248 
Producthypothetical protein 
Protein accessionYP_003115850 
Protein GI256394286 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0397106 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.0961186 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCCCG CCGAGGGAGG CGGTGGCGGC GACGGCTGGT GGCCGCTGAG CCCGAACGGC 
GATCCGGCCA AGGGCAACGC CGGGACGCTG TCCACGCATG TGACGGCGTA CACGAAGATC
GCCTCGCAAG CGGCGTCGGT GTCGAAGGCG TTGACCGGCC TGTGTGTCGA CGAATTCTGG
GACTCCGACT CCGGCACGGC GTTCAAGGTG CGCTGCGACA ACCTCGCCGC CGAACTCGCC
AAGATCGTCT CGCGGTATCA GACCGCTGCC GACGCGCTGA AGGCCTACAT CCCGGACCTG
GAGCACGCGC AGACGAACGC GGCCAGCGCC AAGAGTCTCG GCGACAGTGC GGAGTCCGAC
TTCGCCGCCA TCGGCTACAC CTACAACCCG AGCGGGGAGT TCGGCGCCAC CTTCACCGGC
GTCCGGGTGC CCTTCGCCCC GACCGTGCCC TCCGCCAAGC CGATGTCCCC GGTGGACCCG
CACTGGGGCA AGTACCAGAC GGCGGCTCAG GAGTGGAACA CCGCGGTGCG CCGGGTCGCC
GACGCCGCGC ACCTGCACGA CACCAGCGCG TCCGCCGCCG CCCGCAAGAT CAAGGCGGTC
GCCGAGGGCG ACGGTGTGTC CAACCAGCAC TGGACCGCCC TGGACGGGCA GCAGCGGGCC
TTCCTGTCGC AGCTGACCAA CCCCGCCACC GTCGAGTCCG CGGTCCTGAA CCTGCCGCTG
GTCAACCCCA ACGGGGAGCA GAACGGCGGC GTCAATCCGG AGCTGCCGAT CGATCCGTCG
GTGCTGGAAG CGGTCCTGGA CGACCTGCGG TCCAAGAACG TGGATCCCAA GCAGTACAAG
ACGCTGCTCG ACCAGTATTG GGTCGCCACC GCCGCGCAGC ACGCCGGCAT CGACCTGAAC
GAGTGGGACC CGGCGGCCGG CACCGGTCCG AACATGGCCA ACATCCAGGC TGTGTACACG
TACTACGGGA AGCTGTTCCT GGACCATCCC GAGCTCCAGT GGGCCGGTAT GGCGAACATG
ATCGGGCCCT CCTTCGCCGG CGGGTTCATG GACATCGACA TGTTCCGCAA GTTCGCGCAG
GACTTCGCGA CGAAGGTGGA CGGCCTGCCG GCGGCGGTGC GCGAGTCCCT GCCGCCGGAG
CTGCAGCAGC TGGCCGCAGC CGGCGGCCAG ATGTCGGCCA CCGAGCTGAA GTACTTCGAG
ACCAAGTTCC TGGCGATGCA GAAGCACATC TTCTTCGACC AGGCCGCCGC GCACGAGGCG
TATCTGGCCG GGCCGCCGAA CAACAAGACG GCCTACATCG ACGAGATGCA GAAGGCCGGG
CTGTTCCCCG GCGACGTCCC GCCGGACAAG ACCGCGAACG CCTGGCACGT CATCGCCAAC
CCGCACAGCA CGCCGCAGCA GGTCATGGAC GCGAACGGCA CGCTGCTCTA CCGCGAGCAG
AACGTCGTGA TCAAGGACCA GTACAACCAG ATGTACAACC ACGACGGACC GGTGGGCAAG
GCCTTCACGT ACATGATGAC CACGGTCGGC GCGGCGTCCA TCCCGGGCAC GCACACGCCC
GGGGAGTACC GGCCGATCAC CTTCGGCGGC GACGTCAGCG TCCCGGTGGT CGTCGGCAAG
GAGACGGTCG GGGTCCATGT GACGACACCG CTGCCGGACT TCAACATCAC CAACACGCAG
GACCGCTGGG ACTACGTCAC GCACGACACG CTCCCGGCGT ACCAGAAGCT GCTGCAGCAG
GATCCGAACC TGGCCCGGCA GATCGTCGCC ACGCCGGTGC AGGACCGGAT CGCGCAGCAG
CGGCTGTCCG CGCGCTGGCC GACCATCGCC GACGACCTGC TGGCCAAGGA CTGGGACATC
AAGCTGGACC ACCACTTCAC CCCCGGCCTT GACCTCCCCT GGTAG
 
Protein sequence
MRPAEGGGGG DGWWPLSPNG DPAKGNAGTL STHVTAYTKI ASQAASVSKA LTGLCVDEFW 
DSDSGTAFKV RCDNLAAELA KIVSRYQTAA DALKAYIPDL EHAQTNAASA KSLGDSAESD
FAAIGYTYNP SGEFGATFTG VRVPFAPTVP SAKPMSPVDP HWGKYQTAAQ EWNTAVRRVA
DAAHLHDTSA SAAARKIKAV AEGDGVSNQH WTALDGQQRA FLSQLTNPAT VESAVLNLPL
VNPNGEQNGG VNPELPIDPS VLEAVLDDLR SKNVDPKQYK TLLDQYWVAT AAQHAGIDLN
EWDPAAGTGP NMANIQAVYT YYGKLFLDHP ELQWAGMANM IGPSFAGGFM DIDMFRKFAQ
DFATKVDGLP AAVRESLPPE LQQLAAAGGQ MSATELKYFE TKFLAMQKHI FFDQAAAHEA
YLAGPPNNKT AYIDEMQKAG LFPGDVPPDK TANAWHVIAN PHSTPQQVMD ANGTLLYREQ
NVVIKDQYNQ MYNHDGPVGK AFTYMMTTVG AASIPGTHTP GEYRPITFGG DVSVPVVVGK
ETVGVHVTTP LPDFNITNTQ DRWDYVTHDT LPAYQKLLQQ DPNLARQIVA TPVQDRIAQQ
RLSARWPTIA DDLLAKDWDI KLDHHFTPGL DLPW