Gene Caci_4201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4201 
Symbol 
ID8335555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp4757608 
End bp4759551 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content72% 
IMG OID644957304 
Productprotein of unknown function DUF181 
Protein accessionYP_003114906 
Protein GI256393342 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGACCC TGACTCCCGT CGAGGAGCTG ACCCGGCTGC TCGGCGAGTA CGCGGCGCGC 
GACGGTCGCG GGCCCGATAA GGTCCCGACG GTCGTCGAAC TCGGCGGCGA GGACCTGCTG
GCCTTTGAGC GCTCGCCGCT GCGCGAGGCC GGTGAGGCGC GTCCGGATCT GCGCGACGCG
GACATCTTCC TGACCTCGCG GGCCGTTCTC ATCGGCCCGA GAAGGTCGGC GCGATCGGAG
CGGTCGGAGC TGTCGGCGCG GTCGGCGCGG TCGCAGAGGT CTGAGCGGGG ATGCGGGCAC
TGTCTGGCGT TGCGACTGCA GCGCATGCAG CCGGTGCATG AGCGCGACCT GCTCGAAACC
AGCCCCGGCG CGCACCGGGC CGGCGTGTGG CCGGACGTCG GCGACCAGCT CGCCGCCGCC
GCGTGGGCCG CTTTCCGGCA CGCCGAGCTG GTCCCGGCCG GCGACCTGCC GAAGGTCTGG
CGTATCGACA TCCGAACCCT GATGATGACG GCCGCGGCGC TGCTGCCTGA CTCGCGCTGC
CCCAGCTGCG GGGACCTCGA CCCCGAGCCG CCGGACCTCG ATCTCGGCGC ACCGCGCAAG
CCGAGCGCCG AGGCGTACCG ACTGCGCCAT CCCGAGGACT TCGCAGTCCC GGTCGAAGCG
CTGGTCAACA CGGTCTGCGG AGTGCTCGCA CCGGCGACGT CGGCCGCGCT GGCCTCGCCG
ACGACCGCCC CGGTCGCGGG CTCGGCGGCG GTGCGCGGAC CGACCGGGCT GCACGACCTG
AGCTGGAGCG GTCAGGCGAA CTCCTACGCG GCGAGCGCCC GCCTGGCGGT GTTCGAGGGC
TTGGAACGGC ACGCCGGGAC GATGCGCAGG CGGCCCGGGC TGGTCGTCGG CAGCTACACC
GAACTCGCCG ACCGAGCTCT GAATCCGAGT GACTGCACGG CTTACAGCGC CGATTCCTAC
CTCCAGGACG AACACCTCAC GCCGTTCGAC CCGGACCGGA CGATCCCGTG GGTCCCCGGC
TACTCGCTGC GCGACAAGCG GTCCGTGCTG GTCCCGGTGC GCCTGGCGTT CTACGGCTGG
GACGGCGGCG CGGAGCTGTT CTCCTTCGAG TGCTCCAACG GCTGCGCGAG CGGCGGATGC
CTGACCGAGG CGGTCCTGTT CGGTCTGCTG GAGCTGATCG AGCGCGACGC CTTCCTGCTC
GCCTGGTACG GCGGCGCGCT GCTGCCCGAG ATCGACACCG GCTCGCTGGA CCGCACGGCG
CGCGCGATGC TGAGCCGGGC TCGGCTTCAG GGCTACGACG TCCGCCTGTT CGACAACCGG
ATCGACCTGC CGGTACCGGT CGTCACCGGC GTGGCGCGCC GGCGGGACGG CGGCGACGGG
CTGTTCTCCT TCGCCGCCGG GGCGGGCATG GACCCGGCGG CGGCGGTGGA GGCCGCCCTC
GGCGAGATCC TGACCTACAT CCCCTCGATG CGGCATCGGG TGCGTGCGCG CCGTGAGGAG
CTGGTCGCGA TGACGCGGGA CTTCTCGCTG GTTAGCGGGC TGGCCGATCA TCCGGCGCTG
TTCGGGCTGC CGGAGATGCA GGAGCACGCC TGGCGTTACA CGCGCCACGC CGATCCGATG
CCTGTCGCAG AGCTCTACCA CGAGTGGCTG CGGGTCCGTC CGGCGACCGA CGATTTGGCC
GACGACGTCC GCTTCCTGGT GGACGAGGTG GCACGCCGGG GATCGGACGT GATCGTGGTC
GATCAGACCA GCGCCGAGCA GCAGGCAGCC GGTCTGTCGG GAGTCCGGGT CATCGCGCCG
GGACTGCTGC CGATCGACTT CGGCTGGGGG CGGCAGCGCG CGCTGCGGGC TCCGCGGATG
TTCTCGGCGC TGCGCCACGC GGGTCTGCGC GAGAGCGACC TGACCGCTGA GGAGCTGCAC
ATGGTGCCGC ATCCCTTTCC GTGA
 
Protein sequence
MRTLTPVEEL TRLLGEYAAR DGRGPDKVPT VVELGGEDLL AFERSPLREA GEARPDLRDA 
DIFLTSRAVL IGPRRSARSE RSELSARSAR SQRSERGCGH CLALRLQRMQ PVHERDLLET
SPGAHRAGVW PDVGDQLAAA AWAAFRHAEL VPAGDLPKVW RIDIRTLMMT AAALLPDSRC
PSCGDLDPEP PDLDLGAPRK PSAEAYRLRH PEDFAVPVEA LVNTVCGVLA PATSAALASP
TTAPVAGSAA VRGPTGLHDL SWSGQANSYA ASARLAVFEG LERHAGTMRR RPGLVVGSYT
ELADRALNPS DCTAYSADSY LQDEHLTPFD PDRTIPWVPG YSLRDKRSVL VPVRLAFYGW
DGGAELFSFE CSNGCASGGC LTEAVLFGLL ELIERDAFLL AWYGGALLPE IDTGSLDRTA
RAMLSRARLQ GYDVRLFDNR IDLPVPVVTG VARRRDGGDG LFSFAAGAGM DPAAAVEAAL
GEILTYIPSM RHRVRARREE LVAMTRDFSL VSGLADHPAL FGLPEMQEHA WRYTRHADPM
PVAELYHEWL RVRPATDDLA DDVRFLVDEV ARRGSDVIVV DQTSAEQQAA GLSGVRVIAP
GLLPIDFGWG RQRALRAPRM FSALRHAGLR ESDLTAEELH MVPHPFP