Gene Caci_1410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_1410 
Symbol 
ID8332749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1603255 
End bp1604541 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content62% 
IMG OID644954558 
Productsigma-70 region 4 domain-containing protein 
Protein accessionYP_003112174 
Protein GI256390610 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000424388 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0278593 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTAT GGAAGAGTCT CTCGCCACGG GACCAAGCCT TGCACTCGCG TCGGGCCTCT 
GGCGAGACTC TGGATTCGAT TGGACAGTCC TATGGGCTTT CGCGCGAGCG AATCCGCCAG
CTGATCAGCC GCGGCGAGAA GTCTTGGGTC GACCAGGTCG ACGCGCTTCG ACCAGGGTGG
CGTGCGAGCG TTAGACAGAG GTTCGCGGTG AGGCCTGTCG TCGCCGAATC CGACTTCGCC
GATATTCTTC CGGACTCGGT CGGCATCGTA CGGGGGACGC TACTCCGCGC TGCGGGCGCT
GACCGCGCCC GGACGTGGGC TGGACCTATC GATGGGATTT GGTCGATGGA CCCAAGTGGT
CTCGCCGCAC AACTACGTGA CTTGATCGCG TTGGCTCCGT TCACAGACGA CGACTTGGAC
ACCGCCGCGG CGAATCTCGA GTTTCCAGAG AACACTCCGT TGCGAGCCAT CCTCACCCAT
TCGCGCAGCC CTCTGGTGCG GGGCCCTCAC GATTACTGGC TGCGACGTAA CGCTAGAGCG
CGAGACGCCA GCTACCTCTG GCTGCTTTCT GAGGGCGAGC CGCGGAGAAT CGAACCAATC
GTGATGGCTG TCGGGGGTAA CCGCAACGCT GTCGCCGAGG CGATGCGTCG TGACAGCCGT
TTCCGGCAAT TGCGCCCCGA GGGCACGTGG GCCCTAACGG ACTGGCACGT TCCAGGCGCG
ACCGAATACA CGAACGCGAT GGATGTCGTT GTCGACGTAC TCACAGAGCG AGGACCGATC
ACACGGAAGA ACCTGATCGC AGAGGTCGTA CGCCGCTATC CGGTGAGTGC CGCACGCGTT
GTGCAGTGTC TCATTGGGGT ACGCGTCGGT ATCCATCGAG ACGGCCGGTT CGATCTGGTC
GAACGCGGGG CTAGTCCATA TGAGGAATCT GAGCCGCGAA GGCCGCGGAA CATCATCATC
GATGAGGCCG GGAACATCGC GGGTGTCCTA TTGACAGTAG ACAGGGAAGT CTTGCGGGGA
AGCGGGGTCA TCGTCCATCC GTGGCTCACA TGGCACCTCG GATTACGTCG GGCACCGATG
ACCCGACGAT TCTCCGTCCC GGGAGGCGAC GGAGATGTGA TCACCGTCAG CCGTCATACA
AGCGGGGCAC AGTTCTCGAG CATGAAGTCT TTTGTGGACG ACATGGGCCT AGCCATAGGT
TGCCAGTTCG CCGTGCTTCT CCGCCTCGAC GAAGAGACAG CGTCGGTACG ACACACGTGC
AAACCCGATA CCTGCACGGC GAGCTGA
 
Protein sequence
MALWKSLSPR DQALHSRRAS GETLDSIGQS YGLSRERIRQ LISRGEKSWV DQVDALRPGW 
RASVRQRFAV RPVVAESDFA DILPDSVGIV RGTLLRAAGA DRARTWAGPI DGIWSMDPSG
LAAQLRDLIA LAPFTDDDLD TAAANLEFPE NTPLRAILTH SRSPLVRGPH DYWLRRNARA
RDASYLWLLS EGEPRRIEPI VMAVGGNRNA VAEAMRRDSR FRQLRPEGTW ALTDWHVPGA
TEYTNAMDVV VDVLTERGPI TRKNLIAEVV RRYPVSAARV VQCLIGVRVG IHRDGRFDLV
ERGASPYEES EPRRPRNIII DEAGNIAGVL LTVDREVLRG SGVIVHPWLT WHLGLRRAPM
TRRFSVPGGD GDVITVSRHT SGAQFSSMKS FVDDMGLAIG CQFAVLLRLD EETASVRHTC
KPDTCTAS