Gene Caci_0871 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_0871 
Symbol 
ID8332201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp1010751 
End bp1012550 
Gene Length1800 bp 
Protein Length599 aa 
Translation table11 
GC content68% 
IMG OID644954021 
Productprotein of unknown function DUF1271 
Protein accessionYP_003111645 
Protein GI256390081 
COG category[R] General function prediction only 
COG ID[COG2346] Truncated hemoglobins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.207432 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.190699 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACGA GTAACGACAC CGACCTGACC ACCCCGACGG CGCTGCTGGC GGGCGCGCGC 
CGGCTTGAGC GCCGCGTCGC CGATGCTTTG TCCGGAACCT ATGACGGCGA GATCGACGCC
GAGCTGCTCC GCGGCGCCTC GGTGCAGCTG AACGGATCGG TTATCAGGCC TCTGGCTCTT
CTTGTAGCCG GAACGCTCGA CGACCCCGTT ACCGCCGAGG AGCCGTCGAT CGACGCCGAG
CTCTGGCGTC TCACCCAGGA AGCCACCCGG CTGCGCGCCA CCACTGGCGT GCCCGCTCCG
CTGATCGAGG CTACTGCGGC ACTTCAGGAC CTAGCCTGTC GGCTAGTTCC CGATCCTGCG
GTGGTCGCCG GACGCATCGC ACGGCTGGCC GCGCTACAGG GCGATCTGCC GACCAGCATC
CAGGCGTCGG AGGATGGTCC CTACCTCGTC ACCAATGCCA GCCACTTGAC CACCTGGCTA
GGGGAGCCGT TGCCGCTGCG TCCGCAGATG GCGCTGTGTC GCTGCGGAGG CTCGGCGACC
AAGCCGTTTT GCGACGGCGC GCATGCGACG AACGGCTTCA GCGGCGCCAA GAGTCCGGCA
CGCGTGGCCG ATCGGCGGGA CACGTATCCC GGGCAGCAGG TCACCGTCCT GGACAATCGC
GGGATCTGCG CTCATTCGGG GCTGTGCACC GACCGTCTTC CGACCGTGTT CCGTCAGGGC
CAGGAGCCTT TCGTGGCGCC GAGCGGGGGG CGCATGGACG AGATCGTCCG GGCGGTTCGG
GCGTGTCCGT CGGGCGCCTT GAGTTTTGCG ATCGACGACC GTGAGGCCCG GGAACAAGTC
GACCAGGATC GGCCGGCGGC GATTGAGGTC TCCAAGGACG GTCCCTACCG GGTCACCGGC
TCGATTCCGC TCACCGGTGC TGACGGCGAG CCGGAGCCGC GGAATGCGGG ATCCTCGACC
GAGCACTACA GCTTGTGCCG TTGCGGGCAG TCGCAGAACA AGCCGTTCTG CAGCGGCATG
CACTGGTACG TCGACTTCCA GGATCCGCCC GCGCCCTCGG AGCCGACGCT CTTCCAGTGG
GCCGGCGGGC TGCCGGCGCT GACCAGGATG ACGCGGATCT TCTACGCCAA GCACGTACCG
GCCGATCCGC TGCTCGCGCC GATCTTCGCG AACATGTCGC CGGACCATCC GGAACGCGTG
GCGGCCTGGC TCGGCGAGAC CTTCGGCGGC CCGACCGTGT ACACCGACAC CTACGGCGGC
TACGACCGAA TGGTCGGGCA GCACGCGGGC AAGGGCCTCA GCGAGGAGCA GCGCGCGCGC
TGGGCGCAGC TCATCGTGCG CTCGGCTGAT GAAGCCGGGC TGCCGAGCGA CCCCGAGTTC
CGCGCGGCGT TCGTCTCCTA CATCGAGTGG GGCTCGCGCA TCGCCGTGGA GAACTCCCAG
CCGGGCGCCC ACCCGCCACC GCACATGCCG GTACCGCGCT GGTGGTGGGT GTGCGGCGCG
ACGCCAGATG CCCGAGTCTC CGCTCTCGCC GTACAAACCA ATCCGGAAGG ACCTGTCATG
ACGCTGCCCG CGAACGACGC GCCGCTCAGC TTCGACGCAC ACATCAGGAC CCTGTTCAGG
GAGATGGACA GGCGATCGAT GAAGTTCGTC TTCGACTTGT GGTCGCACGA CGACGTCAGT
CGGCATGCCG AGGCGATCCT CGGCCGGCTC CGGCAAGGGT CGATGCCGTG CGACGGCGCC
TGGCCGAGGG AGAAGACGGA TGTCTTCGAG CGGTGGATTC GGGCTGGGAA ACCTGCCTAA
 
Protein sequence
MTTSNDTDLT TPTALLAGAR RLERRVADAL SGTYDGEIDA ELLRGASVQL NGSVIRPLAL 
LVAGTLDDPV TAEEPSIDAE LWRLTQEATR LRATTGVPAP LIEATAALQD LACRLVPDPA
VVAGRIARLA ALQGDLPTSI QASEDGPYLV TNASHLTTWL GEPLPLRPQM ALCRCGGSAT
KPFCDGAHAT NGFSGAKSPA RVADRRDTYP GQQVTVLDNR GICAHSGLCT DRLPTVFRQG
QEPFVAPSGG RMDEIVRAVR ACPSGALSFA IDDREAREQV DQDRPAAIEV SKDGPYRVTG
SIPLTGADGE PEPRNAGSST EHYSLCRCGQ SQNKPFCSGM HWYVDFQDPP APSEPTLFQW
AGGLPALTRM TRIFYAKHVP ADPLLAPIFA NMSPDHPERV AAWLGETFGG PTVYTDTYGG
YDRMVGQHAG KGLSEEQRAR WAQLIVRSAD EAGLPSDPEF RAAFVSYIEW GSRIAVENSQ
PGAHPPPHMP VPRWWWVCGA TPDARVSALA VQTNPEGPVM TLPANDAPLS FDAHIRTLFR
EMDRRSMKFV FDLWSHDDVS RHAEAILGRL RQGSMPCDGA WPREKTDVFE RWIRAGKPA