Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_0871 |
Symbol | |
ID | 8332201 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | - |
Start bp | 1010751 |
End bp | 1012550 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644954021 |
Product | protein of unknown function DUF1271 |
Protein accession | YP_003111645 |
Protein GI | 256390081 |
COG category | [R] General function prediction only |
COG ID | [COG2346] Truncated hemoglobins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.207432 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.190699 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACGA GTAACGACAC CGACCTGACC ACCCCGACGG CGCTGCTGGC GGGCGCGCGC CGGCTTGAGC GCCGCGTCGC CGATGCTTTG TCCGGAACCT ATGACGGCGA GATCGACGCC GAGCTGCTCC GCGGCGCCTC GGTGCAGCTG AACGGATCGG TTATCAGGCC TCTGGCTCTT CTTGTAGCCG GAACGCTCGA CGACCCCGTT ACCGCCGAGG AGCCGTCGAT CGACGCCGAG CTCTGGCGTC TCACCCAGGA AGCCACCCGG CTGCGCGCCA CCACTGGCGT GCCCGCTCCG CTGATCGAGG CTACTGCGGC ACTTCAGGAC CTAGCCTGTC GGCTAGTTCC CGATCCTGCG GTGGTCGCCG GACGCATCGC ACGGCTGGCC GCGCTACAGG GCGATCTGCC GACCAGCATC CAGGCGTCGG AGGATGGTCC CTACCTCGTC ACCAATGCCA GCCACTTGAC CACCTGGCTA GGGGAGCCGT TGCCGCTGCG TCCGCAGATG GCGCTGTGTC GCTGCGGAGG CTCGGCGACC AAGCCGTTTT GCGACGGCGC GCATGCGACG AACGGCTTCA GCGGCGCCAA GAGTCCGGCA CGCGTGGCCG ATCGGCGGGA CACGTATCCC GGGCAGCAGG TCACCGTCCT GGACAATCGC GGGATCTGCG CTCATTCGGG GCTGTGCACC GACCGTCTTC CGACCGTGTT CCGTCAGGGC CAGGAGCCTT TCGTGGCGCC GAGCGGGGGG CGCATGGACG AGATCGTCCG GGCGGTTCGG GCGTGTCCGT CGGGCGCCTT GAGTTTTGCG ATCGACGACC GTGAGGCCCG GGAACAAGTC GACCAGGATC GGCCGGCGGC GATTGAGGTC TCCAAGGACG GTCCCTACCG GGTCACCGGC TCGATTCCGC TCACCGGTGC TGACGGCGAG CCGGAGCCGC GGAATGCGGG ATCCTCGACC GAGCACTACA GCTTGTGCCG TTGCGGGCAG TCGCAGAACA AGCCGTTCTG CAGCGGCATG CACTGGTACG TCGACTTCCA GGATCCGCCC GCGCCCTCGG AGCCGACGCT CTTCCAGTGG GCCGGCGGGC TGCCGGCGCT GACCAGGATG ACGCGGATCT TCTACGCCAA GCACGTACCG GCCGATCCGC TGCTCGCGCC GATCTTCGCG AACATGTCGC CGGACCATCC GGAACGCGTG GCGGCCTGGC TCGGCGAGAC CTTCGGCGGC CCGACCGTGT ACACCGACAC CTACGGCGGC TACGACCGAA TGGTCGGGCA GCACGCGGGC AAGGGCCTCA GCGAGGAGCA GCGCGCGCGC TGGGCGCAGC TCATCGTGCG CTCGGCTGAT GAAGCCGGGC TGCCGAGCGA CCCCGAGTTC CGCGCGGCGT TCGTCTCCTA CATCGAGTGG GGCTCGCGCA TCGCCGTGGA GAACTCCCAG CCGGGCGCCC ACCCGCCACC GCACATGCCG GTACCGCGCT GGTGGTGGGT GTGCGGCGCG ACGCCAGATG CCCGAGTCTC CGCTCTCGCC GTACAAACCA ATCCGGAAGG ACCTGTCATG ACGCTGCCCG CGAACGACGC GCCGCTCAGC TTCGACGCAC ACATCAGGAC CCTGTTCAGG GAGATGGACA GGCGATCGAT GAAGTTCGTC TTCGACTTGT GGTCGCACGA CGACGTCAGT CGGCATGCCG AGGCGATCCT CGGCCGGCTC CGGCAAGGGT CGATGCCGTG CGACGGCGCC TGGCCGAGGG AGAAGACGGA TGTCTTCGAG CGGTGGATTC GGGCTGGGAA ACCTGCCTAA
|
Protein sequence | MTTSNDTDLT TPTALLAGAR RLERRVADAL SGTYDGEIDA ELLRGASVQL NGSVIRPLAL LVAGTLDDPV TAEEPSIDAE LWRLTQEATR LRATTGVPAP LIEATAALQD LACRLVPDPA VVAGRIARLA ALQGDLPTSI QASEDGPYLV TNASHLTTWL GEPLPLRPQM ALCRCGGSAT KPFCDGAHAT NGFSGAKSPA RVADRRDTYP GQQVTVLDNR GICAHSGLCT DRLPTVFRQG QEPFVAPSGG RMDEIVRAVR ACPSGALSFA IDDREAREQV DQDRPAAIEV SKDGPYRVTG SIPLTGADGE PEPRNAGSST EHYSLCRCGQ SQNKPFCSGM HWYVDFQDPP APSEPTLFQW AGGLPALTRM TRIFYAKHVP ADPLLAPIFA NMSPDHPERV AAWLGETFGG PTVYTDTYGG YDRMVGQHAG KGLSEEQRAR WAQLIVRSAD EAGLPSDPEF RAAFVSYIEW GSRIAVENSQ PGAHPPPHMP VPRWWWVCGA TPDARVSALA VQTNPEGPVM TLPANDAPLS FDAHIRTLFR EMDRRSMKFV FDLWSHDDVS RHAEAILGRL RQGSMPCDGA WPREKTDVFE RWIRAGKPA
|
| |