Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_5961 |
Symbol | |
ID | 8337323 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 6880911 |
End bp | 6882551 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644959065 |
Product | Cholesterol oxidase |
Protein accession | YP_003116660 |
Protein GI | 256395096 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCAA CATCAAGGCA CGACCCAGGA GCCCGAGGTC TGTCACGGCG CGGATTCCTG GCCGGTACAG GCACGGTTCT GGGCGCGGCG GCCCTCGGCG GCCTGTCGGC CTCCCGGGCC TCCGCCGCGC AGCGCAGCAC CCCCATCTCC AACGGAGCCC ACGTCCAGGC GCTGATCATC GGCACCGGAT ACGGCGGCTC GGTCGCGGCG CTGCGGCTGG CCCAGGCCGG CATCGCGGTG GAGATGATCG AGATGGGCAT GGCCTGGGAC ACCCCGGGGT CGGACGGGAA GATCTTCTGC AACCTGACCA GTCCGGACCA GCGGTCCTTC TGGCTGCGGA CGCAGACCAA GCAGCCCGTC GGCTACTTCC TCGGGATCCC GATCGACCGC GCCATCCCGA ACTACACCGG CATCCTCGAC GCCGAGGACT TCGCCGGGAT CACGGTCTAC CAAGGGCGCG GCATCGGCGG CGGGTCCCTG GTGAACGGCG GTATGGCGGT GACGCCGAAG CAGGAGAACT TCGGCGCGAT CCTGCCCTCG GTGAACCCCG CCGAGATGTA CAACGTCTAC TACCCGCGCG CCAACGCCGG TCTCGGCGCC GGCGTCGTCC CGCAGAGCTG GTTCACCAAA ACCGACTGGT ACCAGTTCGC GCGCGTCGGG CAGAAGCAGG CCGGGCGGTC CGGGTTCCCG TTCCAGTTCG TGCCGGACGT GTACGACTGG AACTACATGC AGCAGGAGGA CGCCGGCACG GTCCCGAAGT CGGCGCTGGG CCAGGAACTG CTCTACGGCA ACAACTACGG CAAGAAGTCC CTGCAGAAGA CGTACATCCC GGCGGCGCTG GCCACCGGCA AGGTGAACAT CTCCCCGCTG CACAAGGTGA CCTCGGTGTC CCCGGCCTCC GGCGGCGGCT ACACGGTGCT GATGAACCAG CTGGACACCT CCGGGAACGT GGTCGTCACC AAGGAGGTCA CCGCCGACAA GGTGGTCTTC GCCGCGGGCA GCGTGGGCAC CAGCAAGCTG CTGGTCCAGA TGCGCGACAC CGGGCAGCTG CCGCACCTGA ACGACCAGGT CGGGCAGGGC TGGGGCGACA ACGGCAACAT CATGGTCGGC CGGGCGAACC AGATCTGGGA CCCCACCGGC TCCAAACAGT CCACGGTCCC GTGCGGCGGC ATCGACAACT GGACCAAGGG CGGCGCGTTC GCCGAGGTGG CGCCGCTGCC GATCGGGATC GAGACCTGGG CCTCGCTGTA CCTGTCGATC ACGAAGAACC CGCACCGCGC GCAGTTCACC TGGAACGCCG CCACGCAGAA GGTCGACCTG AGCTGGCAGC TGGCGTGGAA GCAGGACGGC ATCACGATGG CCAAGAGCAT CTTCGACAAG ATCAACTCCA CCGAGGGCAC CATCTACCGG ACCGACCTGT TCGGCTCGTA TAAGACCTGG CAGGACCAGC TGACGTACCA CCCGCTGGGC GGCGCGGTGC TGAATCAGGC CACGGACAAC TACGGCCGGC TGACCGCCTA TCCGGGCCTG TACGTGATGG ACGGCGCGCT GATCCCCGGC AACACCAGCG TGAACCCGTT CGTCACCATC ACCGCGCTGG CCGAGCGCAA CATCGAGAAC ATCATCGCCA ATGGCGGATG A
|
Protein sequence | MSATSRHDPG ARGLSRRGFL AGTGTVLGAA ALGGLSASRA SAAQRSTPIS NGAHVQALII GTGYGGSVAA LRLAQAGIAV EMIEMGMAWD TPGSDGKIFC NLTSPDQRSF WLRTQTKQPV GYFLGIPIDR AIPNYTGILD AEDFAGITVY QGRGIGGGSL VNGGMAVTPK QENFGAILPS VNPAEMYNVY YPRANAGLGA GVVPQSWFTK TDWYQFARVG QKQAGRSGFP FQFVPDVYDW NYMQQEDAGT VPKSALGQEL LYGNNYGKKS LQKTYIPAAL ATGKVNISPL HKVTSVSPAS GGGYTVLMNQ LDTSGNVVVT KEVTADKVVF AAGSVGTSKL LVQMRDTGQL PHLNDQVGQG WGDNGNIMVG RANQIWDPTG SKQSTVPCGG IDNWTKGGAF AEVAPLPIGI ETWASLYLSI TKNPHRAQFT WNAATQKVDL SWQLAWKQDG ITMAKSIFDK INSTEGTIYR TDLFGSYKTW QDQLTYHPLG GAVLNQATDN YGRLTAYPGL YVMDGALIPG NTSVNPFVTI TALAERNIEN IIANGG
|
| |