Gene Caci_5961 details


Gene Information

Locus tag: Caci_5961
Symbol:
ID: 8337323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Catenulispora acidiphila DSM 44928
Kingdom: Bacteria
Replicon accession: NC_013131
Strand:
Start bp: 6880911
End bp: 6882551
Gene Length: 1641 bp
Protein Length: 546 aa
Translation table: 11
GC content: 67%
IMG OID: 644959065
Product: Cholesterol oxidase
Protein accession: YP_003116660
Protein GI: 256395096
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
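
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(6880911, 6882551)
print(length, protein_length_aa(length))  # prints: 1641 546
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.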


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGCAA CATCAAGGCA CGACCCAGGA GCCCGAGGTC TGTCACGGCG CGGATTCCTG 
GCCGGTACAG GCACGGTTCT GGGCGCGGCG GCCCTCGGCG GCCTGTCGGC CTCCCGGGCC
TCCGCCGCGC AGCGCAGCAC CCCCATCTCC AACGGAGCCC ACGTCCAGGC GCTGATCATC
GGCACCGGAT ACGGCGGCTC GGTCGCGGCG CTGCGGCTGG CCCAGGCCGG CATCGCGGTG
GAGATGATCG AGATGGGCAT GGCCTGGGAC ACCCCGGGGT CGGACGGGAA GATCTTCTGC
AACCTGACCA GTCCGGACCA GCGGTCCTTC TGGCTGCGGA CGCAGACCAA GCAGCCCGTC
GGCTACTTCC TCGGGATCCC GATCGACCGC GCCATCCCGA ACTACACCGG CATCCTCGAC
GCCGAGGACT TCGCCGGGAT CACGGTCTAC CAAGGGCGCG GCATCGGCGG CGGGTCCCTG
GTGAACGGCG GTATGGCGGT GACGCCGAAG CAGGAGAACT TCGGCGCGAT CCTGCCCTCG
GTGAACCCCG CCGAGATGTA CAACGTCTAC TACCCGCGCG CCAACGCCGG TCTCGGCGCC
GGCGTCGTCC CGCAGAGCTG GTTCACCAAA ACCGACTGGT ACCAGTTCGC GCGCGTCGGG
CAGAAGCAGG CCGGGCGGTC CGGGTTCCCG TTCCAGTTCG TGCCGGACGT GTACGACTGG
AACTACATGC AGCAGGAGGA CGCCGGCACG GTCCCGAAGT CGGCGCTGGG CCAGGAACTG
CTCTACGGCA ACAACTACGG CAAGAAGTCC CTGCAGAAGA CGTACATCCC GGCGGCGCTG
GCCACCGGCA AGGTGAACAT CTCCCCGCTG CACAAGGTGA CCTCGGTGTC CCCGGCCTCC
GGCGGCGGCT ACACGGTGCT GATGAACCAG CTGGACACCT CCGGGAACGT GGTCGTCACC
AAGGAGGTCA CCGCCGACAA GGTGGTCTTC GCCGCGGGCA GCGTGGGCAC CAGCAAGCTG
CTGGTCCAGA TGCGCGACAC CGGGCAGCTG CCGCACCTGA ACGACCAGGT CGGGCAGGGC
TGGGGCGACA ACGGCAACAT CATGGTCGGC CGGGCGAACC AGATCTGGGA CCCCACCGGC
TCCAAACAGT CCACGGTCCC GTGCGGCGGC ATCGACAACT GGACCAAGGG CGGCGCGTTC
GCCGAGGTGG CGCCGCTGCC GATCGGGATC GAGACCTGGG CCTCGCTGTA CCTGTCGATC
ACGAAGAACC CGCACCGCGC GCAGTTCACC TGGAACGCCG CCACGCAGAA GGTCGACCTG
AGCTGGCAGC TGGCGTGGAA GCAGGACGGC ATCACGATGG CCAAGAGCAT CTTCGACAAG
ATCAACTCCA CCGAGGGCAC CATCTACCGG ACCGACCTGT TCGGCTCGTA TAAGACCTGG
CAGGACCAGC TGACGTACCA CCCGCTGGGC GGCGCGGTGC TGAATCAGGC CACGGACAAC
TACGGCCGGC TGACCGCCTA TCCGGGCCTG TACGTGATGG ACGGCGCGCT GATCCCCGGC
AACACCAGCG TGAACCCGTT CGTCACCATC ACCGCGCTGG CCGAGCGCAA CATCGAGAAC
ATCATCGCCA ATGGCGGATG A
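
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation such as GC content. The sketch below shows one way to do that. The helper names are hypothetical, and the demonstration uses only the first line of the sequence rather than the full gene, so its result is not expected to equal the 67% reported for the whole record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGAGCGCAA CATCAAGGCA CGACCCAGGA GCCCGAGGTC TGTCACGGCG CGGATTCCTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the complete gene sequence through the same two functions would give the whole-gene GC percentage for comparison with the GC content field.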
 
Protein sequence
MSATSRHDPG ARGLSRRGFL AGTGTVLGAA ALGGLSASRA SAAQRSTPIS NGAHVQALII 
GTGYGGSVAA LRLAQAGIAV EMIEMGMAWD TPGSDGKIFC NLTSPDQRSF WLRTQTKQPV
GYFLGIPIDR AIPNYTGILD AEDFAGITVY QGRGIGGGSL VNGGMAVTPK QENFGAILPS
VNPAEMYNVY YPRANAGLGA GVVPQSWFTK TDWYQFARVG QKQAGRSGFP FQFVPDVYDW
NYMQQEDAGT VPKSALGQEL LYGNNYGKKS LQKTYIPAAL ATGKVNISPL HKVTSVSPAS
GGGYTVLMNQ LDTSGNVVVT KEVTADKVVF AAGSVGTSKL LVQMRDTGQL PHLNDQVGQG
WGDNGNIMVG RANQIWDPTG SKQSTVPCGG IDNWTKGGAF AEVAPLPIGI ETWASLYLSI
TKNPHRAQFT WNAATQKVDL SWQLAWKQDG ITMAKSIFDK INSTEGTIYR TDLFGSYKTW
QDQLTYHPLG GAVLNQATDN YGRLTAYPGL YVMDGALIPG NTSVNPFVTI TALAERNIEN
IIANGG
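
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, assuming the amino acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Whitespace (the ten-base block formatting) is removed first.
    """
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGCGCAA CATCAAGGCA CGACCCAGGA"))  # prints: MSATSRHDPG
```

The output matches the first ten residues of the protein sequence listed in the record. Translating the last line of the gene sequence in the same way yields `IIANGG`, which matches the final residues.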