Gene Information |
Locus tag | Caci_4459 |
Symbol | |
ID | 8335813 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5076245 |
End bp | 5077873 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 644957561 |
Product | glucose-methanol-choline oxidoreductase |
Protein accession | YP_003115163 |
Protein GI | 256393599 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | |
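The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the reported gene length agrees with that reading) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 5076245
end_bp = 5077873

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is the stop codon,
# which does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints "1629 0 542"
```

Both results match the Gene Length (1629 bp) and Protein Length (542 aa) rows of the record.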
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.084448 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACAGC GTTCGAAGCG TGCGATGCCC GGGGAGGCCG ACTACGTCGT GGTCGGCGCC GGGAGTGCGG GGTGTGTGCT GGCCGCACGG CTGGCCGGGA GCGGGGCGCG GGTGGTGCTG ATCGAGGCCG GCGGCTCGGA CCGGACGACG CTGGTGCGCA AACCCGGGCT GATCGCCGCG GTGCACAGCG TGCCGCAGCT GAAGGCGCGG CTGGACTGGG GCTACTACTC GGTGCCGCAG AGCGACGCGC TGGAGCGCAA GATCCCGCAG ACGCGCGGCA AGGTGCTCGG CGGGTCCGGG TCGGTGAACG GGATGCTGTT CGTGCGTGGG AACGCCGCGA ACTATGACTC CTGGGCCGCA GAGGGCTGCG ACGGGTGGTC GTACGCGGAC GTGCTGCCCA GCTTCAAGAA GCTGGAGAGC TGGGAGGAAG GCGAGACCGA GTTCCGCGGC GGCGCCGGAC CGATTAAGGT GCGGCGGCAG ACGGACGTCA CGACCGCGAC GCTGGGCTTC ATGGAGGCGT TCGCCGACAC CGCCGGGGTG AAGGTGCTCG ACGACTACAA CGGCGAGTCG CAGGAGGGCA TCGCGATCGT CCAGCAGAGC GCGCACGACG GGCTGCGCTA CAGCTCCTCG GTCGGCTACC TGGACGACCA CGGCATGGCG CAGCTCGACG TCGTCACCGG GGTGACGGTC GCGCGGGTGG TGCTGGAGAA GGGACGCGCG GTCGGGGTCG AGGTCGTCGG CGAGGATGGT GTGCGGCAGG TGGTGCGGGC CACACGCGAG GTGGTGCTGT GCGCCGGGGT GTTCGGCTCG GCGCAGCTGC TGCAACTGTC CGGGATCGGA CCGGCGGAGC ATCTGCGCTC GGTGGGCGTC GAGGTGGTCC AGGACCTGCC GGTCGGGGAC AACCTGCACG ACCACCTGTT CGTCCCGATG TGCTTCCTGA TGCCGGAGGC GCGGAACAAG GGGACGGCGC CGTACTTCGC GCGCGGCTTC GTGAAGGAGA TGACGCGCGG CGGGACGTGG GTCGGGCGGA CGGTGTTCGA GTCGGTGGGG TTCGTACGCA GCCCGAACGC CGGCAGCGTG CCGGATTTGC AGATCCACGT GCTGCCGTGG TCCTATCCCG GACCGAACCA GGACGCGCCG ATCCGGCACA AGGCCGACCC GCGGCGGACG CTGACGGTGA TGCCGACGCT GATCTACCCC CACAGCCGCG GGACCCTGCG CCTGGCATCG GCCGACCCGC TCGCCGCGCC GCTCATCGAC CCGGCGTACC TGCGCGAACC GGCGGACACC CAGCTGCTGC TGGACGGGAT GGAGATGGTC CGCGAGGCGA TGGCGCACCG CTCACTGTCC GGGCGCGTGC AGGGCGAGAG CTCGCCGGGC ACGGCGTACG CGAACCGCGC GGCGCTCGCC GCCGAGCTGC CGAACCGCGC GACGACGGTC TACCATCCGG TGGGCACGTG CCGCATGGGC GTCGACGAGC GCGCGGTGGT GGACCCGGCC CTGCGGGTGC GGGGGGTCGA AGGGCTGCGG GTCGCGGACG CCTCGATCAT GCCGAGCATC GTCGGCGGGA ACACGAACGC CGCGGCGCTG ATGATCGGCG AGCATGCGGC GGGGCTGATT CTGGGGTGA
|
Protein sequence | MGQRSKRAMP GEADYVVVGA GSAGCVLAAR LAGSGARVVL IEAGGSDRTT LVRKPGLIAA VHSVPQLKAR LDWGYYSVPQ SDALERKIPQ TRGKVLGGSG SVNGMLFVRG NAANYDSWAA EGCDGWSYAD VLPSFKKLES WEEGETEFRG GAGPIKVRRQ TDVTTATLGF MEAFADTAGV KVLDDYNGES QEGIAIVQQS AHDGLRYSSS VGYLDDHGMA QLDVVTGVTV ARVVLEKGRA VGVEVVGEDG VRQVVRATRE VVLCAGVFGS AQLLQLSGIG PAEHLRSVGV EVVQDLPVGD NLHDHLFVPM CFLMPEARNK GTAPYFARGF VKEMTRGGTW VGRTVFESVG FVRSPNAGSV PDLQIHVLPW SYPGPNQDAP IRHKADPRRT LTVMPTLIYP HSRGTLRLAS ADPLAAPLID PAYLREPADT QLLLDGMEMV REAMAHRSLS GRVQGESSPG TAYANRAALA AELPNRATTV YHPVGTCRMG VDERAVVDPA LRVRGVEGLR VADASIMPSI VGGNTNAAAL MIGEHAAGLI LG
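The sequence fields are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might clean a field, compute its GC percentage for comparison with the GC content row, and check that a coding sequence is a whole number of codons ending in a stop codon of translation table 11 (TAA, TAG, TGA). The function names are hypothetical helpers, not part of any database API, and the short string at the end is only the first two blocks of the gene sequence, used for illustration.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal, plant plastid).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(field: str) -> str:
    """Join the 10-character blocks of a sequence field into one string."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the sequence is a whole number of codons ending in a stop codon."""
    return len(seq) >= 3 and len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Illustration on the first two blocks of the gene sequence only;
# paste the full "Gene sequence" field here to check the whole record.
fragment = clean_sequence("ATGGGACAGC GTTCGAAGCG")
print(len(fragment), gc_percent(fragment))
```

Run on the complete gene sequence, `gc_percent` can be compared with the 72% reported in the record, and `ends_with_stop` should hold because the sequence ends in TGA.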