Gene Caci_4459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaci_4459 
Symbol 
ID8335813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCatenulispora acidiphila DSM 44928 
KingdomBacteria 
Replicon accessionNC_013131 
Strand
Start bp5076245 
End bp5077873 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content72% 
IMG OID644957561 
Productglucose-methanol-choline oxidoreductase 
Protein accessionYP_003115163 
Protein GI256393599 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.084448 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACAGC GTTCGAAGCG TGCGATGCCC GGGGAGGCCG ACTACGTCGT GGTCGGCGCC 
GGGAGTGCGG GGTGTGTGCT GGCCGCACGG CTGGCCGGGA GCGGGGCGCG GGTGGTGCTG
ATCGAGGCCG GCGGCTCGGA CCGGACGACG CTGGTGCGCA AACCCGGGCT GATCGCCGCG
GTGCACAGCG TGCCGCAGCT GAAGGCGCGG CTGGACTGGG GCTACTACTC GGTGCCGCAG
AGCGACGCGC TGGAGCGCAA GATCCCGCAG ACGCGCGGCA AGGTGCTCGG CGGGTCCGGG
TCGGTGAACG GGATGCTGTT CGTGCGTGGG AACGCCGCGA ACTATGACTC CTGGGCCGCA
GAGGGCTGCG ACGGGTGGTC GTACGCGGAC GTGCTGCCCA GCTTCAAGAA GCTGGAGAGC
TGGGAGGAAG GCGAGACCGA GTTCCGCGGC GGCGCCGGAC CGATTAAGGT GCGGCGGCAG
ACGGACGTCA CGACCGCGAC GCTGGGCTTC ATGGAGGCGT TCGCCGACAC CGCCGGGGTG
AAGGTGCTCG ACGACTACAA CGGCGAGTCG CAGGAGGGCA TCGCGATCGT CCAGCAGAGC
GCGCACGACG GGCTGCGCTA CAGCTCCTCG GTCGGCTACC TGGACGACCA CGGCATGGCG
CAGCTCGACG TCGTCACCGG GGTGACGGTC GCGCGGGTGG TGCTGGAGAA GGGACGCGCG
GTCGGGGTCG AGGTCGTCGG CGAGGATGGT GTGCGGCAGG TGGTGCGGGC CACACGCGAG
GTGGTGCTGT GCGCCGGGGT GTTCGGCTCG GCGCAGCTGC TGCAACTGTC CGGGATCGGA
CCGGCGGAGC ATCTGCGCTC GGTGGGCGTC GAGGTGGTCC AGGACCTGCC GGTCGGGGAC
AACCTGCACG ACCACCTGTT CGTCCCGATG TGCTTCCTGA TGCCGGAGGC GCGGAACAAG
GGGACGGCGC CGTACTTCGC GCGCGGCTTC GTGAAGGAGA TGACGCGCGG CGGGACGTGG
GTCGGGCGGA CGGTGTTCGA GTCGGTGGGG TTCGTACGCA GCCCGAACGC CGGCAGCGTG
CCGGATTTGC AGATCCACGT GCTGCCGTGG TCCTATCCCG GACCGAACCA GGACGCGCCG
ATCCGGCACA AGGCCGACCC GCGGCGGACG CTGACGGTGA TGCCGACGCT GATCTACCCC
CACAGCCGCG GGACCCTGCG CCTGGCATCG GCCGACCCGC TCGCCGCGCC GCTCATCGAC
CCGGCGTACC TGCGCGAACC GGCGGACACC CAGCTGCTGC TGGACGGGAT GGAGATGGTC
CGCGAGGCGA TGGCGCACCG CTCACTGTCC GGGCGCGTGC AGGGCGAGAG CTCGCCGGGC
ACGGCGTACG CGAACCGCGC GGCGCTCGCC GCCGAGCTGC CGAACCGCGC GACGACGGTC
TACCATCCGG TGGGCACGTG CCGCATGGGC GTCGACGAGC GCGCGGTGGT GGACCCGGCC
CTGCGGGTGC GGGGGGTCGA AGGGCTGCGG GTCGCGGACG CCTCGATCAT GCCGAGCATC
GTCGGCGGGA ACACGAACGC CGCGGCGCTG ATGATCGGCG AGCATGCGGC GGGGCTGATT
CTGGGGTGA
 
Protein sequence
MGQRSKRAMP GEADYVVVGA GSAGCVLAAR LAGSGARVVL IEAGGSDRTT LVRKPGLIAA 
VHSVPQLKAR LDWGYYSVPQ SDALERKIPQ TRGKVLGGSG SVNGMLFVRG NAANYDSWAA
EGCDGWSYAD VLPSFKKLES WEEGETEFRG GAGPIKVRRQ TDVTTATLGF MEAFADTAGV
KVLDDYNGES QEGIAIVQQS AHDGLRYSSS VGYLDDHGMA QLDVVTGVTV ARVVLEKGRA
VGVEVVGEDG VRQVVRATRE VVLCAGVFGS AQLLQLSGIG PAEHLRSVGV EVVQDLPVGD
NLHDHLFVPM CFLMPEARNK GTAPYFARGF VKEMTRGGTW VGRTVFESVG FVRSPNAGSV
PDLQIHVLPW SYPGPNQDAP IRHKADPRRT LTVMPTLIYP HSRGTLRLAS ADPLAAPLID
PAYLREPADT QLLLDGMEMV REAMAHRSLS GRVQGESSPG TAYANRAALA AELPNRATTV
YHPVGTCRMG VDERAVVDPA LRVRGVEGLR VADASIMPSI VGGNTNAAAL MIGEHAAGLI
LG