Gene Caul_2930 details

Gene Information

Locus tag: Caul_2930
Symbol:
ID: 5900385
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3176569
End bp: 3178125
Gene Length: 1557 bp
Protein Length: 518 aa
Translation table: 11
GC content: 70%
IMG OID: 641563427
Product: glucose-methanol-choline oxidoreductase
Protein accession: YP_001684555
Protein GI: 167646892
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
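
The gene and protein lengths listed above follow from the start and end coordinates. The Python sketch below shows that arithmetic. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(3176569, 3178125)
    print(length, "bp")
    print(protein_length_aa(length), "aa")
```

With the coordinates from this record, the two functions return 1557 and 518, matching the Gene Length and Protein Length fields.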


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.917064
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTTTCA TCGACGCCCG AACCTTGCCT GACGCCAGCC GGATCGAGGC CGACCTGGTG 
ATCATTGGCG GCGGCCTGGC CGGCATCGCG CTCGCCAAGG AGCTGGCCGG CGGACCGCTG
AAGGTCGCCG TCCTCGAGAG CGGCGGCCGC GAGATCGACA TGGAGATCCA GGGCCTCTAC
GCCGGAACCG CCGTGGTCAA GGCGCCCGAC AATCCCGACA AGCCGTTCGA CGACTATCCG
GTCCAGTCGC GGGTCCGGGT GCTGGGCGGC TCGGGCATGG TCTGGGGTGG CAAGTGCGCA
CCGCTCGATC CCGCCGACTT CGCCGCCCGC GCCTGGGCGC CGCACAGCGG CTGGCCGGTC
ACGCGAGTCC AGATGCAGCC GTTCTACGAC CGGGCCTGCG ACCTGCTGGA GATCCCCCGC
TTCGACGCCG ACAACAAGGC GCTGAAGGAC CCGGCCCGCC CGCCGCTGGC GCTCGACCCG
CGGGACGGCT TCTTCTCGGC CCCGCGCGTC TTCACCCGCT ATTCCGGCGG CGCGGACAAG
GACGCCTTCG ACCGCTTCCG CACCGATTTC GCCGAAGCGC CCAACATCAC CGTCTATCTG
CACGCCAATG TCACCCAGAT CCGCCTGAAC GCGGCGGGCG ACCAGGTCGA AGGCCTGGAC
GTGGCCTGCC TGGACGGCAA GCGCCACACG GCGGTCGGCA AGACCCATGT GCTGGCGGTC
GGCGGCATCG AGAACGTCCG CCTGCTGTTG GCCTCGAACA GCGTGCGGCC CGAGGGCGTT
GGCAACCGCC ACGACCTGGT CGGCCGGTTC TTCCAGGGCC ACGTGACCTA CAGCTTGGAC
GGCGACGCCG AGACCGAGGG CACGGCGGTC CACGTCTCGC GCGCCGAACC GATGAGCCTC
TATTTCAACC CGGGCCGCAC CGCCGCCCAC TGCGTGCTGG CCAGCGGCCT TCCGGCCCAG
GCGCGGATGA AGACCGGCAA CTTCACCGCC ACCCTCTACG CCGCCGAAGA GACCGGGGTC
GCAACCCCGC CCGAGGCCGA GACCAAGGCC CTGCGCCGGG TCGCCACGCG GATCGACGGG
ACGGGAAAGA CCGACGGCCA ACTTCTGGGC TTCTTCGCGA TGTCCGAGCA CTTCCCCAAT
CCCGACAGCC GCGTGGCCCT GGATCCCTCG GCCAAGGACG CGCTGGGCAT GCCGCGCGTT
CATCTGGAGT GGCGCTATTC AAAGGCCGAC TGGGACAGCC TGGAACGCTC GGCGGCCGGC
TTCGGCGACG CCCTGGGCGC CTCAAGCCAG GGCCGCGCCT GCTGGCCGAT CAAGCGCGGG
CAGCTGCTGG AGATCGCCAG CGCCTCGCGT CACCACATGG GCACGACCCG GATGAGCGCC
GATCCCGAGA AGGGCGTCGT CGATCCGAAC CTGAGGGTCC ACGGGACCGG CAACCTCTAT
GTCGCCGGCA GCTCGGTGTT CCCGACCTCG GGCATCGCCA ACCCCACCCT GACGATCCTG
GCCCTGGTCA TGCGCCTGGC CGACCACCTG AAGCTGGACA TGGGAGCCCG CCGATGA
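
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The Python sketch below shows one way to do that and to compute a GC fraction such as the GC content field reported for this gene. The helper names are hypothetical, and the sketch assumes the input contains only the letters A, C, G and T plus whitespace.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First two blocks of the gene sequence, copied from the record.
    blocks = "ATGGCTTTCA TCGACGCCCG"
    print(clean_sequence(blocks))
    print(f"{gc_fraction(blocks):.0%}")
```

Passing the full gene sequence, with its spaces and line breaks intact, to `gc_fraction` gives the value to compare against the GC content field.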
 
Protein sequence
MAFIDARTLP DASRIEADLV IIGGGLAGIA LAKELAGGPL KVAVLESGGR EIDMEIQGLY 
AGTAVVKAPD NPDKPFDDYP VQSRVRVLGG SGMVWGGKCA PLDPADFAAR AWAPHSGWPV
TRVQMQPFYD RACDLLEIPR FDADNKALKD PARPPLALDP RDGFFSAPRV FTRYSGGADK
DAFDRFRTDF AEAPNITVYL HANVTQIRLN AAGDQVEGLD VACLDGKRHT AVGKTHVLAV
GGIENVRLLL ASNSVRPEGV GNRHDLVGRF FQGHVTYSLD GDAETEGTAV HVSRAEPMSL
YFNPGRTAAH CVLASGLPAQ ARMKTGNFTA TLYAAEETGV ATPPEAETKA LRRVATRIDG
TGKTDGQLLG FFAMSEHFPN PDSRVALDPS AKDALGMPRV HLEWRYSKAD WDSLERSAAG
FGDALGASSQ GRACWPIKRG QLLEIASASR HHMGTTRMSA DPEKGVVDPN LRVHGTGNLY
VAGSSVFPTS GIANPTLTIL ALVMRLADHL KLDMGARR
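
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The Python sketch below illustrates that relationship using only the standard library. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code; the alternative start codons that table 11 permits are ignored here, which is an assumption that holds for this gene because it begins with ATG. The names `CODON_TABLE` and `translate_cds` are hypothetical.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their amino acids.
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(raw: str) -> str:
    """Translate an in-frame DNA fragment, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence, copied from the record.
    first_line = ("ATGGCTTTCA TCGACGCCCG AACCTTGCCT "
                  "GACGCCAGCC GGATCGAGGC CGACCTGGTG")
    print(translate_cds(first_line))  # MAFIDARTLPDASRIEADLV
```

The printed residues match the first twenty residues of the protein sequence above. Because each line of the gene sequence is 60 bases long, every line starts in frame and can be translated on its own.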