Gene Caul_3722 details

Gene Information

Locus tag: Caul_3722
Symbol:
ID: 5901178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4020469
End bp: 4022769
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 69%
IMG OID: 641564233
Product: malic enzyme
Protein accession: YP_001685347
Protein GI: 167647684
COG category: [C] Energy production and conversion
COG ID: [COG0280] Phosphotransacetylase; [COG0281] Malic enzyme
TIGRFAM ID:
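
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention. The function names are hypothetical, chosen only for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp):
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(4020469, 4022769)
print(length_bp)                  # 2301
print(protein_length(length_bp))  # 766
```

Under these assumptions both results agree with the Gene Length (2301 bp) and Protein Length (766 aa) fields of the record.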


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGAGG ATTTCCGTAA AGCCGCCCTG GACTATCACC GTCTGCCGCG GCCCGGAAAA 
CTGGCGATCG AAGCCACCAA GCGCATGGCC ACCCAGCGCG ACCTCGGCCT GGCCTACTCT
CCCGGCGTCG CCGCGCCCTG CGAAGCCATC GCCGCCAATC CGGACCTCGC CCGCGACTAC
ACCGCCCGCG GCAACCTGGT GGCCGTGATC TCCAACGGCA CCGCCGTGCT GGGCCTGGGC
AATATCGGCC CACTGGCCAG CAAGCCGGTG ATGGAAGGCA AGGCGGTTCT GTTCAAGAAG
TTCGCCGGCA TCGACGTCTT CGACCTCGAG GTCGACGCCG AGGACCCCGA CCGCTTCATC
GAGGTGGTCG CCGCCCTGGA ACCCACCTTC GGCGGCATCA ATCTGGAGGA CATCAAGGCC
CCGGAGTGCT TCATCATCGA GCGCAAGCTG CGCGAGCGGA TGAACATCCC GGTCTTCCAC
GACGACCAGC ACGGCACGGC CATCGTCTGC GCCGCCGCCG TCCGCAACGC CCTGGTCGTG
CAGGGCAAGA CCCTGAAGGA CGTTAAGCTG GTCACCTCCG GGGCCGGCGC CGCCGCCTTG
GCCTGCGTGG ACCTGCTGGT CTCGATGGGG CTGCCGGTCG AGAACGTGAC GCTGACCGAC
ATCAAGGGCG TGGTCCATGC CGGCCGCGAT CCGGACATGC TCGACAACAT GGCCCGCTAC
GCCCGCCACA CCGACGCCCG CACCCTGCCC GAAGTGCTTT CCGGCGCCGA CATCTTCCTG
GGCCTCTCGG CCCCGCGTGT GTTCAAGGCC GAGTGGCTGC CGCTGCTGGC GCCCAACCCG
CTGATCCTGG CCATGGCCAA TCCCGAGCCG GAGATCCTGC CCGAACTGGT CGTCGCCGCC
CGCCCCGACG CCATCATGGC CACCGGCCGC AGCGACTATC CCAACCAGGT CAACAATGTC
CTGTGCTTCC CGTTCATCTT CCGGGGCGCG CTCGATGTCG GCGCCTCGGA GATCAACGAG
GCCATGAAGG TGGCCGCCGT CGAGGCCATC GCCGAGCTGG CCCGGGCCGA GGCCTCCGAG
GTGGTGGCCA GCGCCTATGG CGGCGTCGCC CCGATGTTCG GCGCGCAGTA TATCATCCCC
AAACCCTTCG ATCCGCGCCT GATCCTGCAG ATCGCTCCGG CCGTGGCTCG GGCGGCCATG
GACAGCGGCG TCGCCACCCG ACCGATCGCC GACTTCGACG CCTATCGCCA GGAGCTGGAG
CTGTTCGTCT ACCGCTCGGG GCAGCTGATG CGGCCGGTGT TCGAGCGGGC CCGCAAGGCC
ACCACGGCCT CGACGATCCG GGTGGCCTAC GCCGAGGGCG AGGACGAGCG CGTGCTGCGC
GCCGTTCAGA CCGTGCTGGA CGAAGGCCTC GCCAAGCCCG TGCTGATCGG CAAGCGCGAA
ACGATCATCG CCAAGGCCTC GGAAATGGGC CTGCGGCTGG ACTTCGACAA TCGCGTCGAG
ATCCTCGACC CTTCCGCCGA CAAGGCGCTG TTCGCCCCGC TGGTGGAGCG CTACCAGGGC
CTGGTCAGCC GTCGAGGCGT GCCGCCCCTG GCCGCCGAAC GCCGGGTGAC CAACCGCCGG
ACCGTCTCGG CCTCGATGCT GCTGCAGGCC GGTCATGTCG ACGCCGCCCT GGTGGGCGGG
GCCGGCGACT GGTGGCAACA CATGACCTAT GTGCTGCCGA TCATCCCCAA GCGCGAGGAC
GTCGGACGGG TTTACGCCCT GTCGGCCCTG ATCATCGACG CCGGCACGCT GTTCTTCTGC
GACACCCACG TCAATGTCGA CCCGACCGCC GACCAGGTGG CCGAGATGAC CCTGCTGGCG
GCGGAGTCGG TGCGCCGTTT TGGGCTGACG CCCAAGGCTG CTCTACTGTC CCATTCCAGC
TTCGGGGCCA GCAACTCTCC GACCGCCCGC AAGATGCGCG AAGCCCTGGC CCTGGTGCGC
GAGCGCGCGC CCGAGCTCGA GGTCGATGGC GAAATGCACG CCGACGCCGC CCTCTCCCAG
GCCCTGCGCG ACCGCCTGGT GCACGACAGC GCCCTGAAGG GCTCGGCCAA TCTGCTGGTC
ATGCCGACTC TGGACGCGGC CAATATCGCC CTGACCTTGC TCAGCGCCGC CACCGAGGGT
CTGTTGGTGG GGCCTGTGCT GCTGGGCATG AACCGCCCGC TGCACGTGCT GACCCCCAGC
GTCACGGCCC GCGGCATCGT CAATATGACG GCCCTGGCGG TGAACCAGGC CGCCGCCGAA
CGCGAGCACC GGCTGATCTG A
 
Protein sequence
MDEDFRKAAL DYHRLPRPGK LAIEATKRMA TQRDLGLAYS PGVAAPCEAI AANPDLARDY 
TARGNLVAVI SNGTAVLGLG NIGPLASKPV MEGKAVLFKK FAGIDVFDLE VDAEDPDRFI
EVVAALEPTF GGINLEDIKA PECFIIERKL RERMNIPVFH DDQHGTAIVC AAAVRNALVV
QGKTLKDVKL VTSGAGAAAL ACVDLLVSMG LPVENVTLTD IKGVVHAGRD PDMLDNMARY
ARHTDARTLP EVLSGADIFL GLSAPRVFKA EWLPLLAPNP LILAMANPEP EILPELVVAA
RPDAIMATGR SDYPNQVNNV LCFPFIFRGA LDVGASEINE AMKVAAVEAI AELARAEASE
VVASAYGGVA PMFGAQYIIP KPFDPRLILQ IAPAVARAAM DSGVATRPIA DFDAYRQELE
LFVYRSGQLM RPVFERARKA TTASTIRVAY AEGEDERVLR AVQTVLDEGL AKPVLIGKRE
TIIAKASEMG LRLDFDNRVE ILDPSADKAL FAPLVERYQG LVSRRGVPPL AAERRVTNRR
TVSASMLLQA GHVDAALVGG AGDWWQHMTY VLPIIPKRED VGRVYALSAL IIDAGTLFFC
DTHVNVDPTA DQVAEMTLLA AESVRRFGLT PKAALLSHSS FGASNSPTAR KMREALALVR
ERAPELEVDG EMHADAALSQ ALRDRLVHDS ALKGSANLLV MPTLDAANIA LTLLSAATEG
LLVGPVLLGM NRPLHVLTPS VTARGIVNMT ALAVNQAAAE REHRLI
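
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing might be cleaned up and spot-checked against the protein sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in their permitted start codons), and that the input contains only A, C, G and T. The helper names are hypothetical.

```python
# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean_sequence(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[pos:pos + 3]] for pos in range(0, usable, 3))


def gc_percent(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks and the final line of the gene sequence in this record.
first_blocks = "ATGGACGAGG ATTTCCGTAA AGCCGCCCTG"
last_line = "CGCGAGCACC GGCTGATCTG A"

print(translate(first_blocks))  # MDEDFRKAAL
print(translate(last_line))     # REHRLI*
```

The two translated fragments match the first ten and last six residues of the protein sequence, and the final codon TGA is read as a stop. Passing the full gene sequence to `gc_percent` would give a value to compare with the GC content field; a sequence containing ambiguity codes such as N would raise a `KeyError` in `translate` as written.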