Gene Caul_4968 details


Gene Information

Locus tag: Caul_4968
Symbol:
ID: 5902430
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5370497
End bp: 5372785
Gene Length: 2289 bp
Protein Length: 762 aa
Translation table: 11
GC content: 68%
IMG OID: 641565489
Product: malic enzyme
Protein accession: YP_001686586
Protein GI: 167648923
COG category: [C] Energy production and conversion
COG ID: [COG0280] Phosphotransacetylase
        [COG0281] Malic enzyme
TIGRFAM ID:
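
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes a terminal stop codon that is not counted in the protein length. Under those assumptions the values reproduce the 2289 bp and 762 aa listed above.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length, assuming the CDS ends with one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumption).
    return gene_length // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 5370497, 5372785
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2289
print(protein_length_aa(length))  # 762
```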


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG CGGAGAAGAA GACCTTCACG GACGAGGAAG CGCTGAACTT CCACCGCTAT 
CCCACCCCCG GGAAGATCGC CATCGTGCCG ACCAAGCCGA TGGCCACCCA GCGCGATCTG
TCCCTGGCCT ATTCGCCGGG CGTCGCCGTT CCCGTCCACG CCATCGCCGC CGACCCGGAC
ATGGCCTACG AATACACCTC CAAGGGCAAC CTGGTGGCGG TGATCTCCAA CGGCACCGCG
ATCCTGGGCC TGGGCGACCT GGGCGCCCTG GCCTCCAAGC CGGTGATGGA AGGCAAGAGC
GTCCTCTTCA AGCGCTTCGG CGACGTCGAC AGCATCGACA TCGAGATCAG CTCCAAGGAC
GCCGACGAGA TCATCACGGT GGTCAAGAAC ATCGGGATCA CCTTCGGCGG CATCAATCTG
GAGGACATCA AGAGCCCCGA ATGCTTCCGC ATCGAGACCG AGCTGCAGGA GCTGCTCGAC
ATCCCGGTGT TCCACGACGA CCAGCACGGC ACGGCGATCA TCTGCGCCGC CGGCCTGATC
AACGCCTGCC ACATCACCGG CAAGAAGCTG GAAGACGTCA GGGTGGTGCT CAACGGCCCC
GGCGCGGCGG GCATCGCCTC GCTGGAACTG ATCAAGGCCA TGGGCGTGCG TCCCGAGAAC
TGCATCGCCG TCGACTCCAA GGGCGTGCTG TACCAGGGCC GCACCGAGGG CATGAACCAG
TGGAAGAGCG CCCACGCGGT CGACACCCCG CTGCGCACCC TGGCCGAAGC CGCCAAGGGC
GCCGACGTGC TGCTGGGCCT CTCGGCGAAA GGCGCCTTCA CGCCCGAGAT CATCGCCAGC
ATGGCCCCCA ACCCGGTGAT CTTCGCCATG GCCAACCCCG ATCCGGAGAT CACCCCGGAG
GAGGTCCACG CCGTGCGCAA GGACGCCATC ATGGGCACCG GCCGCAGCGA CTATCCCAAC
CAGGTCAACA ACGTCCTGGG CTTCCCCTAC ATCTTCCGCG GCGCGCTGGA CGTGCGGGCC
CGCCGCGTGA ACCATGAGAT GAAGATCGCC TGCGCCAACG CCCTGGCCAT GCTGGCCCGC
GAGGACGTGC CCGACGAGGT CGCCGCCGCC TATCACGGCC GCCAGCTGAA GTTCGGCCCG
CAATACATCA TCCCCTCGGC CTTCGACCCG CGGCTGATCT GGTACGTGCC GCCGTTCGTC
GCCCAGGCGG CCATGGACAC CGGCGTGGCC CGCAAGCCGA TCGCCGACAT GGACGCCTAT
CGCGCCAGCC TGGCCCAGCG CCTGGATCCC ACCGCCGGCT TCCTGCAGAA GATCAGCGGC
GCGGTCCTGG CCAACCCCAA GCGCATCGTG TTCGCCGAGG GCGAGGACCC CACCGTCATC
CGCGCCGCCT ACGCCTACCA GACCGGCGGC TTCGGCAAGG CCATCCTCTG CGGCCGCGAG
AACCTGGTGC ACGAGAACAT GCGGGTCGTC GGCCTCGACC CCGAGACCGC GGGCCTGGAG
ATCGTCAACG CCCGCCTCAG CGACCGCAAT CCCGACTATG TCGACGCCCT CTACGCCCGC
CTGCAGCGCC AGGGCTACCT GAAGCGCGAC GTCCAGCGCC TGATCAACCA GGACCGCAAC
AGCTTCGCCG CCTCGATGGT CACCCTGGGC GAGGCCGACG GCATGGTCAC CGGCGTCACC
CGCAGCTTCG ACCAGGCCCT GGAAGAGGTG CTGCGCGTCG TCGACCCGGC GCCCGGCGGC
CGGATCATGG GCATGTCGGT GGTGCTGGCC AAGGGCCGGA CCATCTTCGT GGCCGACACC
AACGTCACCG AGCTGCCCGA GGCCGAGGAG CTGGTCGAGA TCGCCTGCGA GGCGGCCCGC
GCCGTCACCC GCCTGGGCTT CAAGCCGCGC GTGGCCTTCA TGAGCTATTC CACCTTCGGC
AACCCGATGG GCCTGCGTTC GGAGAAGGTG CGCGAGGCCG TGGCCATGCT CGACGAGATG
GACGTCGACT TCGAATACGA GGGCGAAATG CCGCCCGAAC TGGCCCTGGA TCCCGAGCGC
CGCGCCAACT ACCCGTTCAT GCGCCTGACC GACAGCGCCA ACATCCTGAT CATGCCCGCC
ATCCACGCCG CCTCGATCTC GACCAAGCTG GTCCAGTCGC TCGGCGGGGC GACGGTGATC
GGTCCGGTGC TGCTGGGCCT GTCCAAGCCG ATCCAGATCG CGCCGTTGTC AGCGTCGGTG
TCGAAGATTC TGAACATGGC GATGATGGCG GCGTATGAGG GCGCGGGGGA TCTGGGGGCG
GCGGAGTAG
 
Protein sequence
MSDAEKKTFT DEEALNFHRY PTPGKIAIVP TKPMATQRDL SLAYSPGVAV PVHAIAADPD 
MAYEYTSKGN LVAVISNGTA ILGLGDLGAL ASKPVMEGKS VLFKRFGDVD SIDIEISSKD
ADEIITVVKN IGITFGGINL EDIKSPECFR IETELQELLD IPVFHDDQHG TAIICAAGLI
NACHITGKKL EDVRVVLNGP GAAGIASLEL IKAMGVRPEN CIAVDSKGVL YQGRTEGMNQ
WKSAHAVDTP LRTLAEAAKG ADVLLGLSAK GAFTPEIIAS MAPNPVIFAM ANPDPEITPE
EVHAVRKDAI MGTGRSDYPN QVNNVLGFPY IFRGALDVRA RRVNHEMKIA CANALAMLAR
EDVPDEVAAA YHGRQLKFGP QYIIPSAFDP RLIWYVPPFV AQAAMDTGVA RKPIADMDAY
RASLAQRLDP TAGFLQKISG AVLANPKRIV FAEGEDPTVI RAAYAYQTGG FGKAILCGRE
NLVHENMRVV GLDPETAGLE IVNARLSDRN PDYVDALYAR LQRQGYLKRD VQRLINQDRN
SFAASMVTLG EADGMVTGVT RSFDQALEEV LRVVDPAPGG RIMGMSVVLA KGRTIFVADT
NVTELPEAEE LVEIACEAAR AVTRLGFKPR VAFMSYSTFG NPMGLRSEKV REAVAMLDEM
DVDFEYEGEM PPELALDPER RANYPFMRLT DSANILIMPA IHAASISTKL VQSLGGATVI
GPVLLGLSKP IQIAPLSASV SKILNMAMMA AYEGAGDLGA AE
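
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. A minimal sketch of that cleanup, together with a GC-fraction calculation, might look like the following. The helper names are hypothetical, and the example processes only the first line of the gene sequence, so it is not expected to reproduce the 68% GC content reported for the full gene.

```python
def clean_sequence(text: str) -> str:
    """Join space-grouped, line-wrapped sequence text into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C in an uppercase DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return gc_count / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGCGACG CGGAGAAGAA GACCTTCACG GACGAGGAAG CGCTGAACTT CCACCGCTAT"
dna = clean_sequence(first_line)
print(len(dna))  # 60
print(f"{gc_fraction(dna):.1%}")
```

Passing the full gene sequence block through `clean_sequence` should yield a string whose length matches the Gene Length field, which is a quick way to detect truncated copies.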