Gene Caul_4467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4467 
Symbol 
ID5901928 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp4841333 
End bp4842913 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content69% 
IMG OID641564986 
ProductD-3-phosphoglycerate dehydrogenase 
Protein accessionYP_001686085 
Protein GI167648422 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases 
TIGRFAM ID[TIGR01327] D-3-phosphoglycerate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.0468875 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGCTC CCCGCGTCCT CATCGCCGAC AGCCTGTCCC CCGCCGCCGT GGCGATCTTC 
ACCGCCCGCG GCGTCCAGGC CGACGTCAAG ACCGGCCTGA CCAAGGCGCA GCTGCTGGAA
ATCATCGGCG ACTATGACGG CCTGGCCGTC CGCTCGGCCA CCAAGGCCGA CCCCGAGGTG
ATCGCCGCCG CCAAGAAGCT GAAGGTCATC GGCCGGGCCG GCATCGGCGT CGACAACGTC
AACATCCCGG CCGCCACCGC CGCCGGCATC GTGGTGATGA ACACCCCGTT CGGCAACTCG
ATCACCACGG CCGAGCACGC CATCGCCATG ATGTTCGCCC TGGCCCGCCA ACTGCCGGCG
GCCGACGCCA GCACCCAGGC CGGCAAGTGG GAGAAGAACC GCTTCATGGG CGTGGAGCTG
TACGCCAAGA CCCTGGGCCT GATCGGCGCG GGCAATATCG GCGGCATCGT CGCCGACCGC
GCCCTGGGCC TGAAGATGAA GGTCGTGGCC TATGACCCCT TCCTGTCGCC GGAACGCGCC
ATCGAGATCG GCGTCGAGAA GGTCGAGCTG GACGACCTGC TGGCCCGCGC CGACGTCATC
ACCCTGCACA CCCCGCTGAC CGACAAGACC CGCAACATCC TGTCGCGCGA GGCCCTGCAG
AAGACCAAGA AGGGCGTGCT GATCGTCAAC TGCGCCCGCG GCGGCCTGGT CGACGAGGTG
GCCCTGCGCG AACTGCTCGA CAGCGGCCAT GTCGGCGGCG CCGGCTTCGA CGTGTTCACC
GAGGAGCCGG CCAAGGCCAA TCCGCTGTTC GGCTCCGACC GCGTGGTGGC CACCCCCCAC
CTGGGCGCCA GCACCAACGA GGCCCAGGAG AACGTCGCCC TGCAGGTCGC CGAGCAGATG
AGCGACTACC TGCTGACCGG CGCGGTGACC AACGCGCTGA ACAGCCCGTC GATCAGCGCC
GAGGAAGCCC CCAAGCTGAA GCCGTTCGTG GCCCTGGCCG AGAAGATCGG CGCCCTGGCC
GGCCAGATGG TCGACTTCGG GATCAAGGCC ATCGACATCG CCTATGAGGG CGAGGTCGCG
AACCTCAACG TCAAGCCGAT GACCTCGGCC GCCCTGGCCG GGATCCTCAA GCCCATGCTG
GCCGAGATCA ACATGGTCTC CGCCCCGGCC GTGGCCAAGG AGCGCGGCAT CACCGTCTCC
GAGAGCCGCC AGGAGGTCAG CCCCACCTAT GACAGCCTGA TGCGCATCAC CATCACCACC
GAGAAGGGCA AGCGCGCCTT CGCCGGCACG GTGATCGCCG GCGCGCCGCG CATGGTCGAG
GTCAAGGGCA TGGAGCTGGA CGCGGGCTTC GCCCCGGCCA TGCTCTACAT CAACAACCTC
GACAAGCCGG GCTTCATCGG CGCCCTGGGC ATGCTGTTGG GCGAGGCGGG CGTCAACATC
GCCACCTTCA ACCTCGGCCG CCTGTCGGCC GACGAAGACG CCATCGCCCT GGTCGGCGTC
GATCAGGCCC CGGACGAGGC CCTTCTGGCC AAGATCCAGG CCCTGCCACA CGTCAAGGAA
GCCCGCGCGC TGACGTTCTG A
 
Protein sequence
MTAPRVLIAD SLSPAAVAIF TARGVQADVK TGLTKAQLLE IIGDYDGLAV RSATKADPEV 
IAAAKKLKVI GRAGIGVDNV NIPAATAAGI VVMNTPFGNS ITTAEHAIAM MFALARQLPA
ADASTQAGKW EKNRFMGVEL YAKTLGLIGA GNIGGIVADR ALGLKMKVVA YDPFLSPERA
IEIGVEKVEL DDLLARADVI TLHTPLTDKT RNILSREALQ KTKKGVLIVN CARGGLVDEV
ALRELLDSGH VGGAGFDVFT EEPAKANPLF GSDRVVATPH LGASTNEAQE NVALQVAEQM
SDYLLTGAVT NALNSPSISA EEAPKLKPFV ALAEKIGALA GQMVDFGIKA IDIAYEGEVA
NLNVKPMTSA ALAGILKPML AEINMVSAPA VAKERGITVS ESRQEVSPTY DSLMRITITT
EKGKRAFAGT VIAGAPRMVE VKGMELDAGF APAMLYINNL DKPGFIGALG MLLGEAGVNI
ATFNLGRLSA DEDAIALVGV DQAPDEALLA KIQALPHVKE ARALTF