Gene Caul_0654 details

Gene Information

Locus tag: Caul_0654
Symbol:
ID: 5898109
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 721018
End bp: 722406
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 72%
IMG OID: 641561136
Product: L-serine dehydratase 1
Protein accession: YP_001682285
Protein GI: 167644622
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1760] L-serine deaminase
TIGRFAM ID: [TIGR00720] L-serine dehydratase, iron-sulfur-dependent, single chain form
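
The coordinates and lengths in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(721018, 722406)
print(length, protein_length(length))  # prints: 1389 462
```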


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.344045
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.592232
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGCCT CCGTCTTCGA CCTGTTCAAA CTGGGCGTCG GTCCGTCGAG CAGCCACACC 
ATGGGGCCGA TGACGGCCGC CGGGCTGTTC GTCGGGCGCC TGCGCGACGC CGGAAAGCTG
GCGCGCACGG CCCGGGTCGA GACCCGGCTC TACGCCTCCC TGGCCCTGAC CGGCCGGGGC
CACGCCACCG ACCGGGCGGT GATCCTGGGG CTGATGGGGT TCGTGCCCGC CACGCTGGAT
CCCGACGCCG GCGAGACGGC TCTGGCGCAG GCCGCCGCCA ACCAATGGAT CCAACTGGGG
GGCGAGGTCG GGATCAAGTT CGACGCCGAG CGCGACATCG CCTGGGCCGG CCACGAGCGC
CTGCCCCAGC ACCCCAACGG GCTGTCCTTC ACCGCTTTCG ACGCCGCTGG CGCCGTACTG
GCCGAACGCA CCTATTTCTC GATCGGCGGC GGCTTCGTGC GCGACGAGAG CGAGATGGGC
CGCAACGCCC CGCCGGAGGA CGGACCGGAG ATCCCGCATC CGTTCGAGTC CGGCGCCGAC
CTGCTGCGGC GGGCCGCCGA CACCGGCCTG TCGATCGCCG GGGTCATGGG CGCCAACGAA
CTGGCCCGCA TGGACCAGGC CGAGCTCGAC GCGGGCCTCG ACCGCATCTT CGCGGCCATG
GAGGCCTGCA TCGACCGGGG CATGCGCGAG ACCGGCGTCC TGCCCGGCGG CCTGAACGTC
AAGCGCCGGG CCCGCCAGAT CCACCAGACC ATCCAGGGCC GCATGGAGCG CCAGATCAGC
GACCCATTGG CGGCCATGGA CTATGTCAAC CTGTGGGCCA TGGCGGTCAA CGAGGAGAAC
GCCGCCGGCG GCCGGGTGGT CACCGCCCCC ACCAACGGCG CGGCCGGGCT GATCCCGGCG
GTGCTGCGGT TCTTCGTGCG CTTCCACAAC GGCGCGCCGG GCCAGATCCG GGTGTTCCTG
CTGACGGCGG CGGCGATCGG CGCGCTCTAC AAGCGCAACG CCTCGATCAG CGGCGCCGAG
GTCGGCTGCC AGGGCGAGGT CGGCGTGGCC TGCTCGATGG CGGCGGCGGG GCTGGCGGCG
GCCCTGGGCG CCACCAACGA CCAGATCGAG AACGCCGCCG AGATCGGCAT GGAGCACAAT
CTGGGCCTGA CCTGCGACCC GATCGGCGGC CTGGTCCAGA TCCCCTGCAT CGAGCGCAAC
GCCATGGGCG CGATCAAGGC CATCGACGCC GCGCGCCTGG CGCTGCTGGG GGACGGGCAG
CACTCGGTGT CGCTGGACAA GGTGATCGCC ACGATGAAGC GCACCGGCGA GGACATGAGC
GAAATCTACA AGGAGACCTC GCTGGGGGGC TTGGCGGTGG GGCTGTCGGT GAACCGGGTG
GAATGCTGA
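
The GC content value listed under Gene Information can be recomputed from the gene sequence for comparison. The helper below is a minimal sketch, not the database's own method: `gc_content` is an illustrative name, and it assumes the sequence contains only A, C, G and T separated by whitespace, as in the block layout used here.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, kept in its 10-base block layout.
first_line = "ATGACAGCCT CCGTCTTCGA CCTGTTCAAA CTGGGCGTCG GTCCGTCGAG CAGCCACACC"
print(round(gc_content(first_line), 1))
```

Pasting the full gene sequence into `gc_content` gives a figure to set against the reported value.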
 
Protein sequence
MTASVFDLFK LGVGPSSSHT MGPMTAAGLF VGRLRDAGKL ARTARVETRL YASLALTGRG 
HATDRAVILG LMGFVPATLD PDAGETALAQ AAANQWIQLG GEVGIKFDAE RDIAWAGHER
LPQHPNGLSF TAFDAAGAVL AERTYFSIGG GFVRDESEMG RNAPPEDGPE IPHPFESGAD
LLRRAADTGL SIAGVMGANE LARMDQAELD AGLDRIFAAM EACIDRGMRE TGVLPGGLNV
KRRARQIHQT IQGRMERQIS DPLAAMDYVN LWAMAVNEEN AAGGRVVTAP TNGAAGLIPA
VLRFFVRFHN GAPGQIRVFL LTAAAIGALY KRNASISGAE VGCQGEVGVA CSMAAAGLAA
ALGATNDQIE NAAEIGMEHN LGLTCDPIGG LVQIPCIERN AMGAIKAIDA ARLALLGDGQ
HSVSLDKVIA TMKRTGEDMS EIYKETSLGG LAVGLSVNRV EC
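
The protein sequence can be checked against the gene sequence by translating it codon by codon. This is a minimal sketch under a stated assumption: translation table 11 is taken to use the same codon-to-amino-acid assignments as the standard genetic code, so a standard codon table is built here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon table in TCAG order (assumed to match table 11 assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence.
print(translate("ATGACAGCCT CCGTCTTCGA CCTGTTCAAA"))  # prints: MTASVFDLFK
print(translate("GAATGCTGA"))  # prints: EC
```

Both fragments agree with the start and end of the listed protein sequence; the full gene sequence can be passed to `translate` in the same way.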