Gene Caul_3608 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3608 
SymbollldD 
ID5901063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3891800 
End bp3892942 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content70% 
IMG OID641564119 
ProductL-lactate dehydrogenase 
Protein accessionYP_001685233 
Protein GI167647570 
COG category[C] Energy production and conversion 
COG ID[COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.236817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATTT CATCGACCAC CGACTTCCGC GAGGCGGCCC GCCGCCAGCT GCCGCGCTTC 
CTATTCGACT ATATCGACGG CGGCGCCTAT GCCGAGCGCA CCCTGGCCCG CAACGTCTCG
GACCTGGCCG ATATCAGCCT GCGCCAGCGG GTGCTGAAGG ACGTCTCCCG CGTCTCGACC
AGGACCACCC TGTTCGGGGT CGAGCAGACC TTGCCGGTCG CCCTGGCGCC GGTGGGCCTG
ACGGGCATGT ACGCGCGGCG CGGCGAGGTC CAGGCGGCCA GGGCGGCGGC CGCCAAGGGC
GTGCCGTTCT GCCTGTCGAC GGTGTCGGTC TGCGACCTGG CCGAGGTGTC CAGGGCCAGC
AGCGCGCCGA TCTGGTTTCA GCTCTACATG CTGCGCGATC GGGGCTTCAT GCGCGACCTG
CTGGCCAGGG CGGCCGACGC GGGCGCCACG GCCCTGGTGT TCACCGTCGA CATGCCGGTG
CCCGGCGCCC GCTATCGCGA CGCCCATTCG GGAATGACCG GTCCCAACGC GGCGATGCGG
CGACTGGTCC AGGCCGTCTT CAAGCCGGGC TGGGCTTGGG ATGTCGGCGT CATGGGCCGG
CCCCACACCT TGGGCAATGT CGCCCCGGTG CTGGGCGAGA ACACCGGGCT GGAGGACTTC
ATGGGCTGGC TGGGGGCCAA TTTCGATCCC TCGATCCAGT GGAAGGACCT CGACTGGATC
CGCGACCAGT GGAAAGGCCC GCTGATCCTC AAGGGCGTGC TCGATCCCGA GGACGCCAAG
GCCGCCGCCG ACATCGGGGC CGACGGCATC GTGGTCTCCA ACCACGGCGG CCGGCAGCTG
GATGGGGTTC TGTCCTCGGC CCGCGCCTTG CCGGACATCG CCGAGGCGGT GGGCGACCGC
CTGACCGTGC TGGCCGACGG CGGCGTGCGC TCGGGCCTGG ACGTGGTGCG GATGCTGGCC
CTGGGAGCCA AGGGCGTGCT GCTGGGCCGG GCCTTCGTCT ACGCCCTGGC GGCGCGCGGC
GGGCCGGGCG TGAGCCAACT GCTCGACCTG ATCGAGAAGG AGATGCGAGT GGCCATGGCC
CTGACCGGCG TCAACACCCT GGACCAGATC GACCGTTCGA TCCTGGCGAA GACCGACCGA
TGA
 
Protein sequence
MIISSTTDFR EAARRQLPRF LFDYIDGGAY AERTLARNVS DLADISLRQR VLKDVSRVST 
RTTLFGVEQT LPVALAPVGL TGMYARRGEV QAARAAAAKG VPFCLSTVSV CDLAEVSRAS
SAPIWFQLYM LRDRGFMRDL LARAADAGAT ALVFTVDMPV PGARYRDAHS GMTGPNAAMR
RLVQAVFKPG WAWDVGVMGR PHTLGNVAPV LGENTGLEDF MGWLGANFDP SIQWKDLDWI
RDQWKGPLIL KGVLDPEDAK AAADIGADGI VVSNHGGRQL DGVLSSARAL PDIAEAVGDR
LTVLADGGVR SGLDVVRMLA LGAKGVLLGR AFVYALAARG GPGVSQLLDL IEKEMRVAMA
LTGVNTLDQI DRSILAKTDR