Gene Caul_4935 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_4935 
Symbol 
ID5902397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5332797 
End bp5334008 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content68% 
IMG OID641565455 
Productthreonine dehydratase 
Protein accessionYP_001686553 
Protein GI167648890 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1171] Threonine dehydratase 
TIGRFAM ID[TIGR01127] threonine dehydratase, medium form 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCGCA TGACTCTGAC CCTGGACCAC ATCCGCGCCG CCGCTAGCCG CCTCGCCGGC 
CAGATCGAGC GCACCCCGTG CCGCTATTCC AAAACGCTGT CGAAGATCAC CGGCGCGGAA
GTCTGGGTGA AGTTCGAGAA CCTGCAGTTC ACGGCGGCTT ACAAGGAGCG CGGCGCGCTC
AACAAGCTGA TGCTGCTGTC CGACGCCGAA AAGGCCAAGG GCGTCATCGC GGCCAGCGCC
GGCAACCACG CCCAGGGCCT TGCCTATCAC GGCGCCCGCC TCGGCGTGCC GGTGACCATC
GTCATGCCCA GGACCACCCC GTTCATCAAG GTGCAGCACA CCCGCGACTT CGGGGCGACC
GTGGTGATCG AGGGCGAGAC CTATGACGAC GCCAACGCCC ATGCCCGCAA GCTGCAGGAA
GAGCAGGGCC TGACCTTCGT CCATCCGTTC GACGACTACG ACATCATGGC CGGCCAGGGC
ACCATCGCCC TGGAGATGCT GGAAGACGCC CCCGACCTGG AGATACTGCC GGTGCCGATC
GGCGGCGGCG GCCTGATCAG CGGCGTGGCG ACGGCCGCCA AGGCGGTCAA GCCCGACATC
CGGATCATCG GTTGCGAACC GGCCATGTAT CCATCCTTCA CCGCCAAGAT GCGCGGCGTC
GCGGCCCATT GCGGCGGCCA GACCATCGCC GAGGGCGTGG CGGTCAAACA GGTCGGCGAG
CTGACCTACG GCGTCGCCCG GCCGTTGATC GACGACGTGT TGCTGCTGGA AGAACCGCAC
ATCGAGCAGG CCGTTGCGCT GTACTGCAAC GTCGAGAAGA CCATCGCCGA GGGCGCCGGC
GCGGCCTCCC TGGCCGCCCT GCTGGCCTAC CCCGAGCGGT TCCGCGGCAA GAAGTGCGGT
TTGATCCTCT GCGGCGGCAA CATCGACACC CGCCTGCTGG CCTCGGTGCT GACCCGCGAA
CTGGTCCGCG CCCAGCGGCT GGTCAGCTTG CGCATCGTCG GCGACGACCG GCCGGGCCTG
TTGTCGACCG TGGCCAACGT CATTGGCACG GCCGGCGCCA ACATCATCGA GGTCAACCAC
AACCGCCTGG CCCTGGACGT GCCGGCCAAG GGCGCGGAGT TCGACATCAC CATCGAGACC
CGCGACGCCC AGCACACCCA GGAGGTCATG GACGCCCTGC GCGAGAAGGG CTATCCGCCG
CGCGCGGTGT GA
 
Protein sequence
MLRMTLTLDH IRAAASRLAG QIERTPCRYS KTLSKITGAE VWVKFENLQF TAAYKERGAL 
NKLMLLSDAE KAKGVIAASA GNHAQGLAYH GARLGVPVTI VMPRTTPFIK VQHTRDFGAT
VVIEGETYDD ANAHARKLQE EQGLTFVHPF DDYDIMAGQG TIALEMLEDA PDLEILPVPI
GGGGLISGVA TAAKAVKPDI RIIGCEPAMY PSFTAKMRGV AAHCGGQTIA EGVAVKQVGE
LTYGVARPLI DDVLLLEEPH IEQAVALYCN VEKTIAEGAG AASLAALLAY PERFRGKKCG
LILCGGNIDT RLLASVLTRE LVRAQRLVSL RIVGDDRPGL LSTVANVIGT AGANIIEVNH
NRLALDVPAK GAEFDITIET RDAQHTQEVM DALREKGYPP RAV