Gene Caul_3043 details

Gene Information

Locus tag: Caul_3043
Symbol:
ID: 5900498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3307670
End bp: 3308743
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 74%
IMG OID: 641563545
Product: thiamine-monophosphate kinase
Protein accession: YP_001684668
Protein GI: 167647005
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0611] Thiamine monophosphate kinase
TIGRFAM ID: [TIGR01379] thiamine-monophosphate kinase


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.426157
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTCCAG AAACTGACGA CGACTGGTTC GACGGACCGA CGCCGGAGAC GGCGGCGGCC 
GACGACGCGT GGTTCGACGA GGCCGTCACG CCCGCCGCCG CGCCGCCGGT CGACGAGTTC
GGCCTGATCG AGCGGTTGCT GCGGCCGCTT ACGCGGGGCG ACGCCGCAGC GCTGAACCTG
CTGGACGACG CCGCCGTGCT GCCCTCGCGC CCCGGCTATG ACCTGGTGAT CACCAAGGAC
GCCATGGTGG CCGGCGTGCA TTTCCTGGCC GGCGAGGACC TGGACGTGGT GGCCAAGCGG
CTGCTGCGGA CCAACCTCTC CGACCTCGCC GCCAAGGCCG CCGTGCCCTA CGGCTACTTC
CTGGCGGTGG GCTGGCCTTC GGGCACCACC CTGACCGACC GCGAGACCTT CGCGCGCGGC
CTGGCCGAGG ATGGCGAGCT CTACGACGTC AACCTGCTGG GCGGCGACAC GGTCACCACC
TCGGGTCCGA TGGTGGTCTC GGCCACCTTC CTGGGCTGGA CGCCCAGCGG CGACGCGGTG
TTGCGCAAGG GCGCGCGGGT CGGCGACCGG CTGATGGTCA GCGGCACGAT CGGCGACGGC
TGGCTGGGCC TGCTGGCCCA CTGGGGCGAG GTCGAGGATC CCGATGGCGG CCTGGTGCGG
CGCTATCGCC TGCCGCCGCC GCGCCTGTCG ATCCGCGACG CCCTGCGCGC CTACGCCCGG
GCCGCCGCCG ACGTCTCCGA CGGCCTGCTG GCCGACGCCG CCCACGTGGC CAAGGCCAGC
GGCCTGCGGG TCAAGGTCGA TCTCGACCGC CTGCCGCTGT CGCCCGGCGC CCGCAACTGG
CTCGGCGCCC AACCGGAGGC CGGCGAGGCG CGGATGTCCC TGGCCTCGGG CGGCGACGAC
TACGAGATCG TCTGCGCCGT CGATCCCACC GACGTGGCCG CCTTCCAGGC CGCCGCCATG
GCCGCCGGCG TGCCCGTGCG CGACATCGGC GAGTTCGTGG AAGGCGAGGG GATCTGCGCC
CTGTTCAAGG GCAAGGACAT CACGCCCGAA CGGCTGGGCT GGCTGCACGG CTGA
 
Protein sequence
MTPETDDDWF DGPTPETAAA DDAWFDEAVT PAAAPPVDEF GLIERLLRPL TRGDAAALNL 
LDDAAVLPSR PGYDLVITKD AMVAGVHFLA GEDLDVVAKR LLRTNLSDLA AKAAVPYGYF
LAVGWPSGTT LTDRETFARG LAEDGELYDV NLLGGDTVTT SGPMVVSATF LGWTPSGDAV
LRKGARVGDR LMVSGTIGDG WLGLLAHWGE VEDPDGGLVR RYRLPPPRLS IRDALRAYAR
AAADVSDGLL ADAAHVAKAS GLRVKVDLDR LPLSPGARNW LGAQPEAGEA RMSLASGGDD
YEIVCAVDPT DVAAFQAAAM AAGVPVRDIG EFVEGEGICA LFKGKDITPE RLGWLHG
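
Consistency check (illustrative)

The coordinate and length fields in this record are related by simple arithmetic, which can be sketched as below. This is a minimal sketch and not part of the original record: the function names are hypothetical, coordinates are assumed to be 1-based and inclusive, and the CDS is assumed to be unspliced and to end in a single stop codon that is not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace in the sequence is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Start bp and End bp are taken from the Gene Information fields above.
    length = gene_length(3307670, 3308743)
    print(length)                           # 1074
    print(expected_protein_length(length))  # 357
```

With the record's values, the computed gene length (1074 bp) and protein length (357 aa) match the listed fields. Passing the gene sequence block to gc_percent gives a way to compare against the listed GC content, which appears to be rounded to a whole percent.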