Gene Plut_1928 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_1928 
Symbol 
ID3744420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp2141835 
End bp2142899 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content62% 
IMG OID637769952 
Productthiamine-monophosphate kinase 
Protein accessionYP_375813 
Protein GI78187770 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACACA AAACAACCGC TGAGGTCGGA GAGTTCGGCC TCATCGACAG AATCTCCGCC 
ATCGTCAAGC CCACCCTTGC GGCCTCACCC GGCCTCATCG CCGGCATCGG CGACGACTGC
GCCGTCTGGC ATCCGGCGGC GGATATGACT GAAGTGGCTT CAACCGACCT GCTTTTGGAG
CAGGTCCACT TCGACCTCCT CACCACCCCC GTGAAGCACC TCGGAAGCAA GGCGATCAGC
GTCAATGTCT CCGACATCTG CGCCATGAAC GCCATACCGC GCCACGCCCT CGTCTCCATA
GCCGTGCCGC CGTCATTTCC AGTCGAAATG ATCGAGGAGC TCTACCGAGG CATGGAGGCC
GCCGCCCGGG AGTACGGGAT CGCCATCGTC GGTGGCGACA CCTCGCGCTC ACCTTCCGGG
CTCGTGCTGT CCGTCACGGT AACAGGGGAA GCCGAAGAGG CCAACATAAC CTACCGGAAA
GGGGCCGAAC CGGGAGACCT CGTCTGCCTC ACCGGCACAC TCGGCGGCTC GGCGGCCGGC
CTCAGGGTCC TGACGCGCGA AAAGCTCATC ATGATGGAGC ATATCGAACA CAATGAAGCG
TACGAAGGCA GCATCATGGC CGACCTGAAG GAGTACAGCG GCGCCATTCA GCAGCACCTG
CTCCCGCTTG CACGGCTCGA CATCGTCCGC TTCCTCCACG AACGCCGGGC GCACCCCTCG
GCCATGATTG ATGTGTCCGA CGGACTCGGC CAGGACCTCG GCCACATCTG CAGCGCATCG
GGTACCGGCG CGCTCCTGCA GGAGAACCGC ATCCCGGTCA ACTCCACCGC CCGGCTGATT
GCCGACGAGC TGCAGGACGA CGCCCTCGGC TGGGCGATAG GCGGAGGAGA AGACTACCAG
CTGCTCTTCA CCATGCCACA TGAGGAATAC CAGAAGATCG CCGACAACCG CGACATCTCG
GTAATCGGCG AAATCACCCC CAAGGAACAG GGCATACTGC TCAAAGACAT CTACGGTATC
GAAATAGACC TCCAATCGCT CCCCGGATTC GACCACTTCC GATGA
 
Protein sequence
MTHKTTAEVG EFGLIDRISA IVKPTLAASP GLIAGIGDDC AVWHPAADMT EVASTDLLLE 
QVHFDLLTTP VKHLGSKAIS VNVSDICAMN AIPRHALVSI AVPPSFPVEM IEELYRGMEA
AAREYGIAIV GGDTSRSPSG LVLSVTVTGE AEEANITYRK GAEPGDLVCL TGTLGGSAAG
LRVLTREKLI MMEHIEHNEA YEGSIMADLK EYSGAIQQHL LPLARLDIVR FLHERRAHPS
AMIDVSDGLG QDLGHICSAS GTGALLQENR IPVNSTARLI ADELQDDALG WAIGGGEDYQ
LLFTMPHEEY QKIADNRDIS VIGEITPKEQ GILLKDIYGI EIDLQSLPGF DHFR