Gene Cag_1213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1213 
Symbol 
ID3748247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1612181 
End bp1613290 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content52% 
IMG OID637773747 
Productthiamine-monophosphate kinase 
Protein accessionYP_379518 
Protein GI78189180 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0611] Thiamine monophosphate kinase 
TIGRFAM ID[TIGR01379] thiamine-monophosphate kinase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.00911735 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACTGA ATGGAGAATT CACCCTTATT GATACCATAG CGCACCTTGT GCAACCAACG 
CTTGCCAATG CCCCCACACT GCTGCAAGGT ATTGGCGACG ATTGCGCTAT TATGCAACCA
ACCGCCGGCA TGGTAGAAGT TGCCACTACC GACCTTTTAG TTGAATCGGT ACACTTCGAC
CTCTTAACCA CGCCACTCTC GCATCTCGGC AGCAAAGCCA TAAGCGTTAA CGTCTCCGAC
ATTTGCGCTA TGAATGCCCT ACCACGCTAC GCGTTAGTGA GCCTTGCCCT TCCTCCTACC
TTTTCTAAAA AAATGGTGGA AGAACTCTAT GGCGGTATGG TACACGCCGC TCAAGCCTAC
GGTATTGCAA TTGCAGGCGG CGATACCTCC GCCTCTCGTT CGGGACTGAT GATCTCCATT
ACTGCAATTG GCGAAGCATT GCCCACGCAA CTCACGCGTC GTAGCGGTGC TCAACTTGAC
GACTTGCTTT GCGTTACGGG CACCTTAGGT GGTTCAATGG CTGGGCTCAA GCTCTTAATG
CGCGAAAAAG AGATTATGTT AGAACACTTG CGCAATAACG AACCTGTTAA CCGCAATCTC
TTAGCGGATT TAGATGAGTA TCGAGAGTTA ATGCAGCGCC ACCTACTACC AACCGCCCGC
CTCGACGTTG TGCGCCTTTT CCACCGCCTT GGCATACAAC CCACCGCCAT GATTGATATT
TCCGATGGAC TCAGCTCCGA AGTGCAACAC ATCTGCCGCC ATTCCAACTG CGGAGCGTTG
CTACACGAAA GCCGCATTCC CATCCACGCC ACCACACGCC AACTTGCCGA CGAAATGCAA
GAAGAGCCGC TAACATGGGC ACTAACGGGC GGCGAAGAAT ACCAGCTCCT TTTTACGCTC
CCCGAAGCCA CCTACCAGCA ACTTGCCCAC GAGCGCGACA TACACGTTAT TGGCACCATC
ACGCCCACCA ATGAAGGCAT GGTGCTTGAA GAGATGTTTG GCATTCGCAT TGACCTTACC
ACCATTCACG GCTTTGACCA TTTTGCTCCA TCAGGTAATG ACGATGGTAA CACGGAAAAT
GAGGAAGAGG AGTTTGAGGA TGGCGTGTAA
 
Protein sequence
MPLNGEFTLI DTIAHLVQPT LANAPTLLQG IGDDCAIMQP TAGMVEVATT DLLVESVHFD 
LLTTPLSHLG SKAISVNVSD ICAMNALPRY ALVSLALPPT FSKKMVEELY GGMVHAAQAY
GIAIAGGDTS ASRSGLMISI TAIGEALPTQ LTRRSGAQLD DLLCVTGTLG GSMAGLKLLM
REKEIMLEHL RNNEPVNRNL LADLDEYREL MQRHLLPTAR LDVVRLFHRL GIQPTAMIDI
SDGLSSEVQH ICRHSNCGAL LHESRIPIHA TTRQLADEMQ EEPLTWALTG GEEYQLLFTL
PEATYQQLAH ERDIHVIGTI TPTNEGMVLE EMFGIRIDLT TIHGFDHFAP SGNDDGNTEN
EEEEFEDGV