Gene Information |
Locus tag | Csal_2179 |
Symbol | |
ID | 4026673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 2452017 |
End bp | 2452961 |
Gene Length | 945 bp |
Protein Length | 314 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637967384 |
Product | hypothetical protein |
Protein accession | YP_574229 |
Protein GI | 92114301 |
COG category | [F] Nucleotide transport and metabolism; [H] Coenzyme transport and metabolism |
COG ID | [COG0352] Thiamine monophosphate synthase; [COG1051] ADP-ribose pyrophosphatase |
TIGRFAM ID | [TIGR00586] mutator mutT protein; [TIGR00693] thiamine-phosphate pyrophosphorylase |
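The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon. Neither is stated in the record, but both agree with the listed 945 bp and 314 aa. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 2452017
END_BP = 2452961


def feature_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length_bp // 3 - 1


gene_length = feature_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 945 314
```

Both results match the Gene Length and Protein Length rows, as expected for a gene that is neither spliced nor a pseudogene.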
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.299804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGTGAAGA GAAGGGTACA CGTCGCGGCG GCTGCCATCA TTCGTGAGGA TGGGCATGTG CTGCTCGCAC GCCGCCCCTC GATCGTCGAT CAGGGTGGCT TGTGGGAATT TCCCGGTGGC AAGCTGGCGC CCTATGAAAC CGGCTTCGAA GCCCTCCGGC GGGAATTGCG CGAAGAGCTC GGTATCGAGA TACAGCGGGC CCAGCCGCTC ATTCGCGTCC ACCATGAATA CGAAGACAAG CGTATCCTGC TCGATGTCTG GCAGGTGCAT GCCTTCGAGG GCGAACCCTT CGGCCGTGAG GGCCAGGCGG TGCGCTGGGT GCCCCAGGAA GAGTTGAACA ATTATCCCTT CCCGGAGGCG AACCATGCGA TCCTGCGTGC GGTGTGCCTG CCTCACGATT ACCTGATCAG CGACGAGGAA GACGACGATG CGGTCTTCCT CGGCAAGCTC GAGCGGGCTC TCCGCGACGA TGGGGTTCGC CTGGTGCAGC TGCGGGCCAA GTCGCTGGAT GAAGATGCCT ACCTGAAGCG TGCCGAGCAG GCGCTGGCCC TCTGTCGTCG CTACCAGGCG CGCTTGATCC TCAACGGTGA CCCTGCGCTG CTCGAGCATG TCGATGCCGA TGGTGTCCAC CTGCCCAGCC GCACCTTGAT GGCGCTCGAG CATCGCCCCA TCGCGACCGG CAAGTGGCTG GCCGCCTCGA CCCACAACCC CGAGCAGCTC GCCCAGGCCG CGACGATCGG CTGTGATTTC GTGACCTTCT CGCCATTGCG GATCACGCCG AGCCACCCCG ATGCCGCGCC CGTGGGCTGG CACGATTTCC AGCAACTGGT CGAGACGGCC GCCATGCCGG TCTTCGCGCT GGGGGGCGTG ACGCGTGGTG ATATCGATCA GGCGCGTGCA GTGGGCGCGC AGGGGATCGC CTCGATCCGC GATCTCTGGA AATGA
|
Protein sequence | MVKRRVHVAA AAIIREDGHV LLARRPSIVD QGGLWEFPGG KLAPYETGFE ALRRELREEL GIEIQRAQPL IRVHHEYEDK RILLDVWQVH AFEGEPFGRE GQAVRWVPQE ELNNYPFPEA NHAILRAVCL PHDYLISDEE DDDAVFLGKL ERALRDDGVR LVQLRAKSLD EDAYLKRAEQ ALALCRRYQA RLILNGDPAL LEHVDADGVH LPSRTLMALE HRPIATGKWL AASTHNPEQL AQAATIGCDF VTFSPLRITP SHPDAAPVGW HDFQQLVETA AMPVFALGGV TRGDIDQARA VGAQGIASIR DLWK
|
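The sequences above are printed in space-separated blocks of ten characters, so they need to be ungrouped before any length or composition check. A minimal sketch follows, using hypothetical helper names `ungroup` and `gc_fraction`. It computes GC content simply as the fraction of G and C bases, which may differ slightly from how the database rounds its GC content field.

```python
def ungroup(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (sketch; ignores ambiguity codes)."""
    bases = ungroup(seq)
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


# First three 10-base blocks of the gene sequence shown above.
head = "ATGGTGAAGA GAAGGGTACA CGTCGCGGCG"
print(len(ungroup(head)))  # 30
```

Passing the full Gene sequence field to `ungroup` and `gc_fraction` gives values to compare against the Gene Length and GC content rows.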