Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1272 |
Symbol | |
ID | 4027745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1450886 |
End bp | 1452376 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637966451 |
Product | hypothetical protein |
Protein accession | YP_573326 |
Protein GI | 92113398 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGCGG CATCGCCAGT GACACATGCC CTGTACGGGG CGGAGCAGGT TCGGACCCTG GATCGTCGCA TCATCGACGC CGGCGTGGCG GGGTTCGACT TGATGCAGCG AGCATCCCAG GCCGCCTATG ACGTACTGCG TGCACGATGG CCCGGTGCGC GTCGGCTCAC GGTGTTGTGC GGGGGCGGCA ACAATGCCGG TGACGGCTAC GTGATCGCGG CGTTGGCGGT GTGCGATGGC CTCGATGTAC AACTGGTGGC CCTGCGTGAT CCGGCATGCC TGACGGGCGA CGCGGCACGG GCATGCGCGC TGGCGCGGCG GGCCGGTGTG ACGCCGGTTG CCTGGCGGGA AGGCATGACG CTGGATGGCG AGGTGCTGGT CGATGCGTTG CTGGGGACCG GGGCCTCGGG AGAGGTGCGC GACCCCTTGC GGAGCGCGAT TCTGGCGATC AATGCCACGC GGCGCCCGGT ACTGGCGGTG GACGTGCCCT CCGGGCTGTC GGCACAGACC GGTGGCATCG GAGGCGTCGC GGTACACGCC ACGGTGACCG TCACGTTCAT TGCCGACAAG TTCGGGCTGC ATACCGGCGC CAGCGCGGAT CATGTCGGCG AACTGGTCGT CGAGAGCCTG GGCACGGCCC CGGAGACCCA TGGCGACCTG GTGCCGCTGG GCGAGTTGCT GGCGGCCGAG CAATTGCAGG CGGCGCTGCC GCCACGTGCC CGAGGCAGCC ACAAGGGTGA CTTCGGGCAT CTGCTGGTGG TGGGGGGCGC CGTGGGGTTC GGCGGCGCGG CGTTGATGGC CTGCGAGGCC GCATTGCGCA TGGGGTCGGG CAAGGTCAGC CTGGCCACCG ATGCGGCGCA TGTCGCCGCG AGCCTGGTGC GCAGCCCCGA GGTAATGGCG CGTGGCGTGC AGGCGGCGGC CGAGGCACAG CCGCTTCTTG CTCAGGCCGA CGCCCTGGTC GTCGGTCCCG GTCTGGGGCG TGATGCCTGG GGGAAGGCGT TATGGCGGCT GGCACTCGAT GCCGCCGTGC CCAGCGTCCT CGATGCCGAT GCTCTCAACC TGCTGGCCGA GGAGGCGCGC GATCGCGACG ACTGGGTGCT GACGCCACAT CCCGGCGAGG CTGCACGCCT GCTGGGCAGC ACCACCGCCG AGGTCCAGGC GGACCGTCGG GCAGCGGTGC TGGCGCTGCG CGAGCGTTAT GGCGGCAGTG TGGTACTCAA GGGCGCGGGA ACCTTGATTG CCGACGCAGA TGGGGTCGCC GTGTGTCCGT ATGGCAATCC CGGCATGGCC AGCGGGGGCA TGGGGGATGT GCTGTCGGGC GTCATCGGGG CGCTGCTGGG GCAGGGCAGA ACGCCGGGGG AGGCTGCACG CCTGGGGGTC CTGATCCATG CGCTGGCCGG TGACGCGGCG GCCCGGGCCG GGGGCGAGCG TGGCCTGGTC GCCACGGATC TGGCATCCTA TGTGCGCGTG ATCGCCAACC CACGATCCTG A
|
Protein sequence | MTAASPVTHA LYGAEQVRTL DRRIIDAGVA GFDLMQRASQ AAYDVLRARW PGARRLTVLC GGGNNAGDGY VIAALAVCDG LDVQLVALRD PACLTGDAAR ACALARRAGV TPVAWREGMT LDGEVLVDAL LGTGASGEVR DPLRSAILAI NATRRPVLAV DVPSGLSAQT GGIGGVAVHA TVTVTFIADK FGLHTGASAD HVGELVVESL GTAPETHGDL VPLGELLAAE QLQAALPPRA RGSHKGDFGH LLVVGGAVGF GGAALMACEA ALRMGSGKVS LATDAAHVAA SLVRSPEVMA RGVQAAAEAQ PLLAQADALV VGPGLGRDAW GKALWRLALD AAVPSVLDAD ALNLLAEEAR DRDDWVLTPH PGEAARLLGS TTAEVQADRR AAVLALRERY GGSVVLKGAG TLIADADGVA VCPYGNPGMA SGGMGDVLSG VIGALLGQGR TPGEAARLGV LIHALAGDAA ARAGGERGLV ATDLASYVRV IANPRS
|
| |