Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2221 |
Symbol | |
ID | 4026413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2493962 |
End bp | 2494942 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637967426 |
Product | KpsF/GutQ family protein |
Protein accession | YP_574271 |
Protein GI | 92114343 |
COG category | [K] Transcription [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0794] Predicted sugar phosphate isomerase involved in capsule formation [COG2524] Predicted transcriptional regulator, contains C-terminal CBS domains |
TIGRFAM ID | [TIGR00393] KpsF/GutQ family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACCG TCACCGATCA TGATTACCGC GCCAGCGCCC GCCGTACGCT GACCCTCGAA TCGCATGCCG TGGCCGCCTT GATCGAACGC CTCGACGAAG CGTTCGATCA CGCCTGCCAG CTATTTCTCG CCTGCGAGGG ACGCATCATC GTCACCGGCA TGGGCAAGTC CGGGCATATC GCCCGCAAGA TTGCCGCGAC CCTGGCCAGC ACGGGCACGC CGGCGTTTTA CGTTCACCCC GGCGAGGCCA GCCATGGCGA CATGGGCATG ATCACCGCGC GCGACGTGGT CCTGGCGCTG TCCAATTCCG GCGAGACCGC CGAGGTCACG GCGCTGCTGC CGCTTCTCAA GCGCATGGGC ACCCCTCTGG TCAGCATGAC CGGGCGCCCG GGCTCGAGCC TGGCGCGGCA TGCCGAAGCT CACCTGGACA CCGCGGTGGA TCGCGAGGCG TGCCCGCTCG ACCTGGCCCC GACCGCGTCG ACCACTGCCG CCCTGGCCAT GGGCGATGCC CTGGCGGTGG CCTTGCTCGA GGCACGCGGC TTCACCGCCG AGGATTTTGC CCTGTCGCAT CCCGGCGGTA GCCTGGGCCG GCGATTGCTG CTCAAGGTCG AAGACCTCAT GCATCAGGGC GATCGCCTGC CCCGGGTGGC GCTGGGCAGC CCACTGCGCG ATGCCTTGCT GGAGATCACG CGTCAGGGCC TGGGATTCAC CTGCGTGCTC GACGAGGACG GCCGCCTCGC CGGGGTCTAC ACCGACGGCG ACCTGCGTCG CACCCTCGAC CATCATGACG ATCTGCGCCA GCTGCGCGTG GACGACGTCA TGACCCACGG CGGCAAGACG ATTCGTCCTC AATTGCTGGC TGCCGAGGCG GTCAAGATCA TGGAAGACAA TCGCATCACA GCCCTGGCCG TGGTCGACGA CCAGGGCCAT CCGGTCGGCG TCCTGCACAT GCACGACCTG CTGGCCAGCG GCGTCATCTG A
|
Protein sequence | MNTVTDHDYR ASARRTLTLE SHAVAALIER LDEAFDHACQ LFLACEGRII VTGMGKSGHI ARKIAATLAS TGTPAFYVHP GEASHGDMGM ITARDVVLAL SNSGETAEVT ALLPLLKRMG TPLVSMTGRP GSSLARHAEA HLDTAVDREA CPLDLAPTAS TTAALAMGDA LAVALLEARG FTAEDFALSH PGGSLGRRLL LKVEDLMHQG DRLPRVALGS PLRDALLEIT RQGLGFTCVL DEDGRLAGVY TDGDLRRTLD HHDDLRQLRV DDVMTHGGKT IRPQLLAAEA VKIMEDNRIT ALAVVDDQGH PVGVLHMHDL LASGVI
|
| |