Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1023 |
Symbol | |
ID | 4027869 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1154158 |
End bp | 1155165 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637966200 |
Product | hypothetical protein |
Protein accession | YP_573079 |
Protein GI | 92113151 |
COG category | [S] Function unknown |
COG ID | [COG3802] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.198151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACTCG CACAAGCCCG GAACGACAAC GGCGAGCGTT TCGTCCTGGT CATCGAGGAC GATCCCACCC AGGCGCGTCG CCTGGATGAC GTGAACAGCG TGCATGCCCT GGCGCTCACC GCGCTCGATC GCAAGGTCGG GCTCGCCGCG CTGATCAAGG AATGGCCCGG GGATGCACGT GTCGATGCCG TGGCCCTGGC CGACAAGGGG CAACTGCTGG CGCCGCTGGA TCATCCCGAC CCGGCGCACT TGCTGGTCAC CGGCACCGGG CTGACGCACC TGGGCAGCGC CGAGGGCCGC GACAAGATGC ATCGTCAGAT GAACGAAGCG GATCAGAGCG ACCAGCCGCT GACCGATTCG ATGAAGATGT TCCGCAAGGG GCTGGAAGGC GGCAAGCCCG CCCCCGGCGA TTTCGGCGTG CAACCCGAGT GGTTCTACAA GGGCGACGGC GGCATCGTCA TGGCGCCGGG CGAGGCGCTG CGCATGCCGC ATTTCGCGCT GGATGGCGGC GAGGAACCGG AAGTCGCCGG TCTCTACCTG ATCGACAACG ACGGCGTGCC ACGGCGCATC GGCTATGCGC TGGGCAACGA GTTCTCGGAC CACGTCACCG AGCGCGAGAA CTACCTGTAC CTGGCGCATT CCAAATTGCG CGCCTGCTCT TTCGGCCCGA CGCTGCTGAT CGACGAGTTG CCCGATGACG TGCAAGGCAC CTCACGCATT CGTCGGGGCG ACGAGGTGCT GTGGGAAAAG CCGTTCATCT CCGGCGAGGC CAACATGTGC CACACCCTGG CCAATCTCGA GGCGCATCAC TTCAAGTATG CGCAGTTCTG CCGTCCGGGC GATGTCCATG TGCACTTCTT CGGCACCGCG ACGCTGAGCT TCAGCGACGG CATCATTCCC CAGCCCGGTG ATGTCTTCGA GATCGACGCC AAGCCGTTCG CCCTGCCGCT GCGCAATCCG CTTTCCAAGG ACGCCGTCGC GCCGGCACCA AGCGTCAAGC CGCTGTAA
|
Protein sequence | MQLAQARNDN GERFVLVIED DPTQARRLDD VNSVHALALT ALDRKVGLAA LIKEWPGDAR VDAVALADKG QLLAPLDHPD PAHLLVTGTG LTHLGSAEGR DKMHRQMNEA DQSDQPLTDS MKMFRKGLEG GKPAPGDFGV QPEWFYKGDG GIVMAPGEAL RMPHFALDGG EEPEVAGLYL IDNDGVPRRI GYALGNEFSD HVTERENYLY LAHSKLRACS FGPTLLIDEL PDDVQGTSRI RRGDEVLWEK PFISGEANMC HTLANLEAHH FKYAQFCRPG DVHVHFFGTA TLSFSDGIIP QPGDVFEIDA KPFALPLRNP LSKDAVAPAP SVKPL
|
| |