Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0034 |
Symbol | |
ID | 4026377 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 41655 |
End bp | 43862 |
Gene Length | 2208 bp |
Protein Length | 735 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637965186 |
Product | hypothetical protein |
Protein accession | YP_572098 |
Protein GI | 92112170 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [S] Function unknown |
COG ID | [COG1765] Predicted redox protein, regulator of disulfide bond formation [COG1944] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR00702] uncharacterized domain [TIGR03549] conserved hypothetical protein TIGR03549 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAATCA AGGTCAACTT TCTCGAAAAC CTGCGGCTGG CGGCCAAGTT CGACGACTTC ACCGTCGAGA CCGACCAGCC CATTCGCTAC AAGGGCGACG GCTCGGCGCC GAGTCCGTTC GACTACTTCC TGGCCTCCTC GGCGTTGTGT GCGGCTTATT TCGTGCGCTT GTACTGCAAC GCGCGGGACA TCCCTACCGA GAACATCCGG CTGTCGCAGA ACAACATCGT CGACCCCGAG AATCGCTACA ACCAGATCTT CAAGATCCAG GTGGAGCTGC CGGAGGACCT TTCCGAAAAG GACCGCACCG GTATCCTGCG CGCGGCGGAG CGCTGCAGCG TCAAGCGCGT GATCCAGAAC GCGCCTGAGT TCCAGATCGA GACGGTCGAG AACATCGACG AGGACGCCCA GGCGCTGTTG ATGGGCACGA ACGCCAAGGA CGGCGACGCA GGCGGCGAAA CCTGGATCGA GGGCAAGGAC CTGCCGCTGG AGCGCACCAT CGCCAACATG ACCCGGATCC TCGAGGATCT GGGCATGAAG ATCGAGATCG CCTCGTGGCG CAATATCGTG CCCAACGTCT GGTCGCTGCA TCTGCGCGAC GCGGCCTCGC CGATGTGTTT CACCAACGGC AAGGGCGCGA CCAAGGAAGC GGCGCTGTGC TCGGCGCTGG GCGAGTTCAT CGAGCGCCTC TCCTGCAACT TCTTCTACAA CGACCAGTTC TTCGGCGAGG CGATCGCCGA TAGTGCCTTC GTTCACTATC CCAGCGAGCG CTGGTTCCCG CTGGAAGATG ATGACGCGCT GCCCGCCGGG CTGCTCGACG CACATTGTCG GGCGATCTTC GATCCGGACG GCGAACTGCG CGGCTCGCAC CTGATCGACA CCAACTCCGG GCGAAAGGAT CGCGGCATCG TCGCGCTGCC GTTCAAGCGC CGCTCCGATG GCGAGACGGT GTACTTCCCT TCCAACCTGA TCGAGAACCT CTACCTCAGC AACGGCATGA GCGCCGGCAA CACGCTCGCC GAGGCCGAGG TGCAGTGCCT TTCCGAGATC TTCGAGCGCG CGGTGAAGAA GGAAATCATC GAGCAAGAGA TCGTCCTGCC GGACGTGCCG GAAGAGGTGC TGGCCAAGTA CCCCGGCATT CAGGAAGGCA TCGCCGCACT GGAGGCCCAG GGCTTCCCGG TGCTGGTCAA GGACGCCTCG CTCGGCGGCC GCTACCCGGT GATGTGCGTG ACCCTGATGA ACCCGCGCAC CGGTGGGGTG TTCGCCTCCT TCGGTGCGCA CCCGAGCTTC CAGGTCGCGC TGGAGCGCAG CCTCACCGAG CTGCTCCAGG GGCGCAGCTT CGAGGGCCTG AACGACCTGC CGCAGCCGAC CTTCAGCTCG CTGGCGGTCT CCGAGCCCAA CAACTTCGTC GAGCACTTCA TCGACTCCTC CGGGGTGATC TCCTGGCGCT TCTTCAGCGA CCGCACGGAT CTCGACTTCC ACGAATGGGA CTTCGCCGGC ACCACTGCGG AAGAAGCCGA GCGCCTCTAC GGACTGCTCG CCGATCAGGG CCTGGAAGCC TACGTGATGG AACACGAGGA CCTGGGCGCG CCGGTGTGCC GCATCCTGGT GCCGGGGTAT TCCGAGGTAT ACCCGGTCGA GGACCTGGTG TGGGACAACA CCAACATGGC GCTGGACTTC CGGGCGGACA TTCTCCATCT GCACACACTT GAGGATGAGC GCCTCGCCGA CCTGCTCGAG CGCCTGGAAG AAAGCCAGCT CGACGATCAC ATCAAGGTCG GCACGCTGAT CGGCATCGAG TTCGACGACA ACACCGTGTG GAGCGAGCTG ACCATCCTCG AGCTCAAGCT GCTGATCGAG CTCGCCCTGG GCGAGTACGA AGCCGCGCTG GAGCACGTTC AGATGTTCCT GCAGTTCAAC GACAACACCG TGCAGCGCGG CCTGTTCCAT CAGGCCATGC AGGCGGTGCT GGAGATCGCC CTGGACGATG ACCTCGACTT CAACGACTAC CACCGCAACC TCACGCGTAT GTTCGGCGAA GAGACCATGC GCCAGGTGAT CGGCGCGGTG AATGACGAGA TACGCTTCCC CGGGCTGACG CCCACCAGCA TGGGACTGGA AGGCATCGAC CGCCACCAGC GCCTGATCGA AAGCTACCGC AAGCTGCACG CCGCACGCGC CGCCGGGGCC GGCATCGCCA CCTCGTAA
|
Protein sequence | MEIKVNFLEN LRLAAKFDDF TVETDQPIRY KGDGSAPSPF DYFLASSALC AAYFVRLYCN ARDIPTENIR LSQNNIVDPE NRYNQIFKIQ VELPEDLSEK DRTGILRAAE RCSVKRVIQN APEFQIETVE NIDEDAQALL MGTNAKDGDA GGETWIEGKD LPLERTIANM TRILEDLGMK IEIASWRNIV PNVWSLHLRD AASPMCFTNG KGATKEAALC SALGEFIERL SCNFFYNDQF FGEAIADSAF VHYPSERWFP LEDDDALPAG LLDAHCRAIF DPDGELRGSH LIDTNSGRKD RGIVALPFKR RSDGETVYFP SNLIENLYLS NGMSAGNTLA EAEVQCLSEI FERAVKKEII EQEIVLPDVP EEVLAKYPGI QEGIAALEAQ GFPVLVKDAS LGGRYPVMCV TLMNPRTGGV FASFGAHPSF QVALERSLTE LLQGRSFEGL NDLPQPTFSS LAVSEPNNFV EHFIDSSGVI SWRFFSDRTD LDFHEWDFAG TTAEEAERLY GLLADQGLEA YVMEHEDLGA PVCRILVPGY SEVYPVEDLV WDNTNMALDF RADILHLHTL EDERLADLLE RLEESQLDDH IKVGTLIGIE FDDNTVWSEL TILELKLLIE LALGEYEAAL EHVQMFLQFN DNTVQRGLFH QAMQAVLEIA LDDDLDFNDY HRNLTRMFGE ETMRQVIGAV NDEIRFPGLT PTSMGLEGID RHQRLIESYR KLHAARAAGA GIATS
|
| |