Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_2008 |
Symbol | |
ID | 4027092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 2268048 |
End bp | 2269691 |
Gene Length | 1644 bp |
Protein Length | 547 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637967203 |
Product | hypothetical protein |
Protein accession | YP_574058 |
Protein GI | 92114130 |
COG category | [R] General function prediction only |
COG ID | [COG3008] Paraquat-inducible protein B |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGATG CTCCCCTGCG CGCGACGCGC CACCATCGCC GGCGGCTTTC GCCGATCTGG ATCGTACCGC TCGTGGCGGT TCTCATCGGC GCATGGATGC TCTACGACAA TCTCTCGGCA CTCGGCCCCA CGATCACCCT GGAGATGAAA AACGCCGAGG GCATCGAGGC CGGCAAGACA TTGATCAAGA CGCGCAACGT CGAGGTCGGC CGCGTCGAGG ACGTGACACT CTCCGAAGAC ATGTCCCACA CCATCATCAC GGCCCGCATG AAGCCCGATA CCGAACGCAT GCTGAATGAC GAGGCGCGCT TCTGGGTCGT CAAGCCGCGT ATCGACCGGG AAGGCATCAG CGGACTGGGC ACCGTGCTCT CGGGGGCCTA TATCCAGCTG CTGCCCGGCA ACGGCGAAAC CGCCCAGCGT GAATTCGAGG TACTCGACCA ACCGCCCGTG GCTCCGCCCG ATGCGCCGGG CATTCGCGTC AACCTGGTCA GCAAGGTGGG CAGTTCCCTG CGCGCCGGCG ACCCCATCAC CTATCAGGGA TTCACCGTCG GGCGCGTGGA GAACACCGAG TTCGATCCCG AAGAAAAGGA AATGCGCCAT CGCCTTTACA TCCAGTCGCC TTATGACGTT CTGGTCACGG ACACCACACG CTTCTGGATC TCGTCGGGTG TCGACGTGCG CCTCGACTCC CAGGGCTTCC GGGTCAACGT GGAATCCATG GAATCGCTCA TCGGCGGCGG CGTGACCTTC GGCGTGCCGG AGGATGTCGC CATGGGGCAT CCCGCCGAAT CCGAGGCGAC CTACCAACTC TTCAGCGACG AGGAAAGCGC GCGTGAAGGC ACCTTCGACC GCTACCTCGA ATACGTGCTG CTGGTCGACG ACACCGTACG CGGCTTGAGC CGAGGCGACT CAGTCGAGTA CCGCGGCGTG CGCGTGGGGA CCGTGGAAGC CGTGCCCTGG CGCTTCTCCG CTCCCCAGCC GGATACGCTG AACCGTTTCG CGATTCCGGT ACTGATTCGC ATCGAGCCAC AGCGCTTCGA CGACGCCATG GCGAATTTTG ACGCCGAGGA CTGGCGTGCC CGCCTCGAAC GCATGTTCGA GCACGGCCTG CGCGCCACGC TCAAGGCCGG CAATCTGCTC ACCGGCGCGC TGTTCGTCGA CCTCAACTTC CGCGACGACC CCGAGCCCTA CGAAGCGCTC ACCTTCGAGG GCAAGACGGT GTTCCCGACG ACGTCGGGCG GCTTCGCGCA AATCGAGCAG AAAGTCTCCA ACCTGCTCGA CAAGCTCAAC GAGCTCGAGG TCGAGCCGAT CCTCACCTCG CTGAACGATA CCTTGACGAC GACACGCGCC ACGATGCGCA AGGTCAACGA CATCGCCAAC TCCGTGGATA CCTTGCTGAA CGACCCGGCG ACCCGCGAGC TGCCGCAGAA CCTCAACGAG ACGCTGCGCC AGACCCGCGA TACGCTGCAG GGCTTCTCGC CCGACTCGCA GGGCTACCGT GAACTCAACG ACACACTCTC GCGGCTCGAG TCGCTGATGC GCGACCTGCA GCCCGTGGTG CGCACGCTCA GCGAGAAACC CAACGCCTTG ATCTTCGACC GTGAAGAGAC ACGAGATCCA TTACCGAGGG CCCCAAGCCA ATGA
|
Protein sequence | MPDAPLRATR HHRRRLSPIW IVPLVAVLIG AWMLYDNLSA LGPTITLEMK NAEGIEAGKT LIKTRNVEVG RVEDVTLSED MSHTIITARM KPDTERMLND EARFWVVKPR IDREGISGLG TVLSGAYIQL LPGNGETAQR EFEVLDQPPV APPDAPGIRV NLVSKVGSSL RAGDPITYQG FTVGRVENTE FDPEEKEMRH RLYIQSPYDV LVTDTTRFWI SSGVDVRLDS QGFRVNVESM ESLIGGGVTF GVPEDVAMGH PAESEATYQL FSDEESAREG TFDRYLEYVL LVDDTVRGLS RGDSVEYRGV RVGTVEAVPW RFSAPQPDTL NRFAIPVLIR IEPQRFDDAM ANFDAEDWRA RLERMFEHGL RATLKAGNLL TGALFVDLNF RDDPEPYEAL TFEGKTVFPT TSGGFAQIEQ KVSNLLDKLN ELEVEPILTS LNDTLTTTRA TMRKVNDIAN SVDTLLNDPA TRELPQNLNE TLRQTRDTLQ GFSPDSQGYR ELNDTLSRLE SLMRDLQPVV RTLSEKPNAL IFDREETRDP LPRAPSQ
|
| |