Gene Information |
Locus tag | Csal_1702 |
Symbol | |
ID | 4028540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1934686 |
End bp | 1935666 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637966890 |
Product | hypothetical protein |
Protein accession | YP_573753 |
Protein GI | 92113825 |
COG category | [S] Function unknown |
COG ID | [COG5464] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01784] conserved hypothetical protein (putative transposase or invertase) |
|
|
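The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the 981 bp include the stop codon. The record does not state either convention, and the function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1934686
END_BP = 1935666
GENE_LENGTH_BP = 981
PROTEIN_LENGTH_AA = 326


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length matches record:", length == GENE_LENGTH_BP)
    print("protein length matches record:",
          protein_length(length) == PROTEIN_LENGTH_AA)
```

Under these assumptions the fields agree: 1935666 - 1934686 + 1 = 981 bp, and 981 / 3 - 1 = 326 aa.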
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.412934 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAGTC ACCCCCACCA CCCTCACGAC CACAGCTATA AGTTGCTGTT CTCACACCCT GAGATGGTGA GAGATCTATT GACCGGGTTC GTCAAGGAAG CCTGGGTGGA ACAACTCGAC TTCTCGACAC TGGAGAAGGT CAGCGGCTCC TATATCACCG AAGATCTGCG AGATCGCGAG GACGACGTCA TCTGGCGAGT GCGTTGGGGC GACGACTGGC TCTATGTTTA TTTGCTGCTC GAGTTTCAGT CGAGCGTCGA TAGATTCATG GCCGTGCGAG TCATGACCTA CCTGGGGCTG CTCTATCAGG ACTTGATTCG TCAGGAGGCC TTTACCCCCA ATGGCAAGCT ACCGCCAGTG TTGCCGATTG TGCTCTACAA TGGGGAGAAG CGCTGGACGG CGGCACAAAA CGTGGCGGAC CTGGTCGAAC AGGTACCCGG AGGGCTCGAA CGTTATCGGC CGAACTTGGC CTACCTGCTT CTCGACGAAG GAGCGGTCAT CAGCGATCCT GAGTGGTCGG ATCACATGCG CAACGTGGCT GCTGCGCTCT TTCGATTGGA GCACAATCGC GACGAGCAAG ACATGCTGGA GGTGCTGGGC ACGCTGGTCG AGTGGCTCAA GGCGCCCGAG CAAACCGGGC TACGACGGGC CTTCGTGGTG TGGATACGCC GCGTACTGCT GCCCAACCGG GCGCCGGGGA TGGAACTGCC CGAGTTCAAC GAGTTGCAGG ATCTACACGA GGTACACGAC ATGCTGGCAG AACGCATCAA GCAATGGCCT GAACGGTGGG AAGAGAAAGG CCGTCAGGAA GGCCGTCAAG AAGGGCGTAA AGAAGGGCGT CAGGAAGGCG AACAACGGGG CATCGAGAAG ACCGCCCGCA ACCTGATCAA GCTGGGTGTA CTCAGTGATG AACAGATCGC CGAGGCCACG GGGCTGACGG TGGCCGAGGT GGAAGGGCTG CGCGAAGAAG ACACGCAGTG A
|
Protein sequence | MASHPHHPHD HSYKLLFSHP EMVRDLLTGF VKEAWVEQLD FSTLEKVSGS YITEDLRDRE DDVIWRVRWG DDWLYVYLLL EFQSSVDRFM AVRVMTYLGL LYQDLIRQEA FTPNGKLPPV LPIVLYNGEK RWTAAQNVAD LVEQVPGGLE RYRPNLAYLL LDEGAVISDP EWSDHMRNVA AALFRLEHNR DEQDMLEVLG TLVEWLKAPE QTGLRRAFVV WIRRVLLPNR APGMELPEFN ELQDLHEVHD MLAERIKQWP ERWEEKGRQE GRQEGRKEGR QEGEQRGIEK TARNLIKLGV LSDEQIAEAT GLTVAEVEGL REEDTQ
|
| |
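The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a field could be cleaned, checked for GC content, and translated. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical, and the codon table is built by hand rather than taken from any library. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in permitted start codons), so the standard assignments are used here. Only the first three blocks of the gene sequence are included, to keep the example short.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same codon-to-amino-acid mapping ('*' marks a stop codon).
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean_sequence(field: str) -> str:
    """Remove the whitespace between blocks and upper-case the result."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First three blocks of the gene sequence field in this record.
    fragment = clean_sequence("ATGGCGAGTC ACCCCCACCA CCCTCACGAC")
    print(translate(fragment))  # first ten residues of the protein
    print(round(gc_fraction(fragment), 2))
```

Applied to the fragment, `translate` returns `MASHPHHPHD`, which matches the first block of the protein sequence field. Running `gc_fraction` on the full cleaned gene sequence would give a value to compare with the reported GC content of 59%.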