Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1654 |
Symbol | |
ID | 4029116 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1881393 |
End bp | 1882769 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637966843 |
Product | hypothetical protein |
Protein accession | YP_573706 |
Protein GI | 92113778 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0304713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGGGG ATTACAAGGT ACCGGGAATT GCAACGATCA TACTGGCGCT GGCACTCGCT GGATGCAGCG GTAGTTCATC GGAGTCTCAG CCCATGGATG GCGGGCCAGG GGACGGCCCT GATATGGGCG ATAACAACGG CGGCAACAAT GGCGGCAACA ACGGAGGTGA TGGCGGTAAC GGCAATGGTG GCAACGACGG CGGCGATGGT GGTAATGGCG GCGGCGATAT CGCCGACGAT TCGACGGCCG TCGAGCAGGT CGTCTCCGAC ACCGGTGATG CCACCACGGG CCTCGGCGAT ACCGTGATCG GGGCCGGCGA AGCCATTTCC CAGTCCGACG TCGGCGGCGC CGGCTTCGTC ACCGATGGCA CGGGCGGTGT CATCGTGGAT GTCGGTGAAG GCGTGAACAG TCTCAGTTCG GGCCTGAACG ATGGCCTCGG CAGCTTGAGC GAAAACGACA ACGCCCTCGG GACCACACTG GGCGGCGCTA CATCGGCCGT GGGTGAAGTC GGCGAAGCGG TGTCGTCCGC CAGCACCCTC GTCACCGGGC TCAACACGCT GCCGGTCGTC GGTCAGCTAG ACGAACGCAC CCACCTAGTG TCCGGCGTCG GGGATACCGT CAATCAACTG GGCGTGGCGG TGACTTCCAC CGCGGGATCG CTGACCGCGT CGCTGACGGC GGAAGACGGC CACCTCAGTG GTTTGACGCG AGAAGCCACG AGCGTCGTGC GGCCGCTGGT CACCAACCTC GAAGGCACCA CGCAAACCCT GGGTGACGGC CTGGGCGTCG GCCAGCCAGT CGATGGTCTG CTGACTTCCG TGGGCGGCAT CGTGACCCAG ATCGGCGGGC GCGTCGGCGA GCGTTCCGAA GCGCTCGGCG GGCTGGGCGG CACCCTGCAG GGCGTCGGCA ATGCCGTCGG CGACCTGGGC GGCCTGGTCA CCGAGGGTGA CGGTAACGCC AGCGGCGGTC TGCTCAGCGG CCTGCAAGGC GAAGACGGCC TGCTCGGTGG CGGTCTCGGC GGTGAAGGGG GCCTGCTGGG CGGTCTCACC GGCGGTGAAG GCGGTCTGCT CGGCGGCGGT AACGGCGCTG GCGACGGCGA TACCGGCAAG GGGCTGGTCG GCGGTGTCGT GGGCGTCGTC GGCAACGTGA CCGGCGGATT GACCGGCGGT GGGCTGACCG GCGGCGGCTC GAGCGACGAC GGCAGCGACA AGAAAGGCTT GGTCGGTGGC CTCGTCGGTG GCGTGACCGA CTCGCTCTCC GGCGTGACCG GTGGCTTGAG CGGTGGCGGC GATGCCAGCG ACGGCGGCAC GGACCAGGAC ACTCAGCGCA AAGGGCTGCT CGGCAACCTG CTGGGCGGGG GACTGCTCGG CCGCTGA
|
Protein sequence | MLGDYKVPGI ATIILALALA GCSGSSSESQ PMDGGPGDGP DMGDNNGGNN GGNNGGDGGN GNGGNDGGDG GNGGGDIADD STAVEQVVSD TGDATTGLGD TVIGAGEAIS QSDVGGAGFV TDGTGGVIVD VGEGVNSLSS GLNDGLGSLS ENDNALGTTL GGATSAVGEV GEAVSSASTL VTGLNTLPVV GQLDERTHLV SGVGDTVNQL GVAVTSTAGS LTASLTAEDG HLSGLTREAT SVVRPLVTNL EGTTQTLGDG LGVGQPVDGL LTSVGGIVTQ IGGRVGERSE ALGGLGGTLQ GVGNAVGDLG GLVTEGDGNA SGGLLSGLQG EDGLLGGGLG GEGGLLGGLT GGEGGLLGGG NGAGDGDTGK GLVGGVVGVV GNVTGGLTGG GLTGGGSSDD GSDKKGLVGG LVGGVTDSLS GVTGGLSGGG DASDGGTDQD TQRKGLLGNL LGGGLLGR
|
| |