Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0581 |
Symbol | |
ID | 4026318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 646273 |
End bp | 647772 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637965749 |
Product | Ppx/GppA phosphatase |
Protein accession | YP_572642 |
Protein GI | 92112714 |
COG category | [F] Nucleotide transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0248] Exopolyphosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.298881 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGTC GCTACCCGGC TCCCCGACGT CTCGCCGCCA TCGACCTGGG ATCGAATAGC TTCCACCTGC TGGTCGCCAA TCATTATGCC GGCCAGCTGC AGGTCGTCGC CAAACTGGGC GAGAAAGTGC AGCTGGCAGC GGGGCTCGAC GAAGACGAGC GGCTCGATGA CGACGCCATG CAGCGCGCCC TGGACTGCCT GGAACGCTTC GCGCCGTATC TGAGCGGACT CGCGGCCAGC GATGTCCGCA TCGTGGGAAC CAACGCCCTG CGCGCAGCGC GCAACCGCCA GGCACTGATC GACCGCGCCG AGGCCTTGAT CGGCCACCCC ATCGAGATCA TTGCCGGTCG CGAGGAAGCC CGCCTGATCT ACCTGGGGGC CGCGCATGCA CTGGCCGAGG TGCATGGACG ACGACTCATC GTCGACATCG GCGGCGGCTC CACCGAGCTC ATCATCGGCG AGCGCTTCGA ACCGCTGGCG CTGGAAAGTC TGCACATGGG CTGCGTGGCC TATACGCAGC GTTTCTTCGC CAACGGCGAC ATCAGCGAAA AGGCCTTTCG CGCGGCCGAG CTCGCCGCGC TTTCCGAGCT GGCCAACATT CGCCGTCCCT ACCAGGAGCT TGGCTGGCAG GACCCGGTGG GGTCCAGCGG TACCATCAAG GCCATCGCCG CCGTGCAGGC CGCAAGCGGC GATGCGCCGG AAGGCATGAT CACGCGTCAG GGGCTGAAGA ACCTGCGCGG CCGGCTGCTG AAGTGCAAGA AGCTCGACAA GGTCGCCATG GAGGGGCTCA AGAGCGACCG GGCACGGGTA TTCCCGGCCG GCGTGGCAAT CCTGTGTGCC ATCTTCGAGG CCTTCGACCT GGAACGCATG CGCTACTCCG ACGGCGCCCT GCGCGAAGGC GTGCTCTACG ACCTCCTCGG CCGCAACACC ACCGAGGACT CCCGCGCGGC GGCCCTGCTC ACCCTGCGCC GACGCTTCGA CGTGGACGCC CGCCAGGCCA ACAACGTGCG GGCCACCGCC GACCATCTGG TCACACAAGT CGCCGATCAC TGGTCGCTGA GTCAGGAACA ATGCCAGTTT CTGGCGTGGG GCGCCGAACT GCATGAGATT GGCCTGGCGA TTTCCCACAG CCAGTTCCAC CGGCATGGCG CCTACCTGCT GGAGAACTCC GACCTGGCCG GGTTCTCGCG CCCCGAGCAG CAGGCCCTGG CCTTCCTGGT GCGTGCGCAT CGCCGCAAGT TTCCCGGCAA GGAGCTCGAC GCCCTCCCGG CCGCCGCGCG TACCACCTAC GCACGGCTGG CACGCCTGCT GCGCCTGGCG GTGGCCCTCA ACCACTCACG CCCCGAGCAG CCGCCCCGCG ATGTGACGCT GCGAGCCGAT GGCGAGACCC TCTACGTGGA CCTCGACCAG GCCAGCGATC AGCCCCTGCT GAGCGCCGAC CTGGAACAGG AAGCGCTCTA TCAGCGCAAC GCCGGGTTTC GGCTCGAGCT CAGCCGCTAG
|
Protein sequence | MSSRYPAPRR LAAIDLGSNS FHLLVANHYA GQLQVVAKLG EKVQLAAGLD EDERLDDDAM QRALDCLERF APYLSGLAAS DVRIVGTNAL RAARNRQALI DRAEALIGHP IEIIAGREEA RLIYLGAAHA LAEVHGRRLI VDIGGGSTEL IIGERFEPLA LESLHMGCVA YTQRFFANGD ISEKAFRAAE LAALSELANI RRPYQELGWQ DPVGSSGTIK AIAAVQAASG DAPEGMITRQ GLKNLRGRLL KCKKLDKVAM EGLKSDRARV FPAGVAILCA IFEAFDLERM RYSDGALREG VLYDLLGRNT TEDSRAAALL TLRRRFDVDA RQANNVRATA DHLVTQVADH WSLSQEQCQF LAWGAELHEI GLAISHSQFH RHGAYLLENS DLAGFSRPEQ QALAFLVRAH RRKFPGKELD ALPAAARTTY ARLARLLRLA VALNHSRPEQ PPRDVTLRAD GETLYVDLDQ ASDQPLLSAD LEQEALYQRN AGFRLELSR
|
| |