Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1712 |
Symbol | |
ID | 4028820 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1948041 |
End bp | 1949201 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 637966900 |
Product | polysaccharide export protein |
Protein accession | YP_573763 |
Protein GI | 92113835 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1596] Periplasmic protein involved in polysaccharide export |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.69321 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTAC GCCGCAACGA GTCTCGTTCG GGATGGAGAA AGCCACTGCT GGGGCTTTCT TTCATGGTGG CGCTCTCGGG CTGTGCGTTC GCACCCGGTG GGCATATCGA TTATGACACG CAGGGGGAGG ACCTTTCCGA GAACATCGAG GTCAAGCCGA TCACGCCGAG CCTGGTCAAG ACGATGGCGG CGAGCGGGGA TGAGCTGCGC GAGTCGCTGT TCGAGTATGC CGAGAATGCC GAGCCGCAGA TGGAGGATCT CGATTACGAC TACATGATCG GGCGCGGGGA CGTGCTGGCG GTCGTCGTGT ATGACCATCC CGAACTGACG ATTCCCGCCG GTAGCGAACG CAGTGCGGAA GAGTCGGGCA ACGTGGTGCA CTCGGACGGC ACCATCTTCT ATCCCTACAT CGGTACGGTG GATGTCGCTG GACGCACCGT GCGCGATGTC CGCAGCGAGA TTCAGCGCCG CCTCGAAGGC TACATCGCTC AGCCTCAGGT GGACGTGAAG GTCGCCGCCT TCAATGCGCA GAAGGCGTAC GTCACCGGCC AGGTCGAACG TGCCGGTGCC CAGCCGATCA CCAACATTCC GCTCACCGTG CTGGATGCCT TGAGCAACGT GGGGGGGCTG ACCCAGGGCG GCGACTGGCA TGACGTCGTG CTCACGCGTG ACGGCCAGGA GATCCACCTG TCGGTGTACG ACATGCTGGT CAACGGCAAT CTCGAACAGA ACCTGTTGCT CCAGGATGGC GACGTGCTGC ATGTGCCGGT GGTCGGCAAC CAGCAGGTCT ACGTGATGGG CGAGGTCAAT ACGCCGACCG CGGTACCGAT GCCGAACGAG CGTCTGTCGT TGACCAACGC CTTAGCCCAG GCCGGCGGCA TCAACGAGAA CAGCGCCGAT GCCTCGGGGA TCTTCGTGAT TCGCCGCAAT CACGATGTCG AAAGCGACAC CTTCGCCACC GTCTACCAGC TCAACGCCAA GAACGCGATC TCCTTCGTGC TGGGGTCGGA ATTCATTCTG CAGCCCACCG ATGTGGTGTA TGTCACCGCC GCGCCGATTG CCCGCTGGAA CCGCGTCATC AGCCAGATCC TGCCCAGCGT GACGGCGATC TACCAGCTGA CGCAGGCCAC GCGTGACATT CAGGACATCG ACGATAACTA G
|
Protein sequence | MTLRRNESRS GWRKPLLGLS FMVALSGCAF APGGHIDYDT QGEDLSENIE VKPITPSLVK TMAASGDELR ESLFEYAENA EPQMEDLDYD YMIGRGDVLA VVVYDHPELT IPAGSERSAE ESGNVVHSDG TIFYPYIGTV DVAGRTVRDV RSEIQRRLEG YIAQPQVDVK VAAFNAQKAY VTGQVERAGA QPITNIPLTV LDALSNVGGL TQGGDWHDVV LTRDGQEIHL SVYDMLVNGN LEQNLLLQDG DVLHVPVVGN QQVYVMGEVN TPTAVPMPNE RLSLTNALAQ AGGINENSAD ASGIFVIRRN HDVESDTFAT VYQLNAKNAI SFVLGSEFIL QPTDVVYVTA APIARWNRVI SQILPSVTAI YQLTQATRDI QDIDDN
|
| |