Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_0799 |
Symbol | |
ID | 4026172 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 894547 |
End bp | 895740 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 637965965 |
Product | aldose 1-epimerase |
Protein accession | YP_572855 |
Protein GI | 92112927 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2017] Galactose mutarotase and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGAT TTTCAGGAAT TTCACGCTTG GCCATGGGGG TGGGCATGAC CTGCCTCGCG CTGAGCGCGG CACAGGCGAA CACGATGCAA ACGCAGTCCC CGTCGGTGGA AAAGGAATCT TTCGGACAGT TGCCCGATGG CCGCAAGGTC GAGGCGTACC ACCTACGCAA TGGCCATGGC ATTGACATGA AGGTGACCAC CTATGGCGGC ATCATCACCT CCCTGCGCAC ACCGGATGCC GAGGGCGAGT GGGCCGATGT CGTTCTGGGG TTCGACAATT TGGCCGACTA TCGCTCCGAG GCGTATCGCC AGTCCAACCC CTATTTCGGT GCCCTGATCG GGCGCTATGG CAACCGAATC GCCGAGGGGC GCTTCACGCT CGATGGCACG ACCCATGAGC TGGCGACCAA CGATGGTGCC AATCACCTGC ATGGTGGGGA GCGGGGCTTC GACAAACGCC TGTGGACGGC GGCGCCTTTC GAGAACGACA GCGAGGTGGG CGTCGAGCTG ACCTATGTCA GCGAGGATGG CGAGGAAGGT TATCCCGGGC GGCTGGAAAC CCACGTGACC TACACGCTGA CCGCCGACGA CGAGGTGATC ATCGATTATC ACGCCACGAC CGACAAGGCC ACGCCGGTCA ATCTCACGCA GCACAGTTAC TTCAACCTGG AGGGCGAGGG CAGCGGCTCG ATCGTCGATC ATCGGCTGAT GCTCAATGCC GATGCCTTCA CGCCGGTGGA CGACACCTTG ATCCCGACCG GAGAGCTGCG CGACGTGGCG GACACGCCGT TCGACTTCCG TGAGGCCACC CCGATCGGTG CACGCATTGG CGCGGACAAT ACGCAACTGG CGTACGGGCA GGGCTACGAT CACAATTTCG TGCTGATGCG TGACGCGACG GCGGAAGACG AACTGGTGCT GGCCGCCCGC GTCGAAGCGC CGGACAGTGG CCGCGTCCTG GAGATCGCCA CGTCGGAGCC CGGTGTGCAG TTCTACTCGG GCAACTTTCT CGATGGCACG CTGATCGGCA AGCAGGGCAA GGCATACGAA AAGCGCAGCG GGTTCGCGCT GGAAACCCAG CATTTTCCGG ATTCACCCAA CCAGGCGGCC TTTCCCTCGA CGATTCTCGA GCCCGGCGAG ACCTATCAAT CGCGCACGGT GTGGCGTTTC TCCACGCAGA CACCGACGCC CTGA
|
Protein sequence | MSRFSGISRL AMGVGMTCLA LSAAQANTMQ TQSPSVEKES FGQLPDGRKV EAYHLRNGHG IDMKVTTYGG IITSLRTPDA EGEWADVVLG FDNLADYRSE AYRQSNPYFG ALIGRYGNRI AEGRFTLDGT THELATNDGA NHLHGGERGF DKRLWTAAPF ENDSEVGVEL TYVSEDGEEG YPGRLETHVT YTLTADDEVI IDYHATTDKA TPVNLTQHSY FNLEGEGSGS IVDHRLMLNA DAFTPVDDTL IPTGELRDVA DTPFDFREAT PIGARIGADN TQLAYGQGYD HNFVLMRDAT AEDELVLAAR VEAPDSGRVL EIATSEPGVQ FYSGNFLDGT LIGKQGKAYE KRSGFALETQ HFPDSPNQAA FPSTILEPGE TYQSRTVWRF STQTPTP
|
| |