Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sare_2116 |
Symbol | |
ID | 5704970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 2437857 |
End bp | 2438846 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641271601 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001536972 |
Protein GI | 159037719 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.463092 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00350753 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGAGCTCAT CCAGGGTGGT GGTCACGGGC GGGTGTGGGT TCATCGGCAG TCACCTGGTG GACCAGTTGG TCAGGCGGGG CGACGACGTG GTGACCTTCG ACGGCGTAGC ACCCAGCACG GGTGAGCGGC GTCCCGGTAC CACGGCGCGG CACATCGTCG GTGACGTCCG CGACCCCTCG GGGCTCGCCC AGGCGATACA GCCCGGCGTC GACGTGGTCT ACCACATGGC CGCCGTGGTT GGGGTGGACC AGTACCTGGC CCGGCCACTG GACGTCATCG ACATCAACCT CAACGGCACC CGCAACGTCC TCGAACTGGC CGCCAGGGCC GGTGCACGGG TGATCGTGGC CAGCACCAGC GAGGTGTTCG GCAAGAACCC GGCGGTGCCC TGGAAGGAGG ACGGCGACCG CGTCCTCGGC CCGACCACGG CCGACCGGTG GGCGTACTCC TCCAGCAAGG CACTCGCGGA GCACCTGACG TTCGCGTTCG CCCGCCAGCA CAGCCTGGCG GCCACCGTGG TGCGCTACTT CAACGTGTAC GGGCCACGCC AGCGTCCCGC CTACGTCGTG TCCCGCAGCA TCCACCGAGC CCTCAACGGG CTCGCCCCGG TGGTGTACGA CCAGGGCCGG CAGTCCCGCT GTTTCACGTA CGTGGCCGAC GCGGTGGACG GGACCATGCT GGCCGCCGCT GCGCCGTCCG CCGTCGGTGA GGCGTTCAAC CTCGGCAGCA TGCGAGAGAG CATGATCAGC GAGGTCGTCG AGCTGGTCGC CAAGTTGGCG GGCGGCACCT CTACCACGTC GGTGGACACC GCGGCACGGC TCGGCGCCGC GTACCAGGAC CTACCCCGGC GCGTGCCGGA CAACACCAAG GCCCGCACGA CTCTCGGCTG GGACTGTGCC ACACTGCTGG AGGACGGCCT GGCGCGGACG ATCGAGTGGG CCCGCGCCAA CGCCTGGTGG CTGGCACGGG CCGACACCGG CGCCGCGTGA
|
Protein sequence | MSSSRVVVTG GCGFIGSHLV DQLVRRGDDV VTFDGVAPST GERRPGTTAR HIVGDVRDPS GLAQAIQPGV DVVYHMAAVV GVDQYLARPL DVIDINLNGT RNVLELAARA GARVIVASTS EVFGKNPAVP WKEDGDRVLG PTTADRWAYS SSKALAEHLT FAFARQHSLA ATVVRYFNVY GPRQRPAYVV SRSIHRALNG LAPVVYDQGR QSRCFTYVAD AVDGTMLAAA APSAVGEAFN LGSMRESMIS EVVELVAKLA GGTSTTSVDT AARLGAAYQD LPRRVPDNTK ARTTLGWDCA TLLEDGLART IEWARANAWW LARADTGAA
|
| |