Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sde_0774 |
Symbol | |
ID | 3965827 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Saccharophagus degradans 2-40 |
Kingdom | Bacteria |
Replicon accession | NC_007912 |
Strand | + |
Start bp | 998716 |
End bp | 1000218 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 637919836 |
Product | L-arabinose isomerase |
Protein accession | YP_526248 |
Protein GI | 90020421 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.031491 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0818763 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTT ACGGCGAGAA AAAAATTTGG TTTGTAACAG GTTCGCAGCA TCTCTACGGC CCAGGTGTAT TGGCACAAGT AGCAGAAAAT AGCCAAGCCA TTGCGGCAGG TTTAACTGCC TCCGAGCATG TTTCTGCCAA CATCGAATCG CGCGGTGTAG TTACAACGCC AAAAGAAATT TTGGATGTAT GCCAAGCGGC TAACAGCGAT GAAAATTGTG TGGGCCTCAT TTTGTGGATG CATACATTTT CGCCAGCAAA AATGTGGATA GCTGGTTTAA GTGCACTCAA CAAGCCTTAT ATGCATTTAC ATACCCAGTT TGGCGCAGCA TTGCCCTGGG GCGACATAAA CATGAACTAT ATGAACCTTA ATCAAAGCGC CCACGGCGAC CGCGAATTTG GTTACATTGG CACTCGTTTG CGTCAAGAGC GCAAAGTAGT TGTTGGCCAC TGGCAAAAAG AAAGCGTGCA AATACAAGTA GACGATTGGG TGCGTGCGGC AATGGGCTGG GCGGAATCGC AAACCCTTAA AGTGGCTCGC TTTGGCGATA ACATGCGTCA AGTTGCTGTA ACTGAAGGCG ACAAGGTATC TGCTCAAATT CAGTTTGGCT ACGAAGTGCA TGCTTTCGGT TTGGGCGATT TAGCAAAAGC GTGTGAGAAA ATTACTGCAG AGCAAATTAC TGCGCAGTTA GAACTATACA AGCAAGATTA CGAAATAGAT GCAGATGTAT TTACCGACGT ACACAGTTTA GAAATGCTGC AAAACGAAGC GCGTTTAGAG CTGGGTATGG AAGCGTTTCT AGAAGAAGGC AACTTTAAAG CCTTTACCAA CTGTTTTGAA AACTTGACCG GCTTATCGGG TTTGCCAGGT TTGGCTACCC AGCGATTAAT GAGCAAAGGC TATGGCTACG GTGGTGAAGG CGACTGGAAA ACCGCTGCAA TGTGTCGCAT TGTAAAAGTG ATGAGCTTGG GTAAGGCTGC AGGCACATCT TTTATGGAAG ATTACACCTA CAATTTCGGC GACCCAGATC AAGTATTGGG TGCGCATATG TTGGAAGTGT GCCCAAGTAT TTCCAACGAG AAGCCCAAGG TTGTGGTAGA GCGTCACACC ATTGGTATTA AGAAAGATAT CGCCCGCTTA ATCTTTACCG GCACCCCTGG CCCTGCTATC AATATTTCTA CCATCGATAT GGGCACACGG TTCCGTATTA TTGCCAACGA AGTAGATACC GTAAAACCAC CGCAGGATTT GCCAAACCTG CCTGTTGCAA AAGCACTGTG GGAGCCGCGT CCAAGCCTAG AGGTTGCTGC AGCCGCGTGG ATTCACGCTG GTGGCGCACA TCACAGCGTT TACACACAAG GTATTAACCT AGATCAGCTA AATGATTTTG CCGAAATGGC CGGTGTAGAA ATGGTTGTAA TCGGTGCCGA TACCAACGTT AACGAATTCA AAAAAGAATT GCGTTTTAAC GCTGTGTACT ATCATTTAAG TCACGGTATC TAA
|
Protein sequence | MKIYGEKKIW FVTGSQHLYG PGVLAQVAEN SQAIAAGLTA SEHVSANIES RGVVTTPKEI LDVCQAANSD ENCVGLILWM HTFSPAKMWI AGLSALNKPY MHLHTQFGAA LPWGDINMNY MNLNQSAHGD REFGYIGTRL RQERKVVVGH WQKESVQIQV DDWVRAAMGW AESQTLKVAR FGDNMRQVAV TEGDKVSAQI QFGYEVHAFG LGDLAKACEK ITAEQITAQL ELYKQDYEID ADVFTDVHSL EMLQNEARLE LGMEAFLEEG NFKAFTNCFE NLTGLSGLPG LATQRLMSKG YGYGGEGDWK TAAMCRIVKV MSLGKAAGTS FMEDYTYNFG DPDQVLGAHM LEVCPSISNE KPKVVVERHT IGIKKDIARL IFTGTPGPAI NISTIDMGTR FRIIANEVDT VKPPQDLPNL PVAKALWEPR PSLEVAAAAW IHAGGAHHSV YTQGINLDQL NDFAEMAGVE MVVIGADTNV NEFKKELRFN AVYYHLSHGI
|
| |