Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Shewmr4_1979 |
Symbol | |
ID | 4252552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shewanella sp. MR-4 |
Kingdom | Bacteria |
Replicon accession | NC_008321 |
Strand | + |
Start bp | 2353647 |
End bp | 2355149 |
Gene Length | 1503 bp |
Protein Length | 500 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 638118592 |
Product | L-arabinose isomerase |
Protein accession | YP_734109 |
Protein GI | 113970316 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0000173872 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.0652997 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCCT TCAAACAAAA ACAAGTGTGG TTTATCACGG GTTCGCAGGA TTTATACGGC CCAAAAGTAT TAGAGCAAGT CGCTAAAAAC AGTGAGCAAA TTGTTTATGG CTTTAATGAA TCTTCCGCCA TTTCCATTGA AGTGGTGTAT AAGCCAACCG TAAAATCCCC ACGTGAAATT CACGCCGTAT GTCAAGCGGC CAACAGCGAT GAAAACTGTG TTGGCGTTAT TCTGTGGATG CACACTTTCT CTCCTGCCAA GATGTGGATT GCTGGCCTTA ATGAATTAAG CAAGCCATTC ATGCACTTAC ACACTCAGTT CAATGCTGAG CTCCCTTGGA GCGAAATCAA TATGAACTAC ATGAACACCC ACCAAAGTGC TCACGGTTGC CGCGAATTTG GTTTTATCGG CACTCGTATG CGTAAAGAGC GCAAAGTGGT TGTGGGTCAC TGGCAATCGA GCGATGTACA GGCTCAAATC GATGATTGGT GCCGCGCAGC GGCGGGTTGG CACGAGAGCC AAAACCTGCG TATTGCCCGC TTTGGCGACA ACATGCGTCA AGTGGCCGTA ACCGAAGGCG ACAAAGTTGC CGCACAAATT CAATTCGGTT ACGAAGTGCA CGCCTACAGC TTAGGTGAAC TCAATGAAGC GATTGCAGCC ATTGCCGAAG GCGATGTAAC CGCACAACTC GACCGTTACG CCAGCGAATA CCAAGTGGGT AACGAGCTAT TTGGCGATGA ATACCAATTA GACCGTTTAA GAAAAGAAGC CAAGATTGAA CTCGGCTTAA CCCAATTCTT AACCCAAGGT GGATTTGGTG CCTTTACCAA CTGCTTCGAA AACCTCACCG GCATGACAGG ATTACCAGGA CTGGCTACCC AACGTCTGAT GGCGAACGGT TTCGGTTACG GCGGTGAAGG TGACTGGAAA ACGGCTGCCA TGGTGCGCAT CATGAAGGTG ATGGGCCAAG GCCGCGCCGG TGGTACTTCA TTTATGGAAG ACTACACCTA TAACTTTGGG GCGACTGACC AAGTTCTTGG CGCCCACATG TTAGAAGTGT GCCCATCGAT TGCTGCTGCA AAACCGCGTT TAGAAGTTCA CCGCCACACC ATTGGTGTGC GTTGTGACGT GCCACGTCTG TTATTCACAG GTAAAGCGGG CCCAGCAATC AACGTATCGA CTATCGATTT AGGCAACCGT TTCCGTATCA TTCTCAATGA ATTAGATACA GTGACACCAC CACAGGATCT GCCAAATCTG CCTGTCGCAT CTGCGCTGTG GGAGCCTCGT CCGAATTTAG CGGTTGCCGC CGCAGCTTGG ATCCACGCCG GTGGTGCTCA CCACTCAGCT TACAGCCAAG CTATCACGAC GGATCAGATT GTCGACTTTG CTGAAATGGC CGGTGCTGAA CTGGTTATCA TCGATGCCGA TACTAAGATC CGCGAGTTTA AGAATGAGCT TCGCCAAAAT TCCGTTTATT ACGGTTTAGC AAGAGGTTTA TAA
|
Protein sequence | MKAFKQKQVW FITGSQDLYG PKVLEQVAKN SEQIVYGFNE SSAISIEVVY KPTVKSPREI HAVCQAANSD ENCVGVILWM HTFSPAKMWI AGLNELSKPF MHLHTQFNAE LPWSEINMNY MNTHQSAHGC REFGFIGTRM RKERKVVVGH WQSSDVQAQI DDWCRAAAGW HESQNLRIAR FGDNMRQVAV TEGDKVAAQI QFGYEVHAYS LGELNEAIAA IAEGDVTAQL DRYASEYQVG NELFGDEYQL DRLRKEAKIE LGLTQFLTQG GFGAFTNCFE NLTGMTGLPG LATQRLMANG FGYGGEGDWK TAAMVRIMKV MGQGRAGGTS FMEDYTYNFG ATDQVLGAHM LEVCPSIAAA KPRLEVHRHT IGVRCDVPRL LFTGKAGPAI NVSTIDLGNR FRIILNELDT VTPPQDLPNL PVASALWEPR PNLAVAAAAW IHAGGAHHSA YSQAITTDQI VDFAEMAGAE LVIIDADTKI REFKNELRQN SVYYGLARGL
|
| |