Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0224 |
Symbol | |
ID | 4447315 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 233867 |
End bp | 235402 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639688020 |
Product | L-arabinose isomerase |
Protein accession | YP_829725 |
Protein GI | 116668792 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2160] L-arabinose isomerase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACC CCAATTACAC ATCCGCCAAC GGAACATCGC TCAGCCAGTA CGAGGTCTGG TTCCTCACCG GCAGCCAGCA CCTGTACGGC GAGGACGTCC TCAAACAGGT CGCAGCGCAG TCGCAGGAGA TTGCCGACGC GTTGAACGGA TCCTCAGACG TTCCGGTCAA GGTGGTCTGG AAGCCCGTCC TTACGGATTC GGACGCCATC CGCCGCACCG CGCTGGAAGC CAATGCCGAC GATTCCGTGA TCGGCGTGAC GGCATGGATG CACACGTTCA GCCCGGCCAA GATGTGGATC CAGGGCCTGG ACCTGCTGCG TAAACCGTTG TTGCACCTGC ACACCCAGGC CAACGTTGAG CTGCCTTGGG CGGACATCGA CTTCGACTTC ATGAACCTCA ACCAGGCCGC CCACGGCGAC CGCGAATTCG GCTACATCCA GTCCCGCCTG GGCATCCCCC GCAAGACCGT GGTGGGCCAC GTGTCCAACC CGGAGGTCAC CCGGCAAGTG GGCGTCTGGC AGCGCGCGTC CGCCGGCTGG GCCGCCGTCC GCACTCTGAA ACTGACCCGC TTCGGCGACA ACATGCGCAA CGTGGCCGTC ACCGAAGGCG ACAAGACCGA GGCCGAGCTC CGCTTCGGCG TCTCTGTGAA CACCTGGTCC GTGAATGAGC TCGCCGACGC CGTGCACGGC GCCGCGGAGT CCGACGTCGA CGCGCTCGTT GCGGAGTACG AGCGCCTCTA CGAAGTGGTC CCCGAGTTGA AGGCCGGGGG AGCGCGGCAC GAATCGCTGC GCTACAGCGC CCGGATCGAA CTGGGCCTGC GCAGTTTCCT CGAGGCCAAC GGCTCGGCCG CGTTCACCAC CTCCTTCGAG GACCTGGGTG AACTGCGCCA GCTGCCCGGC ATGGCCGTGC AGCGGCTGAT GGCGGACGGC TACGGCTTCG GCGCCGAGGG CGACTGGAAG ACCGCCATCC TGGTCCGCGC CGCCAAAGTG ATGGGCTCTG GCCTGCCCGG CGGTGCATCA CTAATGGAGG ACTACACCTA CCACCTCGCC CCCGGCCAGG AAAAGATCCT GGGCGCGCAC ATGCTGGAGG TCTGCCCGTC GCTGACCGCC ACCAAGCCGC GCGTCGAGAT CCACCCGCTG GGCATCGGCG GCAAGGAAGA CCCCGTCCGC ATGGTCTTTG ACACCGACGC CGGCCCTGGC GTTGTAGTGG CGCTGTCCGA CATGCGCGAC CGCTTCCGCC TCGTGGCGAA CGCCGTTGAC GTCGTGGACC TGGACGAGCC CCTGCCCAAC CTCCCGGTGG CGCGTGCGCT GTGGTCTCCG AAGCCGGACT TCGCGACCTC CGCCGCGGCC TGGCTGACTG CCGGCGCGGC CCACCACACG GTGCTCTCCA CCCAGGTGGG CATGGACGTG TTCGAGGACT TCGCCGAGAT CGCGAAGACC GAGCTCCTCA CCATCGACGA GGGCACCACC ATCAGGCAGT TCAAGAAGGA ACTGAACTGG AACGCCGCCT ACTACAGGCT GGCCGGCGGG CTCTAA
|
Protein sequence | MSNPNYTSAN GTSLSQYEVW FLTGSQHLYG EDVLKQVAAQ SQEIADALNG SSDVPVKVVW KPVLTDSDAI RRTALEANAD DSVIGVTAWM HTFSPAKMWI QGLDLLRKPL LHLHTQANVE LPWADIDFDF MNLNQAAHGD REFGYIQSRL GIPRKTVVGH VSNPEVTRQV GVWQRASAGW AAVRTLKLTR FGDNMRNVAV TEGDKTEAEL RFGVSVNTWS VNELADAVHG AAESDVDALV AEYERLYEVV PELKAGGARH ESLRYSARIE LGLRSFLEAN GSAAFTTSFE DLGELRQLPG MAVQRLMADG YGFGAEGDWK TAILVRAAKV MGSGLPGGAS LMEDYTYHLA PGQEKILGAH MLEVCPSLTA TKPRVEIHPL GIGGKEDPVR MVFDTDAGPG VVVALSDMRD RFRLVANAVD VVDLDEPLPN LPVARALWSP KPDFATSAAA WLTAGAAHHT VLSTQVGMDV FEDFAEIAKT ELLTIDEGTT IRQFKKELNW NAAYYRLAGG L
|
| |