Gene Information |
Locus tag | Arth_0803 |
Symbol | |
ID | 4446674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 867722 |
End bp | 870748 |
Gene Length | 3027 bp |
Protein Length | 1008 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 639688609 |
Product | glycoside hydrolase family protein |
Protein accession | YP_830301 |
Protein GI | 116669368 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0383] Alpha-mannosidase |
TIGRFAM ID | |
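
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes (this is not stated in the record) that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the stop codon while Protein Length does not. The function name `cds_lengths` is hypothetical and used only for illustration.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the gene length includes the terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the Start bp and End bp fields of this record.
gene_bp, protein_aa = cds_lengths(867722, 870748)
print(gene_bp, protein_aa)
```

Under those assumptions the result agrees with the listed Gene Length (3027 bp) and Protein Length (1008 aa).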
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.462233 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGCACGACG ACCGCCGCAT CACGGAAGTC CGTCTGGACC GCTTCATGCG CGAACGCGTG GACCCTGCGG TGTACTCCCG CAGCGTTCCG CTGAACCTCA GCGCCTGGGA CGTTCCGGAC GAGCCTGTCT CCGTTCTGGA GGCCCTGCGC CACGACTTCG TGCCGCTGGA ACACGGATCG GCGTGGGGCC GCCCCTGGAG CACCAAATGG CTGAGGCTGC AGGGTGAGGT GCCCGATTCC TGGGGCACGG CTCCCGATAC CGCGGTGGAA ATAGTGGTGG ACCTGGGGTT CACCCGGGAG CTCCCGGGCT TTCAGTGCGA AGGGATCGCC TGGCGGCCGG ACGGCACCAT CATCAAGGCC ATCTCGCCCC GGAACCAGTA CATTCCACTG AAGCTCCTCG GCAGCGGGAT GGCAGTGGAC TTCTACGTGG AGGCCGCCGC CAACCCCGAT GTTGCCCAGG GGTGGACTTT CGCAGCCATG CCCTACGGTG ACAAGGCAAC AGCGGGAAGC GACCCCAAAT ACCGGCTGGG CGCCATGGCC ATCGCCGAGC TCAACCAGAC GGTGTGGGAA CTGCAGCAGG ACGTATGGAC GCTCAGCGGA CTCATGCATG AGCTCCCGAT GGAACTGCCG CGCCGCCACG AGATCCTGCG CGCCCTGGAA CGGATGCTGG ACGTCATGGA TCCGGACGAT ATTCCCGGCA CCGCCGCGGC AGGCCGCGCA GCCCTGGCTG AGGTTCTCTC CCGTCCGGCG TATGCCTCCG CCCATCAGCT GGTCGCCACA GGACACGCGC ACATCGATTC GGCGTGGCTG TGGCCCGTCC GGGAGACCAT CCGCAAGTGC GCGCGCACCT TCTCCAACGT CGTGGCCCTG ATGGACGAGT CTCCCGACTT CGTTTTCTCC TGCTCGTCCG CGCAGCAGCT CGCCTGGATG AAGGAGTTCT ACCCCGAGCT GTTCGGCCGG ATCCGCGAGA AGGTCAAAGC GGGCAAATTC GTGCCGGTCG GCGGCATGTG GGTGGAATCC GACACCAACA TGCCGGGCGG TGAGGCCATG GCCCGGCAGT TCATCGAAGG CAAGGGGTTC TTCCTCGACG AGTTTGGCGT GGAGTGCCGG GAGGCATGGT TGCCCGATTC CTTCGGCTAC TCGGCTGCAT TGCCGCAGAT CGTCAAGGCT GCCGGCAGCA AGTGGTTCCT GACCCAGAAG ATCTCCTGGA ACCAGGTCAA CAGGATGCCG CACCACACCT TCAACTGGGA AGGAATCGAC GGCACGCGGC TGTTCACCCA CTTCCCGCCC GTGGACACTT ACAATTCGGA GCTGAGCGGC CGGGAACTGG CACATGCGGA ACGCAACTAC CGGGACCACG GCCGCGGAAC CGTCTCCCTG GTCCCGTTCG GCTACGGCGA CGGCGGCGGC GGACCGACAC GGGAGATGAT CGCCGCCGCC CACCGTACGG CCGATCTCGA AGGGTCGCCG AAGGTCCGGA TCGGAACGGC TGCGAATTTC TTCACGCAGG CGCAGGCCGA ATATGCGTCC CTGCCCGTCT GGGTGGGGGA GATGTACCTG GAGCTGCACA GAGGGACCTA CACCAGCCAG GCGAAAACCA AACGGGGCAA CCGGCGCAGC GAACACCTTC TCCGCGAGGC CGAACTGTGG TGTGCCACAG CATCAGTGCG CACCGCCGGC GGGTTCGCGT ATCCGGCGGC CGAGTTGAAG CGCCTGTGGC AGCTGGTCCT GCTGCAGCAG TTCCACGACA TCCTGCCCGG CAGTTCCATT GCCTGGGTCC ACCAGGACGC AGAGCGGAAC 
TATGCGGCCA TCGCGGAAGG CCTTGAAGCC ATCATTGCCG ATGCCGCGCG CGCCATGCTC GGTGAGGGCA GCCGCGAGTT CCTGCTGAAC GCCGCGCCGC ACGAACGCAG CGGAGTACCC GCTCTTGCCG CCGCCGAACC GGTCCGGAGC GACCACCCGG TGACGGTCAC CGAGCATGCC GGGGGATACA TCCTGGACAA CGGCGTGATC AGGGCCGTGC TGGACTCGAA CGGACTCCTG ACTTCCCTCA TCGACCACGC AAGCGGCCGC GATGCCATCG CCCCCGGCCA GTACGGGAAC CATCTGGAAC TACACCGCGA TACGCCCAAC GAGTGGGACG CGTGGGACAT TGACGAGTTC TACCGCCGCA ACGTCACCTC ACTGACCGAA GCGCGTTCGG TGACGCTCGA GCGGGGCGGC TGGGACGCCG TCGTCGTGGT GGAACGACTG GCGGGAGCAT CCGCGATCAC CCAGCGGATT TCGCTGGAGG CGGGTTCCGG CTCGCTGGGC ATCCTGACCT CCGTGGATTG GCAGGAACGC GAAAAGCTGC TCAAGATCGG ATTCCCCCTG GACGTGCGAG CAGACCGTTC GGCGTCAGAG ACGCAGTTCG GGCATGTCTT CCGGCCCACC CACACCAACA CATCCTGGGA GGCAGCCAAG TTCGAAATTT GTGCCCACCG CTGGATTCAT GTGGCGGAGC CGGGTTACGG CGTGGCGGTC ACCAATGCCT CCAGTTATGG ACACGACGTC ACCCGCACCG TGAGGGACGA CGGCGGCACC ACCACTACCG TCCGTACCTC GCTGCTGCGC GCACCCAAGT ATCCGGATCC CGACGCCGAC CGCGGGCGGC ACGAGCTGCT GGTGACCATC AGGCCCGGGG CGGCCATTGC TGACGCCGTG GAGGAGGGCT ACCGGACCAA CCTGGCCCCG CGGATCATGA GGGGCGCCAA CGCTGTCCTT CCACTGTTCA CGGTGTTCAA CCAGGGAATC GTGGTTGAGG CGGTAAAGCT GGCGGAGGAC GGTTCCGGTG ACGTCATTGT GCGTCTCTAT GAGTCCCTGG GGGAGCGGTC CGAGGGAATC GTGACAGCCA ATTTCGAAAC CAGGCAAGTG CAGGTAGTGG ACCTGCTGGA GCGTCCGGTT GCGGGCCCGG GTGTTGAAAC CGGCCGGGAT TCCGCAAAGC TGACGTTGCG TCCGTTCCAG CTGCTCACCC TGCGGTTTGC CCGCTGA
Protein sequence | MHDDRRITEV RLDRFMRERV DPAVYSRSVP LNLSAWDVPD EPVSVLEALR HDFVPLEHGS AWGRPWSTKW LRLQGEVPDS WGTAPDTAVE IVVDLGFTRE LPGFQCEGIA WRPDGTIIKA ISPRNQYIPL KLLGSGMAVD FYVEAAANPD VAQGWTFAAM PYGDKATAGS DPKYRLGAMA IAELNQTVWE LQQDVWTLSG LMHELPMELP RRHEILRALE RMLDVMDPDD IPGTAAAGRA ALAEVLSRPA YASAHQLVAT GHAHIDSAWL WPVRETIRKC ARTFSNVVAL MDESPDFVFS CSSAQQLAWM KEFYPELFGR IREKVKAGKF VPVGGMWVES DTNMPGGEAM ARQFIEGKGF FLDEFGVECR EAWLPDSFGY SAALPQIVKA AGSKWFLTQK ISWNQVNRMP HHTFNWEGID GTRLFTHFPP VDTYNSELSG RELAHAERNY RDHGRGTVSL VPFGYGDGGG GPTREMIAAA HRTADLEGSP KVRIGTAANF FTQAQAEYAS LPVWVGEMYL ELHRGTYTSQ AKTKRGNRRS EHLLREAELW CATASVRTAG GFAYPAAELK RLWQLVLLQQ FHDILPGSSI AWVHQDAERN YAAIAEGLEA IIADAARAML GEGSREFLLN AAPHERSGVP ALAAAEPVRS DHPVTVTEHA GGYILDNGVI RAVLDSNGLL TSLIDHASGR DAIAPGQYGN HLELHRDTPN EWDAWDIDEF YRRNVTSLTE ARSVTLERGG WDAVVVVERL AGASAITQRI SLEAGSGSLG ILTSVDWQER EKLLKIGFPL DVRADRSASE TQFGHVFRPT HTNTSWEAAK FEICAHRWIH VAEPGYGVAV TNASSYGHDV TRTVRDDGGT TTTVRTSLLR APKYPDPDAD RGRHELLVTI RPGAAIADAV EEGYRTNLAP RIMRGANAVL PLFTVFNQGI VVEAVKLAED GSGDVIVRLY ESLGERSEGI VTANFETRQV QVVDLLERPV AGPGVETGRD SAKLTLRPFQ LLTLRFAR
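
The gene sequence begins with TTG, yet the protein sequence begins with M. This is consistent with translation table 11 (bacterial, archaeal and plant plastid code), in which TTG is one of the alternative initiation codons and is read as methionine at the start of a CDS. The following is a minimal sketch of that behaviour, not the tool that produced this record; `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical names, and the start codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest),
# paired with the amino acids of translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for table 11; at position 0 they are read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS with table 11, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons initiate with methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCACGACG ACCGCCGCAT CACGGAAGTC"))
```

The first ten residues produced this way match the start of the listed protein sequence (MHDDRRITEV).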