Gene Information |
Locus tag | Arth_1840 |
Symbol | |
ID | 4445634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 2068089 |
End bp | 2070419 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639689658 |
Product | alpha-L-rhamnosidase |
Protein accession | YP_831330 |
Protein GI | 116670397 |
COG category | |
COG ID | |
TIGRFAM ID | |
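The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2068089
end_bp = 2070419
protein_length_aa = 776

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_length = codons - 1

print(gene_length_bp, remainder, expected_protein_length)  # 2331 0 776
```

Under these assumptions the computed values agree with the listed Gene Length (2331 bp) and Protein Length (776 aa).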
|
|
Plasmid Coverage Information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGACTGCCC AGATCGATGC CACTGCCCAG GACCTGCTCG CCAAGGCGAC CTGGATCCAG GCCGCTGAAG ACACTGTCCC TCCGGCCGGC GCGCGCCCAG CCTACGAATT CCGCACCACC TTCACCCTTC ACGGGGCGCC TCGGCACGCC ACCCTGGCGG CCACCGCCCA CGGGATCTAT GAAGCCTTCA TCAACGGGGT CCGTGTTGGC GATGAGGAAC TGACGCCCGG CCTGACCAGT TACGCCAAGA CCCTCTACGT GCAGCACCAC GAGGTCACCG GCTTGCTGGA GACGGGTTTG AACGAACTGC GGCTGGTGCT CAGCGATGGA TGGTTCCGCG GACGATGCGG ACCCAGCCGG GTTCCTGACA ACTTCGGAGT ACATACCGCT CTTGTCGCAC AACTCAACCT CGAAACATCA GCCGACACCA TCATCATTGC CACGGGCCCT GACTGGGAAT ACGGCACCGG CTCCATCACC GCCGCAGACC TCATGGACGG GCAAACGAAC GATTTCACGA GGCTGGACGA CATCCCATGG CAGCCCGTCA GGGTCGCCGA CAACGCGTTG ACCCTTGATC GATCCCGGCT CGCCTTTTCG CCCGCGCCTC CCGTGCGGCG CATCCGTCAA TACCCGGCTA TTGACGTCAC CAGGCTCTCC CGTGGCAGAC AGATCGTGGA CTTTGGACAA AACCTCAACG GCTGGGTCCG ACTGTCAGCG CTCGGACCGG CAGGAACCAC CACCAAGCTC ACCCACGGAG AAGCCCTCGA CCCCACCGGT GACCTGACCA CCGCGCACCT CGCCTACACC CCGTACCCGG ACCCCAACCC CCTGCCCACA GGCCAAACCG ACACCGTTAT CTCCCGAGGG CATCCCGGCG ACGTGTTCGA ACCACGCCAC ACCACCCACG GGTTCCGGTA CGTTGCCGTC GATGGCCTTC TTGAGGACCT CAACCCGGCC GACATCGTCG CCGTCCTCGT GCACACGGAC TTGGCAGCTA CCGGCACTTT CGAATGCAGC GACGAACGTG TCAACCGACT GCACCGCATC GCTGAGGCCA GCTGGCACGC GAACGCGTGC GACGTGCCCA CTGACTGCCC GCAACGGGAA CGCTGGGGCT ACACCGGCGA CTACCAAATT TTCGTCCGCA GCGCCGCGTT CCTCGACGAC ATCTATGGTT TCTCGAGGAA ATGGCTGCAG TCGCTGGCTG ATGACCAGCT CGACAACGGC TGCATCACCA ACGTTGCCCC CAACACGGGC GTGGTCGACA ACCCCGAGAT CCCCTTCTCC TTTGACGGGT CCGCAGGCTG GGGGGACGCG GCAACCATCG TGCCCTGGCA GCTCTACGTA ACCTACGGCG ACGCCCGCGT GCTCGAAGAT AGCTTCGAGA TGATGACCCG ATGGGTGGAC TACATCGCCG GCCTCGCCGC GACCGGGCGG CACCCCTCCC GCCAGGAGAG CCGAGCGGAA CCTGCGCCCC ATGAAAGCTT CCTTTGGGAC TCAGGCTTCC ACTGGGGAGA ATGGGCCGAG CCCGGGGGAG CCTTCGACTT CTTTGGCGAC AAAGGCATCG TGGCCACCGC ATACATGGCC CGTTCCGCGG ACATCGTCTC CAAAGCCGCC GCGATCCTCG GTAAAAAGGA GCTCGCCCAC CACTACCGGG AACTTCATGC CAACGCCCTT GACGCCTGGC GCACCGAGTA CCTCACCCCG GCAGGTCACC TGACCCTCGA ATCGCAAGCC AACTACGTGC GGGGTCTGGC TTTCGGGCTG ATCCCTGCAG AACTCGAAAC AAACGCCGTC 
AACCGTCTTG TCGAGCTCAT CCGGGAAAAG GACAACCATC TGTCCACAGG ATTCCTCTCG ACACCATTCC TGCTGCCGGT ACTGGCGGAC CACGGCCGGA CAGATGTCGC CTATGAACTG CTGTTCCAGG ACACCGCTCC GAGCTGGATG ACCATGCTCG ACAGGGGTGC GACCACGATT TGGGAATCCT GGGAAGGAAT CACCGAAGAC GGCATGATGC ATGAGTCGCT GAACCACTAC TCCAAGGGAG CGGTCATCAC GTTCCTTCAC GAATACGTCG CCGGCATCCG TCCCGACGAA AACCACCCCG GGTATGAGCA CTTCACTATC GAACCCAAGC CCGGCCACGG ACTGAACTGG GCACGCGCAA ACCTCCGTGC AGCGCGCGGG GCTATCACCA GCGCATGGCA CATCGACGAC GAGACGTTCA CACTCGACGT GACCGTGCCC GCGGGGGCAA CCGCCCGCAT TGTCATGCCC ACCGGAACAC AGCACACAGC CGGGCCGGGA AACCACACCT TCAAGGAATA G
|
Protein sequence | MTAQIDATAQ DLLAKATWIQ AAEDTVPPAG ARPAYEFRTT FTLHGAPRHA TLAATAHGIY EAFINGVRVG DEELTPGLTS YAKTLYVQHH EVTGLLETGL NELRLVLSDG WFRGRCGPSR VPDNFGVHTA LVAQLNLETS ADTIIIATGP DWEYGTGSIT AADLMDGQTN DFTRLDDIPW QPVRVADNAL TLDRSRLAFS PAPPVRRIRQ YPAIDVTRLS RGRQIVDFGQ NLNGWVRLSA LGPAGTTTKL THGEALDPTG DLTTAHLAYT PYPDPNPLPT GQTDTVISRG HPGDVFEPRH TTHGFRYVAV DGLLEDLNPA DIVAVLVHTD LAATGTFECS DERVNRLHRI AEASWHANAC DVPTDCPQRE RWGYTGDYQI FVRSAAFLDD IYGFSRKWLQ SLADDQLDNG CITNVAPNTG VVDNPEIPFS FDGSAGWGDA ATIVPWQLYV TYGDARVLED SFEMMTRWVD YIAGLAATGR HPSRQESRAE PAPHESFLWD SGFHWGEWAE PGGAFDFFGD KGIVATAYMA RSADIVSKAA AILGKKELAH HYRELHANAL DAWRTEYLTP AGHLTLESQA NYVRGLAFGL IPAELETNAV NRLVELIREK DNHLSTGFLS TPFLLPVLAD HGRTDVAYEL LFQDTAPSWM TMLDRGATTI WESWEGITED GMMHESLNHY SKGAVITFLH EYVAGIRPDE NHPGYEHFTI EPKPGHGLNW ARANLRAARG AITSAWHIDD ETFTLDVTVP AGATARIVMP TGTQHTAGPG NHTFKE
|
| |
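The gene sequence starts with TTG while the protein starts with M, which is consistent with translation table 11, where TTG is one of the permitted initiation codons and the initiator is read as methionine. The following is a minimal sketch of that relationship, not the database's own pipeline. The helper names (`clean`, `gc_fraction`, `translate_cds`) are hypothetical, and the code assumes the sequence is given in the 10-base, space-separated blocks shown above.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI layout).
# Table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS, or a 5' fragment of one, that starts at its start codon."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("first codon is not a table 11 start codon")
    protein = ["M"]  # the initiator codon is read as methionine
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_30 = "TTGACTGCCC AGATCGATGC CACTGCCCAG"
print(translate_cds(first_30))  # MTAQIDATAQ
print(CODON_TABLE["TAG"])       # * (the final codon of the gene is a stop)
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` in the same way would allow the listed protein sequence and GC content to be compared against the nucleotide record.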