Gene Information |
Locus tag | Arth_3917 |
Symbol | |
ID | 4443730 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4420956 |
End bp | 4424018 |
Gene Length | 3063 bp |
Protein Length | 1020 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691745 |
Product | glycoside hydrolase family protein |
Protein accession | YP_833392 |
Protein GI | 116672459 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
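The coordinate, gene-length, and protein-length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 4420956
END_BP = 4424018


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 3063, matching "Gene Length"
print(protein_length(length_bp))  # 1020, matching "Protein Length"
```

Under these assumptions, 3063 bp corresponds to 1021 codons, of which the last is the stop codon, leaving the 1020 residues reported.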
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCGTTG AGCCCACAAC ATCCAGCCTC TCCTCCGCTG CGGGCCGCGG CGCTGGACCC GGGTCCGAGG CAGCGGTCAG GCAGGCGGTG CACGCCCCGG AGCGGATCCT CGACGTCGCC GCCGAGCAGG TGGCCGCACT CGCTTCCGTA GCCCCGCCAC GGGGCACGAC GCCGGCACGG GCCTACCTTG TGAGTGATGC CCCGCGTCTG CCGCTGAACG GTGTGTGGAA GTTCAGGCTT CACGCGGGCA TCCGGCAGGC GCCACACGAC GGCTGGCAGT CGGGCGGCGA TCGTCCGGAT TTCGGCGACC TCCCGGTACC GTCCAGCTGG CCCATGCACG GCCACGGGAC GCCCTGGTAC ACGAACGTTC AGTTCCCCTT CGCCGTGGAA CCGCCGCATG TTCCTGACGC CAATCCCGTC GGCGATCTCT TCGTCGAGTT TGAGGCGGGC CCGGAGTACT TCCCGTCAGC CTTGATCCGC TTCGACGGCA TAGAATCGGC CGGAACTGTA TGGCTGAACG GCACCTTGCT TGGCACCACA CGCGGCAGCC GGCTCGCCCA CGAATTCGAT GCCACGGGTG TGCTGGTACC GGGACGGAAC GTGCTTGCAG TCCAAGTGGC GCAGTTCTCG GCAGCCAGCT ATGTGGAAGA CCAGGACATG TGGTGGCTGC CCGGCATCTT CCGGGATGTG ACGCTGCAGG CGCGGCCGTC GGAGGGGCTC GACGACGTCT TCGTCCACGC TGACTATGAC CATCGCAGCG GCGAAGGCAT CCTCCGCGTA GAGGCGAGCC GGGCAGGGCA GCCGGCTGCC GCCGTCGTGC GCATCCCTGA ACTGGGGCTG GAAGTGCCCG CCGGGGAGGA GCGTCGGCTC CCGGGCATCG AGCCGTGGTC GGCAGAATTG CCGCGGCTGT ACGACGCCAC CGTCAGCACC ACGGGGGAGA CCGCCTCGGT GCAGCTCGGC TTCCGCACCA TCAGTATCGA GGATGCCCAG TTCAAGGTGA ACGGCCAGCG GATCCTGCTC CGAGGCGTCA ACCGCCACGA ACACCATCCC AAGCTGGGCC GCGTGGTGCC CAGGGAGGTC ATGGAAAGCG AACTGCGGCT GATGAAGCAG CACAACATCA ACGCCATCCG GACCTCGCAC TACCCGCCGC ACCCGGACTT CCTGGCGCTG GCGGACCAAC TTGGCTTCTA CGTGGTCCTT GAATGCGACC TCGAAACGCA TGGCTTCGAA CAAGGCGGCT GGCAGCAGAA TCCCAGTGCC GATCCGCAGT GGCAGCAGGC CCTCGTGGAC CGAATGTCCC GGACCGTGGA GCGCGACAAG AATCACGCGT CGGTGGTTAT GTGGTCCCTG GGCAACGAGG CCGGCACGGG CGAAAACCTT GCAGCGATGT CCAGCTGGAC CAAACTCCGC GACCCCTCCA GGCCCATCCA TTACGAGGGC GACTGGAGCT CCGCCCACGT GGACGTATAT TCGCGGATGT ATGCAAGCCA GGCGGAAACG GAGCTGATCG GCCGTGGCGT GGAACCTCCT TTGGATGACG CGGACCTTGA CGCGCGGCGG CGCGCCATGC CGTTCGTCCT GTGCGAGTAC GTTCACGCCA TGGGCAACGG CCCGGGGGGC ATGTCCGAAT ACCAGGAACT TTTTCACCGG CACCCGCGGC TGATGGGCGG CTTTGTGTGG GAGTGGCTGG AGCATGGCAT CACCCGAACG GCTCCTGACG GGCAGGAATA CTTCGTCTAT GGCGGCGACT TCGGCGAGGA AGTCCACGAC GGAAACTTCG TCACTGACGG CCTGGTGGAT 
GCCAACCGGG TGCCGCGCCC GGGCCTGCTG GATTTCAAGA AGGTCATCGA GCCGCTAACC ATTGACGTGG CGGCGGACTG GTCCGGGTTC TCGCTCACGA ACCGCTTCGA TTTCGCCGAC ACATCGGGGC TGGAGTTCCG CTACTCTGTG GAGGCCGACG GCGTCACGGT CGACGGCGGT CCGGTTGACG TTGCGCCTGT GGCGCCGCAC GAATCAGCCG ACCTCCGGTT GCCGGCCGGG CTGCTGGAAC GAGCGGGGCT GGCTCCTGCC GTGCTGACGG TCAGGGCAGT CCTCAAGGAA GACGCCGCCT GGGTGGACGC GGGCCACGAA GTTGCGTGGG GCCAGAAGTC TGCCAATGTG CGGGTGGCTC CGGTGCCGGC GGCAACTTCC GCCGTGCAGG TGGACGGTGG CCTCCTTCGC CTGGGGCCGG CTGCCTTCGA TCGCGCGACG GGTTCGCTTG TTCGGCTGGG CGACACTGCA GTCGAGGGGC TTCGACTCCT GCTGTGGCGT GCACCCACGG ATAATGATCT TGGGGCGGAG TGGGGTAGCC CGGATCCCCG CCCAGTAGCC ACACAGTGGC TGGATGCGGG ACTCAACCGG ATGCACGCCC GGCTTGTCGG CATTTCCTCC CGGCCCGCGG CAGGCGGCGG GGACGAGCTG GTGGTGCGGA CGCGGGTTGC AGCGGCAGGT AAGCAATTCG GGGTGCTGGC CGAATACGCC TGGACCAGTG ACGGCTCCAG TGTCAGCGTG CGGACAACGG TCACACCCGA CGGTTCGTGG GTCAACGCCG GCTGGCCAGT GCCGTGGGCG CGCATAGGAG TGGAACTCGT GATGGGTTCG GCCGCCAAGT CCGTGGAGTG GTTCGGCCAG GGGCCGCACC ACAGTTACCC GGATACAGGC CAGGGCACCA GGCTTGGCTG GTACGACTTG CCCTTGAAGG ACCTGGACGT GGAGTACGTC CGGCCCCAGG AATCGGGCGC CCGTTCCGGC GTGTACTCGG CAGCGTTGGA GCTCGACGCC GGCCGGCTTA CCATCGGCGG CGAGCCGTTC GCACTCACCG TGCGCCCCTA CAGCCTGGCG GCACTGGAAG CCGCCACCCA TCGCCCGGAC CTTATTCCGG ACGGCCGCAC CTACGTGTAC CTCGATCACG CCGTACTCGG AGTCGGAACT GCGGCCTGCG GTCCGGGGGT GCTGGAAGGT TACCGGCTGG CGCCGCGGGA AGCCGACTTC TCGTTGGTCT TTGGTGTGAC TCAGTCCACA TAG
|
Protein sequence | MPVEPTTSSL SSAAGRGAGP GSEAAVRQAV HAPERILDVA AEQVAALASV APPRGTTPAR AYLVSDAPRL PLNGVWKFRL HAGIRQAPHD GWQSGGDRPD FGDLPVPSSW PMHGHGTPWY TNVQFPFAVE PPHVPDANPV GDLFVEFEAG PEYFPSALIR FDGIESAGTV WLNGTLLGTT RGSRLAHEFD ATGVLVPGRN VLAVQVAQFS AASYVEDQDM WWLPGIFRDV TLQARPSEGL DDVFVHADYD HRSGEGILRV EASRAGQPAA AVVRIPELGL EVPAGEERRL PGIEPWSAEL PRLYDATVST TGETASVQLG FRTISIEDAQ FKVNGQRILL RGVNRHEHHP KLGRVVPREV MESELRLMKQ HNINAIRTSH YPPHPDFLAL ADQLGFYVVL ECDLETHGFE QGGWQQNPSA DPQWQQALVD RMSRTVERDK NHASVVMWSL GNEAGTGENL AAMSSWTKLR DPSRPIHYEG DWSSAHVDVY SRMYASQAET ELIGRGVEPP LDDADLDARR RAMPFVLCEY VHAMGNGPGG MSEYQELFHR HPRLMGGFVW EWLEHGITRT APDGQEYFVY GGDFGEEVHD GNFVTDGLVD ANRVPRPGLL DFKKVIEPLT IDVAADWSGF SLTNRFDFAD TSGLEFRYSV EADGVTVDGG PVDVAPVAPH ESADLRLPAG LLERAGLAPA VLTVRAVLKE DAAWVDAGHE VAWGQKSANV RVAPVPAATS AVQVDGGLLR LGPAAFDRAT GSLVRLGDTA VEGLRLLLWR APTDNDLGAE WGSPDPRPVA TQWLDAGLNR MHARLVGISS RPAAGGGDEL VVRTRVAAAG KQFGVLAEYA WTSDGSSVSV RTTVTPDGSW VNAGWPVPWA RIGVELVMGS AAKSVEWFGQ GPHHSYPDTG QGTRLGWYDL PLKDLDVEYV RPQESGARSG VYSAALELDA GRLTIGGEPF ALTVRPYSLA ALEAATHRPD LIPDGRTYVY LDHAVLGVGT AACGPGVLEG YRLAPREADF SLVFGVTQST
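The gene sequence is listed in coding orientation (it begins with ATG and ends with TAG), so it can be translated directly and compared with the protein sequence. The following is a minimal sketch rather than the database's own method: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. The GC helper is included only to show how a figure like the reported GC content could be computed.

```python
from itertools import product

# Codon -> amino acid map in the conventional T, C, A, G ordering.
# Table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence above.
print(translate("ATGCCCGTTG AGCCCACAAC ATCCAGCCTC"))  # MPVEPTTSSL
```

The first ten residues obtained this way agree with the start of the listed protein sequence; passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.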
|