Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0404 |
Symbol | |
ID | 4447099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 431280 |
End bp | 435083 |
Gene Length | 3804 bp |
Protein Length | 1267 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639688203 |
Product | levanase |
Protein accession | YP_829905 |
Protein GI | 116668972 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAAAGA GGTTAGTTGC AGGTATTACC ACGGCCGCGC TAGGCCTGGG CGCGCTGGGA GCTGGCTCGC TGCTGGCCCC GGCCATGCTG GCCGCGCCCG CCCAAGCCGC TCCAGGCGGC CCTTTCCCCG CCGGGCCGAT CACGCTGGGC AGCAGCAACA GTCCCATGCC GGACGTGGTG GGCATCAGCG GAAACTGGAC CAGGCAGTCC GACGGCGGCC AGTCCGCCGT CGCCTCGCCG GACCAAAACG CGGCCGCCAT CAGCGAACAG CGCATCAGCT CCACCGCCCG GTACACGGCC GGAGTCACAG TCGATCCCGG CAGCCCGTAC GCCGTAGGCG CCCTGGTTTT CCGCAGTGCT GCCGATGCCA GCAGCGGCTA TGCGGCCACC ATCGATCCCA ACCTGGACAG GGTGCGGCTG TTCGATCTGG CCACTGGACA GGATGTGGCA CCTGCCGCCA CGGTCCCGCT GGACACGGGG CGGAGCTACT CCGTGGACGT CCACGTGGAC GGGCCCCGGA TCTATGTGGC GGTGGACGGA GTGCCGCGGA TCGACGCCAC CGACCAGCGG TACCAGGACG GCCATGTGGG CCTCCACGCC TACAACGGCT CGGTGAACTT CAGCGAGCCC CGGGTGCGCA CCATCGACGC CAACGTCAGT GGCTGGCAGG CGGCGGACGG TGCCGGCTGG ACCGCCTCGG CCACCGGCCT TCGGGGGACA GCTCCGGGCG GCACCAACAT CAGGGCTGTC GCCACCGACC AGGCACCGCA GGTCACGGAT TTCACCGCTG ATGTCCAGGT GACGTCGCCT TACGCCGTCG GCGCGGTCCT CTTCCGCACC AACACTGCCG GAACCACCGG GTACGGGGCC GAAGCGGACG CCAACGCCGG CAGGCTGCGG CTGTACCGTA TCGAGGACAA CGCCACCCTG GGCACCTTCG CGACCACCAT CACCGTCAAC GCGGTATACC GGCTGCGCGT CACGGCCGAC GGCGGGCAGC TCGCCGTGCA CTGGCAGACG GACCTGCTGG ACCCCAACGG CTACGCCCCG GCCATCACCG CAACAGACAC AGCGCACGGC TCCGGGCACA TTGGCCTCCT CGCCTTCAAC GGCAGCACCG TGTTCCAGGG CATGACCCTC AAAGGCCTGG ACACTTCCCT GCAGGGCTGG CGCACCGCCT CGGGAAGCTG GGAACCGGAC GTCCGCGGCC TGCGCGGGGC AACCGACGGC CTGGCCGCCG GCGCGGATGC GGCACGCTTC GTCCCGGCGG TGGCGTCCGA CGTCGTCGCG TCCCTTGACC TGGACGTCGC CGCCCCGGCA ACGGCCGCCG TCGTGGTCCG GAGCGCGCCC GACGGCACCG GCGGCACCGA ACTCAGGGTG GACCCCGGAG CAGGAACGGT GCAGCTGAAG GACCGCACCA CCGGGACCGT CCTTGCCGGC GGGGCCGTAC CGCCGGACAG CTTTGCCGCC GGCCAGCTCA ACCGCATCCA ACTCACCGTC CGCGGCGGGC AGGCAACCGC GCTGATCAAC GGCGTCCAGT CCGTTGCCGG GCCGGCCGGG CCGGCGACAG GCAGCGGCTT CGGTCTGCGC GTCAGCGGCG GCGGCGCCTA CTTCCAAAAC GTCCGGGCAG ACGACGTCGC CAGCTACATG AACGGCCTCT ACCAGCCCGG CTACCACTAC AGCCAGAATT CCGGGAACAG TTCGGACCCC AACGGCCTGG TGTACTTCGA TGGCGAATAC CACCTGTTCC ACCAGGACCG CGGCCGCTGG GCGCACGCCG TCAGCACCGA CCTGCTGCAC TGGAAGCAGC TGCCCATAGC CCTGCCGCAC CTGGCTGCGG GGGAGTCATG GTCCGGCTCC GCGGTGGTGG ATGCCAATGA TTCCAGCGGA CTGTTCGACG GCGGGCAGGG GCTCGTTGCG TTCTACACGA GCTTCAACCA CGATGCCGCC AACGGCAACC AGTCCGTGCG CGCCGCGTAC AGCAAGGACC ACGGCCGCAC CTGGTCCATC GTGCAGGCCC AGCCCGTGGT GGAAAACCCC GGCGGTCCTG CCGGCAGCTG GGACTTCCGC GACCCCAAAG TCACTTGGGA CGCCGCCACC GGCACATGGA TCATGGTGGT GGCCGGCGGG GACCACCTGC GCTTCCACAC GTCCACCGAC CTGGTCCACT GGACCTTCAC CAGCGCCTTC GGCTACGGCG ACTGGGTCCG CGGCGGGGTC TGGGAATGCC CGGACTTCTT CGAGCTGCCG GTGGAAGGGC AGCCCGGCGC CAGACGGTGG GTGCTCTGGT GGAGCACCGG TGCCGTCAGG CCCACCAACG GCTCCGCGGC GCAATATGTC ACCGGAACAT GGAACGGAAC GTCCTTCACC CCGGACACCG GCCCGGACGA GGTGCTCCAG GCCGACTCCG GCCGCGACTA CTACGCTGCC ATGAGTTTCT TCGGTGCCCC GGACGGGCGG CGGATCATGC TGGGCTGGAT GAGCAACTGG GACTATGCCT TCAGCCCGCC CACCGGCCGC TGGAACGGGC AGCTGAGTGT CCCCCGGCAG CTGAGCCTGA AAGACATTCC GGGCGTCGGG CCCCGGCTGG CGCAGGAACC CATAACCGAA CTGGCAGGGC TGCGCACGTC CACGTGGCAG GCGTCCGACG TCACGGTCAC GCCGACGTCG GCCAACCCGC TGTCCGCTGC TTCCGGCCGG TCGTTCGAGC TGGAGGCCGA GGTGGCCATC CCGTCCTCCG GCGGGGCTTC GGGCTTCACC TTCGGGCTCC GCAAAGGCAC TGCGGGCGGC GCAGCCGGAG GGGAAGCCCA GGAGACACTG CTCCGCTACG CCGCCGGCAC GGGGACAGCC TCGGCGACCG GCGCCGGCAC CGGCACCGGC ACCGGCACCG GCACCATGAC GGTGGACCGC GGGCAGTCCG GCCGGGCAGA CTTCACCCGC TATTTCGCGG GTGCCGCGGC GGACAATGCC AGCACGGCAT GGAGTTCCGA AACAGTGGCG CTGGCCGGAG GGGGGAGCGA GCGGCGGGTG AAGCTGCGGG CCCTGGTGGA CTCCTCCTCG GTGGAGCTCT TCGGCGGGGA CGGCACGGCA GCCATCACCT CCCTGGTCTT CCCGGACCCG TCCTCCACCG GGCTGTCTTT CAGCACCACC GGCGGGTCCG CGCGGCTGGT GTCCGCCAAA GTACACCAGC TGGCGGACAC CTCCCGCCTC ACTGCGGCGG TGCCGTCGGC CGTCCTTGGC CCGGTGAGCG GCGCGGCCCG GCACAACCTG GATTCCTACT CCGTGGTTCC CGGCGGCCGC TGGGAAAGCA CGGGAGCAGG GCTGGCCGGC ACGTTCGACA AGGACTCCAC CGCCCTGAGC GCCGCCACGT ACTCGGATGT CCGGGTGGCC GCGACAGTCC GTTTCGGCAG CGGACCCTAC GCGGGGGCCA TGCTCAACAG CGACAGGGTG CCGGAACGGG GTTACGGCGG CGCGGGGTCG GTGCTGCTGC GGGCCTCGGC CGACGGTGCC ACGGCGTACT ACGTCAACCT CGACCCGAAC CTGCGGCTGG CGCGCATCTT CAAGCTGCAG GACGGCGTCT TCGATTCCGC CGCCAGCGTC CTGGCCAGTG TCCCCGTGCT GCTCAGCCAC GGCGTCAGTT ACAGCGTGGA AGCCGCCGCG GCGGGGGAGC GGCTGACGGT GAAGCTCGAC GGCGTGGAAA TCCTCGCCGT CGATGATGCC TCCCTGTCCG CCGGCAAGGT GGGACTGAAC GTGTTCGACG GCCGGGCAGC CTATCAGGAC GTGCTGGTGA CAGGCTCGGG ATGA
|
Protein sequence | MSKRLVAGIT TAALGLGALG AGSLLAPAML AAPAQAAPGG PFPAGPITLG SSNSPMPDVV GISGNWTRQS DGGQSAVASP DQNAAAISEQ RISSTARYTA GVTVDPGSPY AVGALVFRSA ADASSGYAAT IDPNLDRVRL FDLATGQDVA PAATVPLDTG RSYSVDVHVD GPRIYVAVDG VPRIDATDQR YQDGHVGLHA YNGSVNFSEP RVRTIDANVS GWQAADGAGW TASATGLRGT APGGTNIRAV ATDQAPQVTD FTADVQVTSP YAVGAVLFRT NTAGTTGYGA EADANAGRLR LYRIEDNATL GTFATTITVN AVYRLRVTAD GGQLAVHWQT DLLDPNGYAP AITATDTAHG SGHIGLLAFN GSTVFQGMTL KGLDTSLQGW RTASGSWEPD VRGLRGATDG LAAGADAARF VPAVASDVVA SLDLDVAAPA TAAVVVRSAP DGTGGTELRV DPGAGTVQLK DRTTGTVLAG GAVPPDSFAA GQLNRIQLTV RGGQATALIN GVQSVAGPAG PATGSGFGLR VSGGGAYFQN VRADDVASYM NGLYQPGYHY SQNSGNSSDP NGLVYFDGEY HLFHQDRGRW AHAVSTDLLH WKQLPIALPH LAAGESWSGS AVVDANDSSG LFDGGQGLVA FYTSFNHDAA NGNQSVRAAY SKDHGRTWSI VQAQPVVENP GGPAGSWDFR DPKVTWDAAT GTWIMVVAGG DHLRFHTSTD LVHWTFTSAF GYGDWVRGGV WECPDFFELP VEGQPGARRW VLWWSTGAVR PTNGSAAQYV TGTWNGTSFT PDTGPDEVLQ ADSGRDYYAA MSFFGAPDGR RIMLGWMSNW DYAFSPPTGR WNGQLSVPRQ LSLKDIPGVG PRLAQEPITE LAGLRTSTWQ ASDVTVTPTS ANPLSAASGR SFELEAEVAI PSSGGASGFT FGLRKGTAGG AAGGEAQETL LRYAAGTGTA SATGAGTGTG TGTGTMTVDR GQSGRADFTR YFAGAAADNA STAWSSETVA LAGGGSERRV KLRALVDSSS VELFGGDGTA AITSLVFPDP SSTGLSFSTT GGSARLVSAK VHQLADTSRL TAAVPSAVLG PVSGAARHNL DSYSVVPGGR WESTGAGLAG TFDKDSTALS AATYSDVRVA ATVRFGSGPY AGAMLNSDRV PERGYGGAGS VLLRASADGA TAYYVNLDPN LRLARIFKLQ DGVFDSAASV LASVPVLLSH GVSYSVEAAA AGERLTVKLD GVEILAVDDA SLSAGKVGLN VFDGRAAYQD VLVTGSG
|
| |