Gene Information |
Locus tag | Arth_3796 |
Symbol | |
ID | 4447846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4279311 |
End bp | 4281269 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639691620 |
Product | levanase |
Protein accession | YP_833271 |
Protein GI | 116672338 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | |
|
|
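The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above (Arth_3796).
start_bp, end_bp = 4279311, 4281269
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # 1959 652
```

Both results agree with the Gene Length (1959 bp) and Protein Length (652 aa) fields, which is consistent with the record describing an unspliced CDS.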
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCTCA GGTTTCTCCC CAGCAAACTC ACGCCACGGG CCCTATGGGT CGCGGTGGCC GCTGCCCTGG CGCTCGTCCT GGCGGCGGTA GCCATCGTGG TGGGTGCGGG GTCGCAGCCT GGCCCGGGCA GCCCCGCCGC GGACGCGCTG GCCGACGTCG CCGGCGCCCA GGCGGAGGCC AAAAAAGACG CCGGCCGCTA TGGCTCGTTC TGGCTGGCCG GGGGCGACCG CAGCCTGGAG GGCCTGGCAC GCCCGCTCCG GACCGACGGC GTTTCCGACC TCCGCGCCAT TGAATGCGCC GATGGATGGG TGGCCGCCGC CAGGGTGGGG CAGGACGTCT TCGTACGCTC CAGCAGTCAG GACCAGGTGG TGCGGGCTGG CTCCGCGGAC ATCCGCCGCC CGGACTGCGT CACAGCTGTT GCAGTGGAGG CCATGCTGTC CGACCTGGGC GCACGGCCCG AGGTTCCGGC GCCGGCGGAG TCCGCGTTCG CGCGGCCGGA CGGTGCCTCC GCCTACCGCC CGGGCTACCA CATCACCCCC CGCGAAAACT GGATGAACGA TCCGCAGCGG CCGTTCTGGC TGGACGGCCT CTGGCACTAC TACTACCTCT ACAACGCCGG CTACCCGGAG GAGAACGGCA CTGAGTGGTA CCACCTCACC AGCACCGACC TGGTGCATTG GAAGGACGAG GGGGTGGCCA TTGAAAAGTA CAAGAACGGC CTGGGCGACA TCGAAACGGG CAGTGCCGTG GTGGATTACG AGAATTCCGC GGGGTTCGGC AAAGGTGCCG TCATCGCCGT CATGACTCAG CAGGACGACG GCATCCAGCG GCAGTCGCTG TTCTATTCCA CCGACAAGGG CTACACATTC AAGCCCTACG AGGGGAACCC GGTGATGGAC AACCCGGGGG AGCAGCACTG GCGCGATCCG AAGATCATCC GTGATAACGC GAACAACCAG TGGGTGATGG CCCTGGCGGA GGGCGAAAAG ATAGGTCTGT ACGCCTCCGC CAACCTGAAG GAATGGCGCT ACCTTTCGGC CTTTGAACGC AAGGGACTGG GGATCCTTGA ATGCCCGGAA CTCTTCCAGC TCGACGTCGA CGGCGATCCC GCCAAACGGA CCTGGGTCCT TGCTGCCAGC GCCAACGGGG CCGAGGAAGG GAAGTCCACC GGCGTCGCCT ATTGGACCGG GACCTGGGAC GGGACCCGGT TCGAGCCTTC GGACCAGAAG CACCAGTGGC TGGACGACGG CTCCGACTTC TACGCCGCCG TGACCTGGGA CGACCCCCGT CTCACGGAAA GCCAGCGCAT GGGGTCGCGC CACTCCATCG CGTGGCTGAA CAACTGGGCC TATGCCCGCA AGCTGCCCAC CGACGACTGG CACGGGGGCG CCGACACTCT GGTCAGGGAT ATCCGGCTGA AGACGGTCTC CGGCAAGCCC ACGCTCGTCT CCATGCCCAC TAGTGCGCTG AAGTCGCTGG AAGGAGACAC CGCCACCGTG GAAGACCGGA AACTGACTCC TGACGGTGCA GCCGGGCTAC CCGTGCCTGA CCGCGGAGCG TACCGGCTCG ACCTCACGCT CGAACGCGCG GCAGACGACG ACGGTTCCGA GGCCAAGGTG GAACTGCTCG CCGAAAACGG GGTCTTTGCC ACAGTGGGCT ATGACTTCGA ATCGGGAACC GCCTTCGTCA CGCGGGACGG TGCCGCCAAG GAAACGGCCG GACTCGCGCC CGATTACGGC GTGCTCCGGC GTGCGGAGTC CGCACCCCGC GAGGGCCGGG TCCGGCTGAC GGTCTATGTG 
GACCACAGTT CCGTGGAAGT GTTCGTCAAC GACGGCGAGA GAACCCTCAC GTCTCTCGTG TTCCCTGCCG GGGCGCCTAA GGGCCTGAAG GCGCTGACGA AGGACGGGAC GCTGACGCTC AAGTCGTTCA GCTATACACC GATGGCGGCC ACGTCCTGA
|
Protein sequence | MSLRFLPSKL TPRALWVAVA AALALVLAAV AIVVGAGSQP GPGSPAADAL ADVAGAQAEA KKDAGRYGSF WLAGGDRSLE GLARPLRTDG VSDLRAIECA DGWVAAARVG QDVFVRSSSQ DQVVRAGSAD IRRPDCVTAV AVEAMLSDLG ARPEVPAPAE SAFARPDGAS AYRPGYHITP RENWMNDPQR PFWLDGLWHY YYLYNAGYPE ENGTEWYHLT STDLVHWKDE GVAIEKYKNG LGDIETGSAV VDYENSAGFG KGAVIAVMTQ QDDGIQRQSL FYSTDKGYTF KPYEGNPVMD NPGEQHWRDP KIIRDNANNQ WVMALAEGEK IGLYASANLK EWRYLSAFER KGLGILECPE LFQLDVDGDP AKRTWVLAAS ANGAEEGKST GVAYWTGTWD GTRFEPSDQK HQWLDDGSDF YAAVTWDDPR LTESQRMGSR HSIAWLNNWA YARKLPTDDW HGGADTLVRD IRLKTVSGKP TLVSMPTSAL KSLEGDTATV EDRKLTPDGA AGLPVPDRGA YRLDLTLERA ADDDGSEAKV ELLAENGVFA TVGYDFESGT AFVTRDGAAK ETAGLAPDYG VLRRAESAPR EGRVRLTVYV DHSSVEVFVN DGERTLTSLV FPAGAPKGLK ALTKDGTLTL KSFSYTPMAA TS
|
| |
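The gene sequence is listed 5' to 3' on the coding strand (it starts with ATG and ends with TGA), so it can be translated directly even though the gene lies on the minus strand of the replicon. The sketch below uses only the Python standard library and shows how the protein sequence and GC content could be recomputed from the space-separated blocks. It assumes the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in its set of permitted start codons; this gene starts with ATG, so that difference does not matter here). All function names are hypothetical.

```python
# Codon table built from the NCBI layout: bases ordered T, C, A, G,
# amino acids listed for TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces/newlines between 10-letter blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence, stopping at the first stop codon."""
    s = clean(seq)
    if len(s) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(s), 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and last three codons of the gene sequence above.
print(translate("ATGTCGCTCA GGTTTCTCCC CAGCAAACTC"))  # MSLRFLPSKL
print(translate("ACGTCCTGA"))                         # TS
```

The two printed fragments match the beginning (MSLRFLPSKL) and end (TS) of the protein sequence in the record. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field.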