Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0054 |
Symbol | |
ID | 4447489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 60394 |
End bp | 61929 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639687848 |
Product | levanase |
Protein accession | YP_829555 |
Protein GI | 116668622 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1621] Beta-fructosidases (levanase/invertase) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGAAA CAACAATGCA CCCTGCCGCC CCCGCCGAAG ACACCGCCGC CGACTTCCGG CCGGTACTTC ACTACACAGC CAAGAACACC TGGCTGAACG ACCCCAACGG ACTCGTGTGG CACCAGGGCG TCTACCATCT CTTCTACCAA AACAACCCCT TCGACAACGT CTGGGGCAAC ATGTCCTGGG GGCACGCCAC CTCAACCGAC CTTCTGCACT GGACCGAACA CCCGGTTGCC ATCGCCTGCG ACGAGGAAGA AGACGTCTTT TCCGGCAGCA TCGTGGTGGA CCACGGCAAT ACGTCGGGAT TCGGCACAGT GGAAGACCCT GCCCTGGTGG CCATCTACAC GAGCGCCTTC AAGGAAGGCT CGGTGCACCA AGGGACACAA GCCCAGTCTC TCGCGTTCTC CACGGACGCC GGCATGACGT GGAACAAGTA CGCAGGCAAT CCGGTGCTTG GCCGCGACTC GGCCCATTTC CGGGATCCCA AAGTATTCCG CTACGAGGGA GCTGCCGGTT CCTGCTGGGT CATGGTGGCG GTGGAGGCCC GGCGCCAGCA GGTTGTGCTG TACCGCTCGG CCGACCTCAA GGATTGGGAA CACCTGAGCA CCTTCGGCCC TGCAAACGCG ACGGGAGGCG AATGGGAGTG CCCCGACCTG TTCCCGCTCC CCGTCGACAG AGACCCGGAC AACGTCAAGT GGGTCCTCGT AGTCAATGTC AATCCGGGTG CCGTGGCCGG CGGCTCGGGA GGGCAGTACT TCGTCGGCGA CTTCGACGGG GTGAAGTTCA CTGCCGACCC TGATTCACTC GTTCCAGCCG ATGCCGACGG GACCACTGAT CTCAGCCGCT GTCTGTGGCT CGACTGGGGA CGTGACTACT ACGCCGCCGT CTCCTTCAGC AATGCCCCGG AGAACCGCCG TATCATGATC GGCTGGATGA ACAACTGGGA CTACGCCAAC TTCTTGCCCA CGTCTCCATG GCGTTCCGGG ATGTCGCTTG CCCGCGAGAT CGAGCTCGCG ACGGTGGACG GTTTGCCCCG CCTGGTGCAG CGCCCGGTAC TGCCATTGGA CAGCGGCGAG CCGGCCTGCG CCATCCAGGA CGTGGAGCTT CACGACTCCC TGCTGCAACT GCCCGACGCA ATGCCCGGAT CAGCCCAGCT GATCGACGCC GAGATCTTGC CCGGCACGGC CCGGACCGTT GTTTTCCGGC TTCTCGGCGC ATCCGGCGGG AGCGCCGCAA CGGTTCTCAG CTTCGATGCC GTGACGGGCC TGCTCACCCT GGATCGCCGC AACTCCGGAA ACACCGCCTT CCACGGAAAG TTCGCGTCTG CCGAGTCGGC ACCGGTGAAG CTCGAAGCCG GCGTGCTAAG GCTCCGCGTA ATCGTCGACC AGTGCTCGGT GGAGGTCTTT GCCCAAGGCG GCAGGGTCGT CCTGAGCGAT CTGGTCTTCC CGATGTCCGG AAGCCTGGGC ACCGAAGTGT GCGTGGAGGG CGGCGCGGCC TTTGTTCGGA AACTGGCCGT CACGGGCTTG TCCTGA
|
Protein sequence | MTETTMHPAA PAEDTAADFR PVLHYTAKNT WLNDPNGLVW HQGVYHLFYQ NNPFDNVWGN MSWGHATSTD LLHWTEHPVA IACDEEEDVF SGSIVVDHGN TSGFGTVEDP ALVAIYTSAF KEGSVHQGTQ AQSLAFSTDA GMTWNKYAGN PVLGRDSAHF RDPKVFRYEG AAGSCWVMVA VEARRQQVVL YRSADLKDWE HLSTFGPANA TGGEWECPDL FPLPVDRDPD NVKWVLVVNV NPGAVAGGSG GQYFVGDFDG VKFTADPDSL VPADADGTTD LSRCLWLDWG RDYYAAVSFS NAPENRRIMI GWMNNWDYAN FLPTSPWRSG MSLAREIELA TVDGLPRLVQ RPVLPLDSGE PACAIQDVEL HDSLLQLPDA MPGSAQLIDA EILPGTARTV VFRLLGASGG SAATVLSFDA VTGLLTLDRR NSGNTAFHGK FASAESAPVK LEAGVLRLRV IVDQCSVEVF AQGGRVVLSD LVFPMSGSLG TEVCVEGGAA FVRKLAVTGL S
|
| |