Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0051 |
Symbol | |
ID | 4447486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 56890 |
End bp | 58290 |
Gene Length | 1401 bp |
Protein Length | 466 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 639687845 |
Product | inulin fructotransferase (DFA-I-forming) |
Protein accession | YP_829552 |
Protein GI | 116668619 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTCAAGCA ACAACTACTA CGACGTGACC ACGTGGCCCG TCGGCAATCC GTCCGAGGAC GTCGGTGAAG TCATCAACAG CATCATCGCT GACATCAAGG ACCGGCAGAC GGTCACCGAT GCGAACAATG GAGGAAAGCC GGGCGCGGTG ATCTACATTC CGCCGGGGGA CTACCACCTT CGTACGCAGG TTTTGATCGA CATCAGCTTC CTCAGGATCC ATGGCTCGGG ACACGGCTTT ACGTCGTCCA GCATCCGGTT CAATGTTCCG GAAGACGAAT GGCCCGGGCT CCATGAGCTG TGGCCCGGTG GGAGCCGGAT TATCGTCGAC ATTCCGCCCG GCGGAGACGA AGGTGGAGAC GGGGAGGAAT CCAAGGGAGC CGCTTTCTAC GTTGAGCGGA GCGGGAGCCC GCGGATCAGC TCGGTGGAGT TCTCCAACTT CTGCATCGAC GGCTTGCACT TCGACCCGGA TGGCTCGGGG TCGCATCCGG AAAACACCTA CGTCAACGGC AAGACCGGTA TCTATGTTGC GAACGCCAAT GACTCTTTCC GCATAACCGG CATGGGGTTT GTCTACCTTG AGAACGCCCT CACCATCTAC AACGCGGACG CACTTTCCAT TCACGACAAC TTCATCGCTG AATGCGGCAG TTGCATCGAG CTGCGCGGGT GGGGGCAGGC ATCGAAGATC ACCGACAACC TGGTCGGAGC AGGCTTCAAA GGTCACTCAA TCTACGCCGA GAACCACGGC GGCCTCCTGG TAACTGCGAA CAACGTCTTC CCCCGTGGCG CAAGCAGCAT CCATTTCGTA GGCGTCACGC GTTCAAGCGT CACCAATAAC CGTTTGCATT CGTTCTACCC CGGGATGCTG ATCCTTGCGG AGAACAGTTC GGAAAACCTC GTGGCCACGA ACCACTTCCT GCGTGACCAT GAACCGTGGA CGCCGTTCCT TGGAGTCGAC AACGGACTGA ACGACCTCTA CGGACTGCTC TCTGTCAGCG GCAGCAATAA CTCTGTTATC GGCAACCACT TCTCCGAGAT CATCGATTCA CCCAGCATCC AGCCGGAAGG AGCGACGCCC GTCATCATCC GGCTGATGGC GGGGGTTGGC AACTTCGTCT CCAACAACCA CGTGGTGGCG ATGGACGTTC GATCAAAGGC AAGTGACTCC TGCTTCTCGG CCCAGGTGGA CGCTCTGTTG ACGACCGAGG CTTCGGACGG CCTCGCCGTT ACGGCCGTCA TGGTCGATTC CGAATCGGCC CGGAATACGA TCCTGGATTC CGGAAGTGAC GCCCAGGTCA TCGCAGACAG GGCCGTTAAC GCCTTGAGGG CCACGCCCAC CGTCGGTTTC CAGGCAGCCC ACGCACTTGT TGAGCCGCAC GTAGAATCAG CAACAACATA A
|
Protein sequence | MSSNNYYDVT TWPVGNPSED VGEVINSIIA DIKDRQTVTD ANNGGKPGAV IYIPPGDYHL RTQVLIDISF LRIHGSGHGF TSSSIRFNVP EDEWPGLHEL WPGGSRIIVD IPPGGDEGGD GEESKGAAFY VERSGSPRIS SVEFSNFCID GLHFDPDGSG SHPENTYVNG KTGIYVANAN DSFRITGMGF VYLENALTIY NADALSIHDN FIAECGSCIE LRGWGQASKI TDNLVGAGFK GHSIYAENHG GLLVTANNVF PRGASSIHFV GVTRSSVTNN RLHSFYPGML ILAENSSENL VATNHFLRDH EPWTPFLGVD NGLNDLYGLL SVSGSNNSVI GNHFSEIIDS PSIQPEGATP VIIRLMAGVG NFVSNNHVVA MDVRSKASDS CFSAQVDALL TTEASDGLAV TAVMVDSESA RNTILDSGSD AQVIADRAVN ALRATPTVGF QAAHALVEPH VESATT
|
| |