Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_0738 |
Symbol | |
ID | 4446772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 786372 |
End bp | 788168 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639688540 |
Product | trehalose synthase |
Protein accession | YP_830236 |
Protein GI | 116669303 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | [TIGR02456] trehalose synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.197847 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGTTTTA GTCCGCAAAA CCCGAGCCAG CATTTCACCC CCAAGGGCAC GTTTGAGCTC AACGCCCCCG GCCTGCAACA CGATCCCGTC TGGTACCGCA AAGCAGTGTT CTACGAAGTC CTGGTTCGCG GCTTTGCGGA CGGAAACGGC GACGGATCCG GCGACTTCCA CGGGCTCATC GACAAACTGG ACTACCTGCA GTGGCTGGGC GTCGACTGCC TCTGGCTGCC GCCGTTCTTC CAGTCGCCGC TCCGTGACGG CGGCTACGAC ATCTCGGACT ACAACTCCGT CCTGGACGAG TTCGGCACCA TCAGCGACTT CAAGCGGCTC GTGGCCGAGG CCCATGCCCG CGGCGTCCGC GTAATCATCG ACCTGCCGCT GAACCACACC TCGGACCAGC ACCCGTGGTT CCAGGAGTCC CGCAAGGACC CGGACGGACC GTTCGGCGAC TTCTACGTCT GGAGCGATAC CGACGAGAAG TACCAGGATG CCCGCATCAT CTTCGTGGAC ACCGAGGAAT CCAACTGGAC CTTCGACCCC ATCCGGCGGC AGTTCTTCTG GCACCGGTTC TTCAGCCACC AGCCCGACCT GAACTTTGAG AACCCGAAGG TTATCGACGC CCTTTTCGAC GTCGTCCGGT TCTGGCTGGA CCAGGGGATC GACGGTTTTC GCGCGGACGC CATCCCGTAC CTGTACGAGG AAGAGGGCAC CAACTGCGAG AACCTCCCCG CCACGCACGA TTTCCTGCGC AAGCTGCGGA AAATGGTGGA TGAGAACTAC CCGGGCAGGG TCATCATTGC CGAGGCCAAC CAGCCTCCGG TCGAGGTGGT GGAGTACTTC GGCACCGAGG AAGAACCCGA GTGCCACATG GCCTTCCACT TCCCCATCAT GCCGCGCCTC TACTACGCGC TGAGGGACCA GAAAGCGGCG CCCATCATCG AGACGATGAA GGACACCCCG GACATTCCCG AAGGCGCCCA GTGGGGTACC TTCCTGCGGA ACCACGACGA ACTCACCCTG GAAATGGTCA CCGCCGACGA ACGCGCGGCC ATGCTCGGCT GGTACGCACC GGACCCCCGC ATGCGCGCCA ACATCGGCAT CCGCCGAAGG CTGGCGCCGC TGCTGGATAA TTCCAGGTCC GAGATTGAGC TCATCAATGC CCTGCTGCTT TCTCTGCCGG GGAGCCCGTT CCTGTACTAC GGGGATGAAA TCGGCATGGG CGACAACATC TGGCTTGAGG ACCGCGACGC CGTCCGCACT CCCATGCAGT GGAACCCTGA CCGGAACGCG GGCTTCTCGC ACGCGGATCC GGGCAAGCTC TACCTGCCGG TCATCCAGTC GCTGGTGTAC AACTACGGCA TGGCCAATGT GGAGGCCGAG GCCGCGCACT CCGGATCGCT GCTCCGTTGG ACCCGGCAGA TCCTCAGCGT CCGTAAGAAC CACCCCGTCT TCGGGCTTGG CACGTTCAAG CACGTCGAGG CAGATCACGA CGTCGTGCTT GCTTACCTGC GGGAACTGGC CCCGGACAAT GCGGCGGGTG AAGACGCCGA ATCGATCCTG TGCGCGTTCA ACCTCTCGCA GCATCCCGTG GCCACCACGC TGCGGATTCC GCAGTACGCC GGCCGCGGCC TCCGGGACGT CTTCGGCGGC CAGCCGTTCC CCGCCATCGG CGACGACGGG CGTCTTACGC TGACGCTCGG CAGCCATGAT TTCTTCTGGC TGCGGATCCG TTCCGCTGCT TCCAACCCGG CCTCGCCGTT CACCCAGGCC ATGCCGGTCC TGTCCATCGA AGGCTGA
|
Protein sequence | MSFSPQNPSQ HFTPKGTFEL NAPGLQHDPV WYRKAVFYEV LVRGFADGNG DGSGDFHGLI DKLDYLQWLG VDCLWLPPFF QSPLRDGGYD ISDYNSVLDE FGTISDFKRL VAEAHARGVR VIIDLPLNHT SDQHPWFQES RKDPDGPFGD FYVWSDTDEK YQDARIIFVD TEESNWTFDP IRRQFFWHRF FSHQPDLNFE NPKVIDALFD VVRFWLDQGI DGFRADAIPY LYEEEGTNCE NLPATHDFLR KLRKMVDENY PGRVIIAEAN QPPVEVVEYF GTEEEPECHM AFHFPIMPRL YYALRDQKAA PIIETMKDTP DIPEGAQWGT FLRNHDELTL EMVTADERAA MLGWYAPDPR MRANIGIRRR LAPLLDNSRS EIELINALLL SLPGSPFLYY GDEIGMGDNI WLEDRDAVRT PMQWNPDRNA GFSHADPGKL YLPVIQSLVY NYGMANVEAE AAHSGSLLRW TRQILSVRKN HPVFGLGTFK HVEADHDVVL AYLRELAPDN AAGEDAESIL CAFNLSQHPV ATTLRIPQYA GRGLRDVFGG QPFPAIGDDG RLTLTLGSHD FFWLRIRSAA SNPASPFTQA MPVLSIEG
|
| |