Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_1193 |
Symbol | |
ID | 4446324 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | + |
Start bp | 1293294 |
End bp | 1296719 |
Gene Length | 3426 bp |
Protein Length | 1141 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639689000 |
Product | glycosyl transferase family protein |
Protein accession | YP_830687 |
Protein GI | 116669754 |
COG category | [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGGGAGGC CTCTCCGAGC GCGAGCGCCG TCGGCTGAGG AAGCGAGCAG TCTAATTCTT CAAGTAGTAC ATGTCACCGC CGTTGTGGTC GCCCACAACG GCGGTGACTA TCTTCCCAGG ACCTTGGCGG CATTGTCACA GCAAACCCGT CCGGTGGATG CGTCTATTGG CGTGGACACC GGCTCGCGGG ACAATTCCCT GGCTTTGCTT GACCAGGCGT TCGGCGAATC CAACGTGACC ACCTTCCAGC AGGGGCGGAC CGGGATGGGG GAGGCGGTCA AGGCCGCCCT TGCGGCCCTT GCCCCCCGGA ACGCGGACTC CCGTAACGAC ACCGAGTGGA TCTGGCTGCT GCACGACGAC GCCGCTCCCG CGCCGGAGGC GCTTGCCGAA CTGCTCCACG CCGTGGAGCG GGCGCCGTCG GTGACGGTGG CGGGGTGCAA ACAGCTGGAC TGGGATGCCA GGCGCCGGCT GATCGACGTC GGGCTTTCCA CCAGCCGCTG GGCTGAACGG CTGACCCTCA TTGAGGCGGA CGAACTCGAC CAGGGGCAAT ACGACGGCCG CAGCGATACC TTCGCGGTCA ACTCCGCCGG CATGCTGGTC CGCCGGGACG TCTGGGAGGA ACTGGGCGGG TTTGACTCCG CCCTTCCCGG CACAGGTGAC GATGTGGACT TTTGCTGGCG CAACCGCTTG GCCGGGCACC GCGTAGTGGT GGTGCCGACC GCCCGGATGT TCCACGTTTC GCACCGGCCG CATGCGCTGG GGAGTGCAAG TGCTGCCCGG AAAGCCCAGG TCTACCTGCG CCTCAAGCAC ACAGTGTGGT GGAAGGTACC GCTGCATGCG GCAGGTGCGC TGCTGGGGAG CATCTTCAAG CTGGTCCTGA GTATTGCCGT CAAGGATCCC GGCCACGGAT TCTCCCAACT GATGGCGACG TTCGTTGCGC TGGGCCGGCC GGGGGCTGTG ATCCGCGGAC GCCGCGACGC AGCCCGGACG CGGCGGATCC GCCGCTCGGT GATCAAGGGA CTGCAGACGC CGCGCCGCGA GGTCTGGGCC CACCGGCGTT CCCTTATTGA AGCCCTGGGC TCCGACGATC CGGGCGGGGA CTCGGCCCGG ACGGATCCGC TGGCCGAACA GCCCAGCGGG GACTCCACGG ATGACTTTGC CGCACTGACC ACTTCGGAAC GCGGCTGGGT GGGCAACGGC GCCCTCCTGG CCGTGATTCT CACAGCGGCG GCTTCCTTGA TCGGGCTCTC CGGGTTTTTC CGCGCCGAAG CCGTCTCCGG TGGCGCCCTG ATCCCGGTCT CGTCGAAGAT ATCGGACATC TGGCACCACG CCACCAGTTG GTGGATCGGA CTCGGTGCCG GGCTGCCCGG CCACGGCGAC CCCTTCGCCC TGCTTCTGTG GATCCTTGGT CTAGCCGGCG GCGGCGATGC CAACAGCGCC CTGGCGTGGC TGCTGATCCT TGCCATGCCG CTCTCCGCTC TCGCTGCGTG GTTTGCGGCC GGAGCCCTGA CCACGAGGCG GCGGTTCCGC CTCGTGGCCG CAGTTGTGTG GGGCGGGGCG CCCGCCTTGC AGGTTGCACT GAACCAGGGG CGCCTTGGCG CCCTGCTTGC GCACATCATG GTGCCGTTGC TGGTAGTGGC ATTGCTGCGC GCCACCGGCA CCGCCCGCGG GCAGGGCCGC TTCGCGGTTC CTGAGCCTGG GGATCGGCGC TTCCCGGACA AGCCTCCGGC CAAGCCGGGC GTCAACGGAA CTCCTTCCTG GACAGCCGCC GCGGCTGCCG GCCTGGCGCT GGCTGTTGTG GCAGCGTCCG CACCGTCCTT ACTGCTTCCG TCCGCCGTCG TGGTTGTCCT GGCCGGGGTC CTGCTGGGAA GGCGCGGCCG CACCGTATGG TGGGCGCTGC TGCCGAGCGC CGCGCTTTTC GTTCCGTTCG GCATTTCCGT GCTTGACCGC CCGCGGGCAC TGCTCGCGGA CCCCGGCGTC CCGCTCGGTT TTGACGGCGC ACCGTTATGG CAGCAGGTTC TGGGACAGCC GCTGTATTTC GATGCCGACG GCGGTCTGGC CGGGCTTCCG GTCTTTGAGG GGTCTGCCGC GCCGTGGGCG CTGCTCCTGG CACTGCTAAT CGGCTTTCCC GTCCTGGCAC TTGCAACGGC GGCACTGTTT GTCCCGGGCA GGCGCACCCG CGTGGCCCGC GCTTTGTGGG TTGCTTCCCT GATCATCCTT GCGGGCGGCT GGCTCGCGGG CCACGTGGCC AGCGGTGCCA GCGCTGACAC CCTCGTCACC CCATTCACCG GGCCGGTTGT GTCCGCATCC GGCTTCCTGC TGCTGGGCGC CGCCCTGATC GGGGCCGACG GCCTGCTCTG CTTCTCTGAG AAGGCAGCCG AAGCCCCGGC TGCCCGCCGG ATCGTCCTCC AGGCCCTGTC CGTCCTTGCC ATGGTGCTGC TCCTCGCCGG GCCCGTGGCC GGATTGACCG CCTGGGCAGC CGGGAACCTG TTGCAGTCCG CATCCGGGGC AGCCCCGGCA GCGGAGGGCA TGACCCCTCC GGGCCCGACC GCCCTCGGAA CACCGCGGCA GATTGCTCCC ACGGCCGCGC GCACCCTTCC GGCTACGGCC ATCGACCGCG GCCAAGGGCC CGAGCAGACC CGGACGCTGG TCATCGACAC GGCGGATGAC GGCAGCTTCG ACGCCTCGAT CGTGAGGGGT GCGGGGACCA CCCTCGACGG GCTGTCCACC ATTGCGGCCG CCCGCAACAT CATGGGTGAA CCCGGCAAGG AAACGGTCAG GGATGACGAC GCCGTCACGG CAGCGCTTCG CAGCGTCGTC GCCACCGTGG TCGCAGGCCA GGGTGTGGAC CCGCGGCCGG AGCTGGAACG CCTTGGCGCA GGATTTGTTG TCCTGCGGGC CTCGGATTCT GCTGCGCAGC TGACTGCGAG CCGCATGGAT GCCGTGCCAG GACTGGTGGC GGTGGGGCAG ACCAACGTCG GCTGGCTCTG GCGGATCAGC CCGCTCAACC AGCCGGTCAT CGACGCCGCT GACGTTGCGC ACCGGGTGCG CATCGTGGAC CGTGCCGGCG CTGCCATCGG GCTCCTGCCC TCGGGCCTGG AAAATGTTGA TGCCGCTGTC CCGGAAGGGC CCGAAGGCAG GCTCGTGGTT CTCGCCGAAC GTTCCGATCC GGGATGGACC GCGTGGATGG ATGGCCGGCG GCTGACGTCC ACAACGTCCG GTTGGTCCCA GGCGTTCACC CTCCCCGCCC AGGGCGGCCA GCTCACCATC CGTTACGACA ACCCGTGGGC AGTGTGGTCC GCTGTCGTGC AGGCAACGGT GATCGGACTC ACTGTCCTGC TGGCCATCCC TATGCCCGCC CGCCGGCCGA ACACCGGCCT CTCCCGGGAT GAAGGCTCCC TGCGTAAGGA ACATCAGCAT GCATAA
|
Protein sequence | MGRPLRARAP SAEEASSLIL QVVHVTAVVV AHNGGDYLPR TLAALSQQTR PVDASIGVDT GSRDNSLALL DQAFGESNVT TFQQGRTGMG EAVKAALAAL APRNADSRND TEWIWLLHDD AAPAPEALAE LLHAVERAPS VTVAGCKQLD WDARRRLIDV GLSTSRWAER LTLIEADELD QGQYDGRSDT FAVNSAGMLV RRDVWEELGG FDSALPGTGD DVDFCWRNRL AGHRVVVVPT ARMFHVSHRP HALGSASAAR KAQVYLRLKH TVWWKVPLHA AGALLGSIFK LVLSIAVKDP GHGFSQLMAT FVALGRPGAV IRGRRDAART RRIRRSVIKG LQTPRREVWA HRRSLIEALG SDDPGGDSAR TDPLAEQPSG DSTDDFAALT TSERGWVGNG ALLAVILTAA ASLIGLSGFF RAEAVSGGAL IPVSSKISDI WHHATSWWIG LGAGLPGHGD PFALLLWILG LAGGGDANSA LAWLLILAMP LSALAAWFAA GALTTRRRFR LVAAVVWGGA PALQVALNQG RLGALLAHIM VPLLVVALLR ATGTARGQGR FAVPEPGDRR FPDKPPAKPG VNGTPSWTAA AAAGLALAVV AASAPSLLLP SAVVVVLAGV LLGRRGRTVW WALLPSAALF VPFGISVLDR PRALLADPGV PLGFDGAPLW QQVLGQPLYF DADGGLAGLP VFEGSAAPWA LLLALLIGFP VLALATAALF VPGRRTRVAR ALWVASLIIL AGGWLAGHVA SGASADTLVT PFTGPVVSAS GFLLLGAALI GADGLLCFSE KAAEAPAARR IVLQALSVLA MVLLLAGPVA GLTAWAAGNL LQSASGAAPA AEGMTPPGPT ALGTPRQIAP TAARTLPATA IDRGQGPEQT RTLVIDTADD GSFDASIVRG AGTTLDGLST IAAARNIMGE PGKETVRDDD AVTAALRSVV ATVVAGQGVD PRPELERLGA GFVVLRASDS AAQLTASRMD AVPGLVAVGQ TNVGWLWRIS PLNQPVIDAA DVAHRVRIVD RAGAAIGLLP SGLENVDAAV PEGPEGRLVV LAERSDPGWT AWMDGRRLTS TTSGWSQAFT LPAQGGQLTI RYDNPWAVWS AVVQATVIGL TVLLAIPMPA RRPNTGLSRD EGSLRKEHQH A
|
| |