Gene Arth_1193 details

Gene Information

Locus tag: Arth_1193
Symbol:
ID: 4446324
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter sp. FB24
Kingdom: Bacteria
Replicon accession: NC_008541
Strand:
Start bp: 1293294
End bp: 1296719
Gene Length: 3426 bp
Protein Length: 1141 aa
Translation table: 11
GC content: 69%
IMG OID: 639689000
Product: glycosyl transferase family protein
Protein accession: YP_830687
Protein GI: 116669754
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
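
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 1293294
end_bp = 1296719
gene_length_bp = 3426
protein_length_aa = 1141

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon, not counted as an amino acid.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # 3426 0 1141
```

Under those assumptions the span matches the reported gene length, is a whole number of codons, and gives the reported protein length.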


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGGGAGGC CTCTCCGAGC GCGAGCGCCG TCGGCTGAGG AAGCGAGCAG TCTAATTCTT 
CAAGTAGTAC ATGTCACCGC CGTTGTGGTC GCCCACAACG GCGGTGACTA TCTTCCCAGG
ACCTTGGCGG CATTGTCACA GCAAACCCGT CCGGTGGATG CGTCTATTGG CGTGGACACC
GGCTCGCGGG ACAATTCCCT GGCTTTGCTT GACCAGGCGT TCGGCGAATC CAACGTGACC
ACCTTCCAGC AGGGGCGGAC CGGGATGGGG GAGGCGGTCA AGGCCGCCCT TGCGGCCCTT
GCCCCCCGGA ACGCGGACTC CCGTAACGAC ACCGAGTGGA TCTGGCTGCT GCACGACGAC
GCCGCTCCCG CGCCGGAGGC GCTTGCCGAA CTGCTCCACG CCGTGGAGCG GGCGCCGTCG
GTGACGGTGG CGGGGTGCAA ACAGCTGGAC TGGGATGCCA GGCGCCGGCT GATCGACGTC
GGGCTTTCCA CCAGCCGCTG GGCTGAACGG CTGACCCTCA TTGAGGCGGA CGAACTCGAC
CAGGGGCAAT ACGACGGCCG CAGCGATACC TTCGCGGTCA ACTCCGCCGG CATGCTGGTC
CGCCGGGACG TCTGGGAGGA ACTGGGCGGG TTTGACTCCG CCCTTCCCGG CACAGGTGAC
GATGTGGACT TTTGCTGGCG CAACCGCTTG GCCGGGCACC GCGTAGTGGT GGTGCCGACC
GCCCGGATGT TCCACGTTTC GCACCGGCCG CATGCGCTGG GGAGTGCAAG TGCTGCCCGG
AAAGCCCAGG TCTACCTGCG CCTCAAGCAC ACAGTGTGGT GGAAGGTACC GCTGCATGCG
GCAGGTGCGC TGCTGGGGAG CATCTTCAAG CTGGTCCTGA GTATTGCCGT CAAGGATCCC
GGCCACGGAT TCTCCCAACT GATGGCGACG TTCGTTGCGC TGGGCCGGCC GGGGGCTGTG
ATCCGCGGAC GCCGCGACGC AGCCCGGACG CGGCGGATCC GCCGCTCGGT GATCAAGGGA
CTGCAGACGC CGCGCCGCGA GGTCTGGGCC CACCGGCGTT CCCTTATTGA AGCCCTGGGC
TCCGACGATC CGGGCGGGGA CTCGGCCCGG ACGGATCCGC TGGCCGAACA GCCCAGCGGG
GACTCCACGG ATGACTTTGC CGCACTGACC ACTTCGGAAC GCGGCTGGGT GGGCAACGGC
GCCCTCCTGG CCGTGATTCT CACAGCGGCG GCTTCCTTGA TCGGGCTCTC CGGGTTTTTC
CGCGCCGAAG CCGTCTCCGG TGGCGCCCTG ATCCCGGTCT CGTCGAAGAT ATCGGACATC
TGGCACCACG CCACCAGTTG GTGGATCGGA CTCGGTGCCG GGCTGCCCGG CCACGGCGAC
CCCTTCGCCC TGCTTCTGTG GATCCTTGGT CTAGCCGGCG GCGGCGATGC CAACAGCGCC
CTGGCGTGGC TGCTGATCCT TGCCATGCCG CTCTCCGCTC TCGCTGCGTG GTTTGCGGCC
GGAGCCCTGA CCACGAGGCG GCGGTTCCGC CTCGTGGCCG CAGTTGTGTG GGGCGGGGCG
CCCGCCTTGC AGGTTGCACT GAACCAGGGG CGCCTTGGCG CCCTGCTTGC GCACATCATG
GTGCCGTTGC TGGTAGTGGC ATTGCTGCGC GCCACCGGCA CCGCCCGCGG GCAGGGCCGC
TTCGCGGTTC CTGAGCCTGG GGATCGGCGC TTCCCGGACA AGCCTCCGGC CAAGCCGGGC
GTCAACGGAA CTCCTTCCTG GACAGCCGCC GCGGCTGCCG GCCTGGCGCT GGCTGTTGTG
GCAGCGTCCG CACCGTCCTT ACTGCTTCCG TCCGCCGTCG TGGTTGTCCT GGCCGGGGTC
CTGCTGGGAA GGCGCGGCCG CACCGTATGG TGGGCGCTGC TGCCGAGCGC CGCGCTTTTC
GTTCCGTTCG GCATTTCCGT GCTTGACCGC CCGCGGGCAC TGCTCGCGGA CCCCGGCGTC
CCGCTCGGTT TTGACGGCGC ACCGTTATGG CAGCAGGTTC TGGGACAGCC GCTGTATTTC
GATGCCGACG GCGGTCTGGC CGGGCTTCCG GTCTTTGAGG GGTCTGCCGC GCCGTGGGCG
CTGCTCCTGG CACTGCTAAT CGGCTTTCCC GTCCTGGCAC TTGCAACGGC GGCACTGTTT
GTCCCGGGCA GGCGCACCCG CGTGGCCCGC GCTTTGTGGG TTGCTTCCCT GATCATCCTT
GCGGGCGGCT GGCTCGCGGG CCACGTGGCC AGCGGTGCCA GCGCTGACAC CCTCGTCACC
CCATTCACCG GGCCGGTTGT GTCCGCATCC GGCTTCCTGC TGCTGGGCGC CGCCCTGATC
GGGGCCGACG GCCTGCTCTG CTTCTCTGAG AAGGCAGCCG AAGCCCCGGC TGCCCGCCGG
ATCGTCCTCC AGGCCCTGTC CGTCCTTGCC ATGGTGCTGC TCCTCGCCGG GCCCGTGGCC
GGATTGACCG CCTGGGCAGC CGGGAACCTG TTGCAGTCCG CATCCGGGGC AGCCCCGGCA
GCGGAGGGCA TGACCCCTCC GGGCCCGACC GCCCTCGGAA CACCGCGGCA GATTGCTCCC
ACGGCCGCGC GCACCCTTCC GGCTACGGCC ATCGACCGCG GCCAAGGGCC CGAGCAGACC
CGGACGCTGG TCATCGACAC GGCGGATGAC GGCAGCTTCG ACGCCTCGAT CGTGAGGGGT
GCGGGGACCA CCCTCGACGG GCTGTCCACC ATTGCGGCCG CCCGCAACAT CATGGGTGAA
CCCGGCAAGG AAACGGTCAG GGATGACGAC GCCGTCACGG CAGCGCTTCG CAGCGTCGTC
GCCACCGTGG TCGCAGGCCA GGGTGTGGAC CCGCGGCCGG AGCTGGAACG CCTTGGCGCA
GGATTTGTTG TCCTGCGGGC CTCGGATTCT GCTGCGCAGC TGACTGCGAG CCGCATGGAT
GCCGTGCCAG GACTGGTGGC GGTGGGGCAG ACCAACGTCG GCTGGCTCTG GCGGATCAGC
CCGCTCAACC AGCCGGTCAT CGACGCCGCT GACGTTGCGC ACCGGGTGCG CATCGTGGAC
CGTGCCGGCG CTGCCATCGG GCTCCTGCCC TCGGGCCTGG AAAATGTTGA TGCCGCTGTC
CCGGAAGGGC CCGAAGGCAG GCTCGTGGTT CTCGCCGAAC GTTCCGATCC GGGATGGACC
GCGTGGATGG ATGGCCGGCG GCTGACGTCC ACAACGTCCG GTTGGTCCCA GGCGTTCACC
CTCCCCGCCC AGGGCGGCCA GCTCACCATC CGTTACGACA ACCCGTGGGC AGTGTGGTCC
GCTGTCGTGC AGGCAACGGT GATCGGACTC ACTGTCCTGC TGGCCATCCC TATGCCCGCC
CGCCGGCCGA ACACCGGCCT CTCCCGGGAT GAAGGCTCCC TGCGTAAGGA ACATCAGCAT
GCATAA
 
Protein sequence
MGRPLRARAP SAEEASSLIL QVVHVTAVVV AHNGGDYLPR TLAALSQQTR PVDASIGVDT 
GSRDNSLALL DQAFGESNVT TFQQGRTGMG EAVKAALAAL APRNADSRND TEWIWLLHDD
AAPAPEALAE LLHAVERAPS VTVAGCKQLD WDARRRLIDV GLSTSRWAER LTLIEADELD
QGQYDGRSDT FAVNSAGMLV RRDVWEELGG FDSALPGTGD DVDFCWRNRL AGHRVVVVPT
ARMFHVSHRP HALGSASAAR KAQVYLRLKH TVWWKVPLHA AGALLGSIFK LVLSIAVKDP
GHGFSQLMAT FVALGRPGAV IRGRRDAART RRIRRSVIKG LQTPRREVWA HRRSLIEALG
SDDPGGDSAR TDPLAEQPSG DSTDDFAALT TSERGWVGNG ALLAVILTAA ASLIGLSGFF
RAEAVSGGAL IPVSSKISDI WHHATSWWIG LGAGLPGHGD PFALLLWILG LAGGGDANSA
LAWLLILAMP LSALAAWFAA GALTTRRRFR LVAAVVWGGA PALQVALNQG RLGALLAHIM
VPLLVVALLR ATGTARGQGR FAVPEPGDRR FPDKPPAKPG VNGTPSWTAA AAAGLALAVV
AASAPSLLLP SAVVVVLAGV LLGRRGRTVW WALLPSAALF VPFGISVLDR PRALLADPGV
PLGFDGAPLW QQVLGQPLYF DADGGLAGLP VFEGSAAPWA LLLALLIGFP VLALATAALF
VPGRRTRVAR ALWVASLIIL AGGWLAGHVA SGASADTLVT PFTGPVVSAS GFLLLGAALI
GADGLLCFSE KAAEAPAARR IVLQALSVLA MVLLLAGPVA GLTAWAAGNL LQSASGAAPA
AEGMTPPGPT ALGTPRQIAP TAARTLPATA IDRGQGPEQT RTLVIDTADD GSFDASIVRG
AGTTLDGLST IAAARNIMGE PGKETVRDDD AVTAALRSVV ATVVAGQGVD PRPELERLGA
GFVVLRASDS AAQLTASRMD AVPGLVAVGQ TNVGWLWRIS PLNQPVIDAA DVAHRVRIVD
RAGAAIGLLP SGLENVDAAV PEGPEGRLVV LAERSDPGWT AWMDGRRLTS TTSGWSQAFT
LPAQGGQLTI RYDNPWAVWS AVVQATVIGL TVLLAIPMPA RRPNTGLSRD EGSLRKEHQH
A
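
The gene and protein sequences can be related with a short translation check. The sketch below is a minimal illustration, not the database's own method: it builds the standard codon-to-amino-acid mapping (translation table 11 uses the same codon assignments), and it assumes that the first codon is read as methionine when it acts as the start codon, which is how the TTG at the start of this gene corresponds to the leading M of the protein. The names `CODON_TABLE` and `translate` are hypothetical helpers.

```python
from itertools import product

# Codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, first_is_start=True):
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.replace(" ", "").upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        # Assumption: an initiator codon (here TTG) is read as methionine.
        if i == 0 and first_is_start:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_60 = "TTGGGGAGGC CTCTCCGAGC GCGAGCGCCG TCGGCTGAGG AAGCGAGCAG TCTAATTCTT"
print(translate(first_60))  # MGRPLRARAPSAEEASSLIL
```

The result matches the first 20 residues of the protein sequence above.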