Gene Arth_2900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2900 
Symbol 
ID4444422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3265914 
End bp3267179 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content67% 
IMG OID639690723 
Productglycosyl transferase, group 1 
Protein accessionYP_832379 
Protein GI116671446 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACTCTGA TCCGCCGTGT CGCGTTCCTT TCCCTGCACA CCTCGCCCAT GGAACAACCC 
GGTTCCGGCG ACGCCGGCGG CATGAACGTA TACGTGCGGG CACTGGCGTC GGCGCTGGCT
GCCAGCGGCG TTGAGGTGGA AATCTTCACC CGGTCCACGT CGTCGGGACA ACCCGCCGTC
GAACACCCGG ATCCCGGCGT GTGCGTCCAC AACGTGATCT CCGGACCGCC GCGGAAGCTG
CCCAAAGAGG AGCTTCCGGA GCTGCTGCAC AGCATGGTGG CGGAAATTGA GCGCATCCGG
CAGCGCCAGC CGCACGGCCG GTACGACCTC ATCCATTCGC ACTACTGGGT GTCCGGCGTG
GCCGGGCTGG AGCTTTCCCG GCTGTGGGGC GTGCCGCTGG TGCACACCAT GCACACCATG
GCCAAGGTCA AGAACCTCCT CCTGCAGTCC GGCGAGAAGC CCGAACCGCG GCGGCGCGAG
GACGGCGAAC TCCGGATCGT CGACGGCGCC ACCCGGCTGA TCGCGAACAC CCCCGCCGAA
GCCGCCGAAC TCGTGTCGCA TTACAACGCC GACTTCGACC ACATCGATGT GGCTCCCCCG
GGTGTGGACC TGACTGTTTT TACGCCGGCC TTCCGCCCCA GGTCCCGGGC ACAGCTCGGC
GTACCTGCCG GCAAGTTCCA CCTACTGTTC GCCGGACGCA TCCAACGGCT CAAGGGCCCG
CAGGTCCTCG TCAAGGCAGC TGCCCTGCTC CGCTCGCGCC GCCCGGACAT CGACCTGCAG
GTGACCATCC TGGGCGCACT GAGCGGGGCC AAGGACTTCG ACCTGAAGTC GCTGATCAGC
GCTGCCGGAA TGGACGACGT CGTCACCCAC CACCCGCCGG TGAACGCACC GGAGCTGGCC
GGCTGGTTCC GCTCCGCCGA CGTCGTGGTG ATGCCGTCCT ACAGTGAATC CTTCGGCTTG
GTGGCGTTGG AAGCCCAGGC ATGCGGGACT CCGGTGGTGG CAACCCGGGT GGGCGGACTG
TCCCGGGCCA TTTTCGACGG ACGGACCGGG CTGCTGGTGG ACGGGCACAA AGCCGCTGAC
TGGGCCGACG TTCTGGAAGC GCTCTATGAC GACCCCGCCA CCCGCGGGGA CATGGGGCGG
GCTGCAGCCC TGCACGCCCA GGGTTTCGGC TGGCAGCGCA CTGCCGCCAT CACGCTGGAG
AGCTACCACG CCGCCGTAGA CCAGTACATT GATAGTCACA GAATTCCGGT GGGCCACTCG
CCTTGA
 
Protein sequence
MTLIRRVAFL SLHTSPMEQP GSGDAGGMNV YVRALASALA ASGVEVEIFT RSTSSGQPAV 
EHPDPGVCVH NVISGPPRKL PKEELPELLH SMVAEIERIR QRQPHGRYDL IHSHYWVSGV
AGLELSRLWG VPLVHTMHTM AKVKNLLLQS GEKPEPRRRE DGELRIVDGA TRLIANTPAE
AAELVSHYNA DFDHIDVAPP GVDLTVFTPA FRPRSRAQLG VPAGKFHLLF AGRIQRLKGP
QVLVKAAALL RSRRPDIDLQ VTILGALSGA KDFDLKSLIS AAGMDDVVTH HPPVNAPELA
GWFRSADVVV MPSYSESFGL VALEAQACGT PVVATRVGGL SRAIFDGRTG LLVDGHKAAD
WADVLEALYD DPATRGDMGR AAALHAQGFG WQRTAAITLE SYHAAVDQYI DSHRIPVGHS
P