Gene Arth_2254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2254 
Symbol 
ID4445245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2537920 
End bp2538996 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content66% 
IMG OID639690063 
Productglycosyl transferase, group 1 
Protein accessionYP_831734 
Protein GI116670801 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0765492 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAATCA TTATCGACGC CCGCTTCACC CGCCTGGACC ACCACGACGG CATCAGCCGG 
TACGGCGCCA GCCTGATCGC AGCCACGGCC AAGATCGCAG ACGTCTCGAT GCTCATCAGC
GATTCCCGGC AGCTGGCCCT CCTCCCGGAT GTGCCCTACA CGCTGATCAA CAGCCCGCTC
TCCCCCGCGG AGCTGTTCGT GGCCGGCAAG GTCAACAAGC TGGGCGCCGA CGTCGTGGTG
TGCCCGATGC AGACGATGGG CACGTTCGGC CGGAAGTACG GCCTGGTGCT GACACTTCAC
GACCTCATCT ACTACGAGCA CCCTTCCCCG CCGGGCTTCC TGCCGGCACC CGTCCGTGTG
CTGTGGCGCC TGTACCACAA GGCGTACTGG CCGCAGCGGC TGCTCCTGGA CCGGGCGGAC
GTGGTGGCTA CCATCAGCCA CACCACCGAA GCACTGATCG CCAAACACCG CCTCACCCGG
CGTCCGGTCC GGATTGTGGG CAATGCGCCG CAGCACGGCC ATACTCCGCG CGACCCCGGT
GCCGGCGCGG ACAGGACGCT TCTCTACATG GGTTCGTTCA TGCCGTACAA GAACGTGGAA
ACCATGGTGC GGGGAATGGC CGAACTTCCC GACATGACGC TCCACCTGCT CAGCCGCATC
ACGCCGCAGC GGCAGGCGGA GCTGGAAGCG CTGGTGCAGC CGGGAGCCAA CGTTGTGTTC
CACAACGGCG TGACGGATGC CGAGTACGAG GCCCTGCTGG GCCGGACAAC GGCCCTGATC
AGCCTTTCAC GAGCGGAGGG CTACGGCCTC CCGCTCGTGG AGGCCATGTC CCATGGGACC
CCCGTAATCG CAAGCGACAT CCCGATCTTC CGTGAAGTAG GCCATGACGC GGTGAGTTAT
GTCCACCCGG ATTCCCCCTC GGAGTTCGCC AACGCGGTCC GCAGGCTGGA AGAGCCGGAA
GTGTGGAAAG CTCACTCACT CCGTTCGGTG GAACGCGCCG CGGAATTCAG CTGGGACCAC
TCGGCCCGGC AGCTTGTGCA GCTGGCCGAG GAGGCCGCCC GAATCAACCG CCGCTGA
 
Protein sequence
MKIIIDARFT RLDHHDGISR YGASLIAATA KIADVSMLIS DSRQLALLPD VPYTLINSPL 
SPAELFVAGK VNKLGADVVV CPMQTMGTFG RKYGLVLTLH DLIYYEHPSP PGFLPAPVRV
LWRLYHKAYW PQRLLLDRAD VVATISHTTE ALIAKHRLTR RPVRIVGNAP QHGHTPRDPG
AGADRTLLYM GSFMPYKNVE TMVRGMAELP DMTLHLLSRI TPQRQAELEA LVQPGANVVF
HNGVTDAEYE ALLGRTTALI SLSRAEGYGL PLVEAMSHGT PVIASDIPIF REVGHDAVSY
VHPDSPSEFA NAVRRLEEPE VWKAHSLRSV ERAAEFSWDH SARQLVQLAE EAARINRR