Gene Achl_3900 details


Gene Information

Locus tag: Achl_3900
Symbol:
ID: 7295388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 4340355
End bp: 4341620
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 59%
IMG OID: 643592309
Product: glycosyl transferase group 1
Protein accession: YP_002489941
Protein GI: 220914632
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
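
The coordinate and length fields above are consistent with each other. A minimal sketch of the arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop (variable names are illustrative, not part of the record):

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
start_bp = 4340355
end_bp = 4341620

# Inclusive coordinates: both end positions count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 nt; the last codon is assumed to be a stop
# codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1266
print(protein_length)  # 421
```

Both values match the Gene Length and Protein Length fields listed in the record.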


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCGGC AAGTCACGCG CGTGGCTGCC TCTCAAAACC CTGGGGACCT GGCGGCCAAG 
GACCGGATGA GGATCTTGGT CTACCCACAT GAACTGGCAA TGGGTGGCAG CCAAATCAAC
GCCCTGGATC TTGCCGCCGC TGTCCGCGAA TTGGGGCACG ACGTGTGCAT CTACGCCACA
GACGGCGTCC TGCTGGACAA AGTGAACGAA CTGGAACTGC CCTACATCCG CGCCCCGCGA
AGCCGATACT CCGTTGATCC CGGAACCATC CGGAAACTGA ATTCAACCGT CCGCCAAATG
AATATCGATA TTGTGCACGC TTATGAATGG ACCGCAATAG TTAATACCGG GTTTGGTCCA
CATCTCACGG GGAAGCCCAA GGCTGTCATG ACGGTGCTTT CCATGGATAT TCCCGATTTT
CTTCCACGCG ATATTCCCAT GATCGTGGGT ACGCAGGGGC TCGCTGAGCT GGAGGCACAG
CGCCGGGATA ACGTTCATGT CATCGAGCCC CCGATCAATA CCGAACTTAA CAAATCCAGG
GGGACAGATA CGGCGCGCCG GCAAATCGGA ATTCCATCCC AGTCTCTGGT CGTAAGTATT
ATTGGTCGGC TCACCACGGA TCTCGGCAAA CTCGACGGTG TGCTGGCTGC CATAAGGGTC
ATCGACAAGC TGGCCCTTGT GCGGGACGTA ATCTTGGCCA TTGCCGGTGA CGGTGAGGGG
TTCACGGAGG TGCACGGGCT CGCATCCACC GTGAACCAAC GCCACGGGAG AGAAGTGGTT
CGCGTCTTGG GAAACGTCCT GGATCCCCGC CCCGTTTACG ACGCCGCCGA CGTCGTTCTT
GGCATGGGCA GTTCGGCGCT TCGAGGCATG GCATTCTCTA AGCCTCTGAT CGTGCAGGGA
CGTTGCGGCT TCTGGCGGAC GGCGAGCCCG GAGACTGAAG CGCAATTCCT GAAAGACGGA
TGGTTCGGAA CTGTGGGCGC GGGTGAAGTC GAACTCGAAA GTGCCCTGTC AGAGCTTCTG
GCCGATGCGC CACGGCGGGA GATGTTGGGA GACTACGGGC GCCGCCTGGT TGAAGAGAGG
TTCGGCGTGG CCCAGGCAGC AGAACGTCTC GTATCCATAT ATCGCGCCCA GCTTTCCGCG
CCCAGGAAAC CCGCGGCCCA TGCAGCCTCT GTCGCCCGAT CTGGATTCGA GACGGCGAAA
TTCCGACTCT CAATGCGAAG GCAGATGCGT TCCGTTCGCA TGACCAACTC CCCAGGGGGA
ACCTAA
 
Protein sequence
MGRQVTRVAA SQNPGDLAAK DRMRILVYPH ELAMGGSQIN ALDLAAAVRE LGHDVCIYAT 
DGVLLDKVNE LELPYIRAPR SRYSVDPGTI RKLNSTVRQM NIDIVHAYEW TAIVNTGFGP
HLTGKPKAVM TVLSMDIPDF LPRDIPMIVG TQGLAELEAQ RRDNVHVIEP PINTELNKSR
GTDTARRQIG IPSQSLVVSI IGRLTTDLGK LDGVLAAIRV IDKLALVRDV ILAIAGDGEG
FTEVHGLAST VNQRHGREVV RVLGNVLDPR PVYDAADVVL GMGSSALRGM AFSKPLIVQG
RCGFWRTASP ETEAQFLKDG WFGTVGAGEV ELESALSELL ADAPRREMLG DYGRRLVEER
FGVAQAAERL VSIYRAQLSA PRKPAAHAAS VARSGFETAK FRLSMRRQMR SVRMTNSPGG
T
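
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how such a block-formatted gene sequence could be cleaned, checked for GC content, and translated. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; alternative start codons are not handled specially, which is harmless here because the gene starts with ATG.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, listed in the
# conventional T, C, A, G ordering of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final line of the gene sequence in this record.
print(translate("ATGGGGCGGC AAGTCACGCG CGTGGCTGCC"))  # MGRQVTRVAA
print(translate("ACCTAA"))                            # T
```

The two printed fragments agree with the first ten residues and the final residue of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.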