Gene Achl_3908 details


Gene Information

Locus tag: Achl_3908
Symbol:
ID: 7295396
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Arthrobacter chlorophenolicus A6
Kingdom: Bacteria
Replicon accession: NC_011886
Strand:
Start bp: 4350311
End bp: 4353034
Gene Length: 2724 bp
Protein Length: 907 aa
Translation table: 11
GC content: 59%
IMG OID: 643592317
Product: glycosyl transferase, WecB/TagA/CpsF family
Protein accession: YP_002489949
Protein GI: 220914640
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases; [COG1922] Teichoic acid biosynthesis proteins
TIGRFAM ID: [TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family
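
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4350311
END_BP = 4353034
GENE_LENGTH_BP = 2724
PROTEIN_LENGTH_AA = 907


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced, complete CDS)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the ones listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold: 4353034 - 4350311 + 1 = 2724 bp, and 2724 / 3 - 1 = 907 aa.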


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 120
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAATGT CTTCTTCACT TCGCGCACTT GTGCAGCCGG CAGCTTGGTT GGATGGGACG 
ACCTCCCGGC ATCCTGCCGG AGACGTCAAC GCCAAGTCTG GCGCGCCCCT CCAGCTTTCC
GCGGACAACG CTTGGGTCAC ATTGGGTGGT TCTCCAGTTC GGCTCCTGGA TTTTGAGGAG
GCAGTCGAAC TGATCATGCA GCGATCACGG CCTGGACGTA CGCCGCTGGC CGTGGCCTCC
GCGAACCTTG ACCACCTCCA GCATTTCGGC GCAGGTGCCC GTTGGGCCGG GATCCTTGAA
CGGCAAGATA CGCCCGAATG GCTGTCGTTG CTGGACGGAG CTCCTTTGGT CCGTCACGTA
CAAGGGATGA CGGGTCGAAC GTGGCCACGA CTCTCGGGCA GTGACCTGAT TGGGCCAATA
TTGGACCGCG CGGAGCTGGC CGGCATCCGG GTGGGCTTCC TGGGCGGATC CGAGGAAGTT
CACACACAGG TACGGTCAAG GTTGGCCACG AGCCATCCAA GGCTCGTGAT TTCGGGCTTT
TGGTCGCCGG CCCGAAGCGA GCTGGCCGAC CACGTGGCTT CCTCGTCCCT TGCCACCCGG
ATCGCTGCCA CCGACACAGA CATACTGGTT GTGTGCCTGG GAAAGCCGCG TCAGGAACTG
TGGATCGCAG AGTACGGATA CCAGACTGGA GCCAACGTGA TGTTGGCCTT CGGCGCCGCA
GTGGATTTCC TGGGGGGACG TGTCCGGCGC GCACCGGCGG TCGCCCAGAA CGTAGGCATG
GAATGGGCCT GGCGCTTGGC TTTGGAACCT CGGCGCTTGG CCAATCGGTA CCTGGTTCAG
GGGCCGGAGG CTTATCTCAG GTTGCTGTCA GTCAGTTCCT TTGGCCGAGA AAGCTTTGCG
CCTCGACAAC AGCCGCAGGA TTACGCCAGT AAAGACCTGA CTGACGAAGG GTTTTCACCT
CTGACGTCGG AAACGGATGT CGCGGTCATC ATCGTCACTT ACAACAATGA ACGGGATATT
CCGCTGCTTC TCAAAAGCCT ACAAGGGGAG TCACGGGAGC AATCCATCAA AGTCATAGTC
GCAGATAACT CCCCGGGTCC CTCCACACTC GCAGCCCTGG AAGGATTTTC AGACGTGCAT
GCGATTGCGA CCGGCGGAAA TTTGGGCTAT GCCGCTGCCA TTAATTTAGC CATGCAGGAA
ATTGGTGCTG CCCGTTCCTT CCTGGTTCTG AATCCGGATC TACAGGTCGA ACCGGGCGCC
ATACGCGCAA TGCGTCATCG GATGGCGATC TCGGGAGCCG GGGTTGTGGT GCCGCTCCTC
AAAGACGACA ATGGCACTGT TTACCCCTCG CTGCGCCGCG AACCCACCGT GACAAGGGCA
ATTGGCGACG CGGTCATGGG TAGCAAGCTC TCCGGAAGGC CCGCCTGGCT CTCTGAAATG
GACTTCGACA ACGAAAGCTA TATGCACGCC CATAAAGTGG ACTGGGCCAC GGGAGCAGCA
CTCCTCATTC ATCGCGACGT TGCACAACTA GTGGGTGATT GGGACGAGGA CTATTTTCTC
TACTCGGAAG AAACTGACTT CATGCATCGG GTCCGCCAGG CAGGATGGGA GATATGGTTC
GAGTCCCAAG CAGTGATGAG CCATTCCAGA GGTGGCTCGG GAACATCACT TGCCTTAAAC
GCCTTAATGG CGATCAACCG GATCAAGTAC ATTCGCAAGT TCCACACCCG ACCGTACTCG
CGAGCATTTC GAAGCGCTGT TATCCTCTCG GCTCTGCTGC GGGTGCCTGT GACCCCTGGA
ATCGGAGTCC TCGCGGCGGT GCTTCGTGAA GGATCGTGGG GTGAATTGCC TCATGCCGAG
ATCTATCCTG AAGGGGTACG TGTCCCCGCC GCGATACCGA CGGGCACAGT TATCATCCCA
GCCCACAACG AGGCCAGCGT GCTCCGACGG ACACTGGACG GTCTTGTCCC GGCCATGGTG
GGAGGTACGG TGGAAGTCAT CGTTGCCTGC AATGGTTGCA CCGACGATAC TGCATCTATT
GCACGATCCT ACAAGGACGC CAGGGTGATT GAAGTTGAGG AAGCCTCCAA GACCGCAGCC
CTGAATGCCG GAGATCAGGT GGCAACCCGC TGGCCGCGGA TGTATCTTGA TGCCGACATT
GAGCTTCCTT TGGAAGCGTT GTGTGCCACC CTGGAGCTTC TGGGTGAGGG CGGAGCCATT
CTTTGCGCTC GTCCGGCCTA CCGCTATGAC TTTAGCGGTG CTTCGTGGCC CGTCCGGGCG
TTCTACAGGG CACGGAACCG TCTTCCGAAG CCAGCTGAAT CCATATGGGG AGCGGGCGTG
TATGCAATCA GCAGGAAAGG GAAGGCGCGG CTCCCCGAAT TCCCCTCGGT AGCTGCCGAT
GACTGCTTGG TTGACCGGCT CTATAGTGAC AAGGAAAAGG CAGTTGTGCA GTGCGCGCCC
GCGACGGTTC GAACACCCCG CACAACCGGG AGTCTTTTGA AGACGTTAGG CAGGAACTAT
CGCAGCAATG TCATCTTGCG CGATGTTCCG GGGTCCCACA CTATGCAGAC ACTCAGGGAC
TTAGTCGGTT CAGTAAGCGG TCCGAGATCC GCGGTGGAAG CTGGCGTTTA CGCTGCTTTT
GCCTTGGCAG GCCGGCTTCA CGCCCGCCGG TGGGTGGGTC TCGAATCAGC AGCGTGGGAG
AGCGACGAGT CAAGCAGGCT GTAA
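
A GC fraction can be computed directly from a sequence laid out in 10-base blocks like the one above. This is a rough sketch rather than the database's own method; `gc_fraction` is a hypothetical helper, and it is shown on the first block only so that no result is asserted for the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence above.
first_block = "ATGACAATGT"
print(f"{gc_fraction(first_block):.0%}")
```

Passing the whole gene sequence, line breaks and spaces included, to `gc_fraction` would give the gene-level value.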
 
Protein sequence
MTMSSSLRAL VQPAAWLDGT TSRHPAGDVN AKSGAPLQLS ADNAWVTLGG SPVRLLDFEE 
AVELIMQRSR PGRTPLAVAS ANLDHLQHFG AGARWAGILE RQDTPEWLSL LDGAPLVRHV
QGMTGRTWPR LSGSDLIGPI LDRAELAGIR VGFLGGSEEV HTQVRSRLAT SHPRLVISGF
WSPARSELAD HVASSSLATR IAATDTDILV VCLGKPRQEL WIAEYGYQTG ANVMLAFGAA
VDFLGGRVRR APAVAQNVGM EWAWRLALEP RRLANRYLVQ GPEAYLRLLS VSSFGRESFA
PRQQPQDYAS KDLTDEGFSP LTSETDVAVI IVTYNNERDI PLLLKSLQGE SREQSIKVIV
ADNSPGPSTL AALEGFSDVH AIATGGNLGY AAAINLAMQE IGAARSFLVL NPDLQVEPGA
IRAMRHRMAI SGAGVVVPLL KDDNGTVYPS LRREPTVTRA IGDAVMGSKL SGRPAWLSEM
DFDNESYMHA HKVDWATGAA LLIHRDVAQL VGDWDEDYFL YSEETDFMHR VRQAGWEIWF
ESQAVMSHSR GGSGTSLALN ALMAINRIKY IRKFHTRPYS RAFRSAVILS ALLRVPVTPG
IGVLAAVLRE GSWGELPHAE IYPEGVRVPA AIPTGTVIIP AHNEASVLRR TLDGLVPAMV
GGTVEVIVAC NGCTDDTASI ARSYKDARVI EVEEASKTAA LNAGDQVATR WPRMYLDADI
ELPLEALCAT LELLGEGGAI LCARPAYRYD FSGASWPVRA FYRARNRLPK PAESIWGAGV
YAISRKGKAR LPEFPSVAAD DCLVDRLYSD KEKAVVQCAP ATVRTPRTTG SLLKTLGRNY
RSNVILRDVP GSHTMQTLRD LVGSVSGPRS AVEAGVYAAF ALAGRLHARR WVGLESAAWE
SDESSRL
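
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table by hand instead of using a library; translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may start translation, so the standard assignments are used here. `translate` is an illustrative helper, not part of any database tooling.

```python
# Codon table built from the conventional TCAG ordering.
# Translation table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: amino_acid
    for (first, second, third), amino_acid in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence in this record.
print(translate("ATGACAATGT CTTCTTCACT TCGCGCACTT"))
print(translate("AGCGACGAGT CAAGCAGGCT GTAA"))
```

The two fragments translate to `MTMSSSLRAL` and `SDESSRL`, which match the start and end of the listed protein sequence.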