Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Achl_3908 |
Symbol | |
ID | 7295396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter chlorophenolicus A6 |
Kingdom | Bacteria |
Replicon accession | NC_011886 |
Strand | - |
Start bp | 4350311 |
End bp | 4353034 |
Gene Length | 2724 bp |
Protein Length | 907 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 643592317 |
Product | glycosyl transferase, WecB/TagA/CpsF family |
Protein accession | YP_002489949 |
Protein GI | 220914640 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases [COG1922] Teichoic acid biosynthesis proteins |
TIGRFAM ID | [TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 120 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAATGT CTTCTTCACT TCGCGCACTT GTGCAGCCGG CAGCTTGGTT GGATGGGACG ACCTCCCGGC ATCCTGCCGG AGACGTCAAC GCCAAGTCTG GCGCGCCCCT CCAGCTTTCC GCGGACAACG CTTGGGTCAC ATTGGGTGGT TCTCCAGTTC GGCTCCTGGA TTTTGAGGAG GCAGTCGAAC TGATCATGCA GCGATCACGG CCTGGACGTA CGCCGCTGGC CGTGGCCTCC GCGAACCTTG ACCACCTCCA GCATTTCGGC GCAGGTGCCC GTTGGGCCGG GATCCTTGAA CGGCAAGATA CGCCCGAATG GCTGTCGTTG CTGGACGGAG CTCCTTTGGT CCGTCACGTA CAAGGGATGA CGGGTCGAAC GTGGCCACGA CTCTCGGGCA GTGACCTGAT TGGGCCAATA TTGGACCGCG CGGAGCTGGC CGGCATCCGG GTGGGCTTCC TGGGCGGATC CGAGGAAGTT CACACACAGG TACGGTCAAG GTTGGCCACG AGCCATCCAA GGCTCGTGAT TTCGGGCTTT TGGTCGCCGG CCCGAAGCGA GCTGGCCGAC CACGTGGCTT CCTCGTCCCT TGCCACCCGG ATCGCTGCCA CCGACACAGA CATACTGGTT GTGTGCCTGG GAAAGCCGCG TCAGGAACTG TGGATCGCAG AGTACGGATA CCAGACTGGA GCCAACGTGA TGTTGGCCTT CGGCGCCGCA GTGGATTTCC TGGGGGGACG TGTCCGGCGC GCACCGGCGG TCGCCCAGAA CGTAGGCATG GAATGGGCCT GGCGCTTGGC TTTGGAACCT CGGCGCTTGG CCAATCGGTA CCTGGTTCAG GGGCCGGAGG CTTATCTCAG GTTGCTGTCA GTCAGTTCCT TTGGCCGAGA AAGCTTTGCG CCTCGACAAC AGCCGCAGGA TTACGCCAGT AAAGACCTGA CTGACGAAGG GTTTTCACCT CTGACGTCGG AAACGGATGT CGCGGTCATC ATCGTCACTT ACAACAATGA ACGGGATATT CCGCTGCTTC TCAAAAGCCT ACAAGGGGAG TCACGGGAGC AATCCATCAA AGTCATAGTC GCAGATAACT CCCCGGGTCC CTCCACACTC GCAGCCCTGG AAGGATTTTC AGACGTGCAT GCGATTGCGA CCGGCGGAAA TTTGGGCTAT GCCGCTGCCA TTAATTTAGC CATGCAGGAA ATTGGTGCTG CCCGTTCCTT CCTGGTTCTG AATCCGGATC TACAGGTCGA ACCGGGCGCC ATACGCGCAA TGCGTCATCG GATGGCGATC TCGGGAGCCG GGGTTGTGGT GCCGCTCCTC AAAGACGACA ATGGCACTGT TTACCCCTCG CTGCGCCGCG AACCCACCGT GACAAGGGCA ATTGGCGACG CGGTCATGGG TAGCAAGCTC TCCGGAAGGC CCGCCTGGCT CTCTGAAATG GACTTCGACA ACGAAAGCTA TATGCACGCC CATAAAGTGG ACTGGGCCAC GGGAGCAGCA CTCCTCATTC ATCGCGACGT TGCACAACTA GTGGGTGATT GGGACGAGGA CTATTTTCTC TACTCGGAAG AAACTGACTT CATGCATCGG GTCCGCCAGG CAGGATGGGA GATATGGTTC GAGTCCCAAG CAGTGATGAG CCATTCCAGA GGTGGCTCGG GAACATCACT TGCCTTAAAC GCCTTAATGG CGATCAACCG GATCAAGTAC ATTCGCAAGT TCCACACCCG ACCGTACTCG CGAGCATTTC GAAGCGCTGT TATCCTCTCG GCTCTGCTGC GGGTGCCTGT GACCCCTGGA ATCGGAGTCC TCGCGGCGGT GCTTCGTGAA GGATCGTGGG GTGAATTGCC TCATGCCGAG ATCTATCCTG AAGGGGTACG TGTCCCCGCC GCGATACCGA CGGGCACAGT TATCATCCCA GCCCACAACG AGGCCAGCGT GCTCCGACGG ACACTGGACG GTCTTGTCCC GGCCATGGTG GGAGGTACGG TGGAAGTCAT CGTTGCCTGC AATGGTTGCA CCGACGATAC TGCATCTATT GCACGATCCT ACAAGGACGC CAGGGTGATT GAAGTTGAGG AAGCCTCCAA GACCGCAGCC CTGAATGCCG GAGATCAGGT GGCAACCCGC TGGCCGCGGA TGTATCTTGA TGCCGACATT GAGCTTCCTT TGGAAGCGTT GTGTGCCACC CTGGAGCTTC TGGGTGAGGG CGGAGCCATT CTTTGCGCTC GTCCGGCCTA CCGCTATGAC TTTAGCGGTG CTTCGTGGCC CGTCCGGGCG TTCTACAGGG CACGGAACCG TCTTCCGAAG CCAGCTGAAT CCATATGGGG AGCGGGCGTG TATGCAATCA GCAGGAAAGG GAAGGCGCGG CTCCCCGAAT TCCCCTCGGT AGCTGCCGAT GACTGCTTGG TTGACCGGCT CTATAGTGAC AAGGAAAAGG CAGTTGTGCA GTGCGCGCCC GCGACGGTTC GAACACCCCG CACAACCGGG AGTCTTTTGA AGACGTTAGG CAGGAACTAT CGCAGCAATG TCATCTTGCG CGATGTTCCG GGGTCCCACA CTATGCAGAC ACTCAGGGAC TTAGTCGGTT CAGTAAGCGG TCCGAGATCC GCGGTGGAAG CTGGCGTTTA CGCTGCTTTT GCCTTGGCAG GCCGGCTTCA CGCCCGCCGG TGGGTGGGTC TCGAATCAGC AGCGTGGGAG AGCGACGAGT CAAGCAGGCT GTAA
|
Protein sequence | MTMSSSLRAL VQPAAWLDGT TSRHPAGDVN AKSGAPLQLS ADNAWVTLGG SPVRLLDFEE AVELIMQRSR PGRTPLAVAS ANLDHLQHFG AGARWAGILE RQDTPEWLSL LDGAPLVRHV QGMTGRTWPR LSGSDLIGPI LDRAELAGIR VGFLGGSEEV HTQVRSRLAT SHPRLVISGF WSPARSELAD HVASSSLATR IAATDTDILV VCLGKPRQEL WIAEYGYQTG ANVMLAFGAA VDFLGGRVRR APAVAQNVGM EWAWRLALEP RRLANRYLVQ GPEAYLRLLS VSSFGRESFA PRQQPQDYAS KDLTDEGFSP LTSETDVAVI IVTYNNERDI PLLLKSLQGE SREQSIKVIV ADNSPGPSTL AALEGFSDVH AIATGGNLGY AAAINLAMQE IGAARSFLVL NPDLQVEPGA IRAMRHRMAI SGAGVVVPLL KDDNGTVYPS LRREPTVTRA IGDAVMGSKL SGRPAWLSEM DFDNESYMHA HKVDWATGAA LLIHRDVAQL VGDWDEDYFL YSEETDFMHR VRQAGWEIWF ESQAVMSHSR GGSGTSLALN ALMAINRIKY IRKFHTRPYS RAFRSAVILS ALLRVPVTPG IGVLAAVLRE GSWGELPHAE IYPEGVRVPA AIPTGTVIIP AHNEASVLRR TLDGLVPAMV GGTVEVIVAC NGCTDDTASI ARSYKDARVI EVEEASKTAA LNAGDQVATR WPRMYLDADI ELPLEALCAT LELLGEGGAI LCARPAYRYD FSGASWPVRA FYRARNRLPK PAESIWGAGV YAISRKGKAR LPEFPSVAAD DCLVDRLYSD KEKAVVQCAP ATVRTPRTTG SLLKTLGRNY RSNVILRDVP GSHTMQTLRD LVGSVSGPRS AVEAGVYAAF ALAGRLHARR WVGLESAAWE SDESSRL
|
| |