Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Arth_4051 |
Symbol | |
ID | 4447782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Arthrobacter sp. FB24 |
Kingdom | Bacteria |
Replicon accession | NC_008541 |
Strand | - |
Start bp | 4572274 |
End bp | 4574928 |
Gene Length | 2655 bp |
Protein Length | 884 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639691882 |
Product | WecB/TagA/CpsF family glycosyl transferase |
Protein accession | YP_833526 |
Protein GI | 116672593 |
COG category | [M] Cell wall/membrane/envelope biogenesis [R] General function prediction only |
COG ID | [COG1216] Predicted glycosyltransferases [COG1922] Teichoic acid biosynthesis proteins |
TIGRFAM ID | [TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAAACC TGGACGCCGA CAAAGTCATC CTGGGTGGCA CTCCGGTCGA TCTGATGGAT CCCGAGCCGG CCTTGGAACT CATTCTGGCC CGCGCCGGAC AACGTGGCTT GCCGCCCCTC GGTGTTGCTT CCGTCAACCT GGACCATCTG CACCATTTCG GGGCCGGCGG ACGCTGGGAA GGCACGCTGC ATGCCGATCC GTCCAGCACG GTCGACTGGC TCTATCTCCT CGACGGTGCA CCGCTGGTGT CACAATCCCA ACGGCTGACC GGGCATCGCT GGCCGCGGCT GGCGGGCAGC GATCTCGCCT CGCCGCTACT GGACCGGGCC GAGCAATTGG GACTCAGGGT GGGTTTTCTT GGCGGTTCCA GCGAGAACCA GCAGCTTCTG GCCCAAAAGA TCGCCGAAGA GCATCCACGC CTCCAGGTCG CGGGAATGTG GTCCCCGGAC CGGGAAGAAC TGGCCTCCGC CTGCGACTCG GAACGGATCG CGGCAGCCAT TGCAGCCGCG GGCACGCAGA TTCTCTACGT GGGCCTGGGC AAGCCAAGGC AGGAACTCTG GATCGACCAC TATGCCGCCC TTACCGGCGC CGCGGTGCTG CTCGCCTTCG GCGCAGCCGT GGACTTCCTG GCCGGTAGGG TGCACCGGGC CCCCCAATGG GCCAGCGACC ACGGACTGGA GTGGGGTTAC CGGCTGGCCC TGGAACCCAA GCGGCTCGCC AGCCGATACC TGGTCGACGG TCCGCCGGCC TACATCAAGC TGCGCACAGC ATCGTGCACG GTACCCCACG CCCGGCTGGA GTCGGCGCCG TCGGCACCTC CAGCTCCGCG CACTCCCGGC CGCTTCACCG GACCCGAAGG CGCAGCGGAT GCCGCCGTCG TCGTCGTGAC CTTCAACAGC GCCAAAGGCG TCGGCCCCCT GCTGGCGAGT CTTCGGGAGG AGACGGCGGA TCTCACGCTG CGCGTGCTTG TGGCCGACAA CTCGTCCAGC GACGGCACCC TGGCACTGGT GCGCAGCGCC CACCCGGACG TCATCGCGTT CGCAACCGGA GGCAACCTCG GATATTCCGG CGGGATCAAT GCCGCCATGC GACAGGCCGG GGACAGCGCC ACCGTGGTGG TGCTCAACCC CGACGTCACC GTGGAACGGG GAAGCCTGAA AACCATGATG CACCGCTTGC GTTCCTCCCG TGCCGGGGCC GTCGTTCCGC GTCTCCTGGA CGAAAACGGC GCAACGAGTC CCTCCCTCTA CCGCGAGCCG AGCCTGGCCA ACGCGACGGG AGACGCATTG TTGGGACGGC GGTTCCCGGA CCGGCCGGGC TGGCTTGCCG GGACCGACTA CAACCCGGAG AGCTACGCCC ACCCGCACAC CGTGCACTGG GCCACCGGGG CTGCGCTGAT GGTCCGCCGC AGCCTGGCCG ATTCCTTGGC CTGGGACGAG TCCTACTTCC TTTATTCGGA GGAGGTCGAT TTCTTCCGCG GCCTGCGGAC GATGGGGGAA ACTATCTGGT ACGAACCGGC GGCAGCCATG ACCCATGCGG GTGCCGGATC GGGCGCGTCA CCAAGGCTCA ACGCCTTGCT GGCCGTCAAC CGTGTGAGGT ACATCCGCAA GTACCATTCG TCGGCCTACG CCAAGGTCTT TCACGGTGCG GTGATCCTGT CCGAGCTGCT CAGGTGCTGG AAACCGGACC GCCGGGGAGT GCTCCGGACT GTTCTGGACG AGGGCAGCTG GACCGACCTG CCGGGCGCCA CCAGGGATCC CGACCCCGCC CACTTTCCGG GCGGTTCGGT GATTATTCCG GCGCACAATG AGGCCAGTGT GATCGCCCGG ACGCTCGCCC CGCTTGCTCC GCTGGCGGCG GCCGGCCAGA TCGAAGTGGT TGTGGTGTGC AACGGCTGCT CGGACAACAC CGCCGCGATC GCCCGCGGCT TCGCGGGCGT GACAGTGCTT GAGATCGGAC GACCCTCCAA GTCAGCTGCC CTCAACGCCG GCGATGCAGC CGCCACGAAG TGGCCCAGGC TCTATCTGGA TGCTGATGTG CAAATCAGCC GGCACGCAGT GCGCGATGTG TTGACGGCGC TGGAGGCCGG CGGACCTTTG GCGGCGCGCC CGGCCGTGCA ATTCGACCTC CAGGACGCCC ATCCGCTGAT CCACTCCTAT TACCGCACGC GGCTGCGCCT GCCCTCTGCG CGGAACCGGC TGTGGGCGGG CGGCGTCTAC GGTTTGTCGG AGCAGGGGCG TAAGCGCTTC CAGGAATTCC CGGACCTGAC AGCGGACGAC CTCTTCGTCG ACCGGCTTTT CGAGCCGTCG GAAAAGGCTG TGCTCGACGT CGACCCCGTT GTAATCCGTC CGCCCAGGAC ACCAAGGGAC CAGGTGGCAG TCCTTCACCG CGTATACCGG GGTAACGCCG AACAGAACGG CGACGCCGGC CAGCACAGCA CCGCCCGGCA GACACTCGCG GAGGTGCTGC GCTCTGTTCG TGGCCCGCTC TCCGCTGCCC ACGCCGCGGT CTATTTGGGA TTCGCTGTCG CAGGCCGGCA TGGTGCCGCC GGCCCGGGTC CTGGCGGCTG GGAACGCGAC GAAAGCAGCC GTGCACCCGC AGGATTGCAT CCAGGGGCCC CGGCAGGTAA CACGGGGGTG TCCGGCGGCA AGTAG
|
Protein sequence | MENLDADKVI LGGTPVDLMD PEPALELILA RAGQRGLPPL GVASVNLDHL HHFGAGGRWE GTLHADPSST VDWLYLLDGA PLVSQSQRLT GHRWPRLAGS DLASPLLDRA EQLGLRVGFL GGSSENQQLL AQKIAEEHPR LQVAGMWSPD REELASACDS ERIAAAIAAA GTQILYVGLG KPRQELWIDH YAALTGAAVL LAFGAAVDFL AGRVHRAPQW ASDHGLEWGY RLALEPKRLA SRYLVDGPPA YIKLRTASCT VPHARLESAP SAPPAPRTPG RFTGPEGAAD AAVVVVTFNS AKGVGPLLAS LREETADLTL RVLVADNSSS DGTLALVRSA HPDVIAFATG GNLGYSGGIN AAMRQAGDSA TVVVLNPDVT VERGSLKTMM HRLRSSRAGA VVPRLLDENG ATSPSLYREP SLANATGDAL LGRRFPDRPG WLAGTDYNPE SYAHPHTVHW ATGAALMVRR SLADSLAWDE SYFLYSEEVD FFRGLRTMGE TIWYEPAAAM THAGAGSGAS PRLNALLAVN RVRYIRKYHS SAYAKVFHGA VILSELLRCW KPDRRGVLRT VLDEGSWTDL PGATRDPDPA HFPGGSVIIP AHNEASVIAR TLAPLAPLAA AGQIEVVVVC NGCSDNTAAI ARGFAGVTVL EIGRPSKSAA LNAGDAAATK WPRLYLDADV QISRHAVRDV LTALEAGGPL AARPAVQFDL QDAHPLIHSY YRTRLRLPSA RNRLWAGGVY GLSEQGRKRF QEFPDLTADD LFVDRLFEPS EKAVLDVDPV VIRPPRTPRD QVAVLHRVYR GNAEQNGDAG QHSTARQTLA EVLRSVRGPL SAAHAAVYLG FAVAGRHGAA GPGPGGWERD ESSRAPAGLH PGAPAGNTGV SGGK
|
| |