Gene Arth_4051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4051 
Symbol 
ID4447782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4572274 
End bp4574928 
Gene Length2655 bp 
Protein Length884 aa 
Translation table11 
GC content68% 
IMG OID639691882 
ProductWecB/TagA/CpsF family glycosyl transferase 
Protein accessionYP_833526 
Protein GI116672593 
COG category[M] Cell wall/membrane/envelope biogenesis
[R] General function prediction only 
COG ID[COG1216] Predicted glycosyltransferases
[COG1922] Teichoic acid biosynthesis proteins 
TIGRFAM ID[TIGR00696] bacterial polymer biosynthesis proteins, WecB/TagA/CpsF family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAACC TGGACGCCGA CAAAGTCATC CTGGGTGGCA CTCCGGTCGA TCTGATGGAT 
CCCGAGCCGG CCTTGGAACT CATTCTGGCC CGCGCCGGAC AACGTGGCTT GCCGCCCCTC
GGTGTTGCTT CCGTCAACCT GGACCATCTG CACCATTTCG GGGCCGGCGG ACGCTGGGAA
GGCACGCTGC ATGCCGATCC GTCCAGCACG GTCGACTGGC TCTATCTCCT CGACGGTGCA
CCGCTGGTGT CACAATCCCA ACGGCTGACC GGGCATCGCT GGCCGCGGCT GGCGGGCAGC
GATCTCGCCT CGCCGCTACT GGACCGGGCC GAGCAATTGG GACTCAGGGT GGGTTTTCTT
GGCGGTTCCA GCGAGAACCA GCAGCTTCTG GCCCAAAAGA TCGCCGAAGA GCATCCACGC
CTCCAGGTCG CGGGAATGTG GTCCCCGGAC CGGGAAGAAC TGGCCTCCGC CTGCGACTCG
GAACGGATCG CGGCAGCCAT TGCAGCCGCG GGCACGCAGA TTCTCTACGT GGGCCTGGGC
AAGCCAAGGC AGGAACTCTG GATCGACCAC TATGCCGCCC TTACCGGCGC CGCGGTGCTG
CTCGCCTTCG GCGCAGCCGT GGACTTCCTG GCCGGTAGGG TGCACCGGGC CCCCCAATGG
GCCAGCGACC ACGGACTGGA GTGGGGTTAC CGGCTGGCCC TGGAACCCAA GCGGCTCGCC
AGCCGATACC TGGTCGACGG TCCGCCGGCC TACATCAAGC TGCGCACAGC ATCGTGCACG
GTACCCCACG CCCGGCTGGA GTCGGCGCCG TCGGCACCTC CAGCTCCGCG CACTCCCGGC
CGCTTCACCG GACCCGAAGG CGCAGCGGAT GCCGCCGTCG TCGTCGTGAC CTTCAACAGC
GCCAAAGGCG TCGGCCCCCT GCTGGCGAGT CTTCGGGAGG AGACGGCGGA TCTCACGCTG
CGCGTGCTTG TGGCCGACAA CTCGTCCAGC GACGGCACCC TGGCACTGGT GCGCAGCGCC
CACCCGGACG TCATCGCGTT CGCAACCGGA GGCAACCTCG GATATTCCGG CGGGATCAAT
GCCGCCATGC GACAGGCCGG GGACAGCGCC ACCGTGGTGG TGCTCAACCC CGACGTCACC
GTGGAACGGG GAAGCCTGAA AACCATGATG CACCGCTTGC GTTCCTCCCG TGCCGGGGCC
GTCGTTCCGC GTCTCCTGGA CGAAAACGGC GCAACGAGTC CCTCCCTCTA CCGCGAGCCG
AGCCTGGCCA ACGCGACGGG AGACGCATTG TTGGGACGGC GGTTCCCGGA CCGGCCGGGC
TGGCTTGCCG GGACCGACTA CAACCCGGAG AGCTACGCCC ACCCGCACAC CGTGCACTGG
GCCACCGGGG CTGCGCTGAT GGTCCGCCGC AGCCTGGCCG ATTCCTTGGC CTGGGACGAG
TCCTACTTCC TTTATTCGGA GGAGGTCGAT TTCTTCCGCG GCCTGCGGAC GATGGGGGAA
ACTATCTGGT ACGAACCGGC GGCAGCCATG ACCCATGCGG GTGCCGGATC GGGCGCGTCA
CCAAGGCTCA ACGCCTTGCT GGCCGTCAAC CGTGTGAGGT ACATCCGCAA GTACCATTCG
TCGGCCTACG CCAAGGTCTT TCACGGTGCG GTGATCCTGT CCGAGCTGCT CAGGTGCTGG
AAACCGGACC GCCGGGGAGT GCTCCGGACT GTTCTGGACG AGGGCAGCTG GACCGACCTG
CCGGGCGCCA CCAGGGATCC CGACCCCGCC CACTTTCCGG GCGGTTCGGT GATTATTCCG
GCGCACAATG AGGCCAGTGT GATCGCCCGG ACGCTCGCCC CGCTTGCTCC GCTGGCGGCG
GCCGGCCAGA TCGAAGTGGT TGTGGTGTGC AACGGCTGCT CGGACAACAC CGCCGCGATC
GCCCGCGGCT TCGCGGGCGT GACAGTGCTT GAGATCGGAC GACCCTCCAA GTCAGCTGCC
CTCAACGCCG GCGATGCAGC CGCCACGAAG TGGCCCAGGC TCTATCTGGA TGCTGATGTG
CAAATCAGCC GGCACGCAGT GCGCGATGTG TTGACGGCGC TGGAGGCCGG CGGACCTTTG
GCGGCGCGCC CGGCCGTGCA ATTCGACCTC CAGGACGCCC ATCCGCTGAT CCACTCCTAT
TACCGCACGC GGCTGCGCCT GCCCTCTGCG CGGAACCGGC TGTGGGCGGG CGGCGTCTAC
GGTTTGTCGG AGCAGGGGCG TAAGCGCTTC CAGGAATTCC CGGACCTGAC AGCGGACGAC
CTCTTCGTCG ACCGGCTTTT CGAGCCGTCG GAAAAGGCTG TGCTCGACGT CGACCCCGTT
GTAATCCGTC CGCCCAGGAC ACCAAGGGAC CAGGTGGCAG TCCTTCACCG CGTATACCGG
GGTAACGCCG AACAGAACGG CGACGCCGGC CAGCACAGCA CCGCCCGGCA GACACTCGCG
GAGGTGCTGC GCTCTGTTCG TGGCCCGCTC TCCGCTGCCC ACGCCGCGGT CTATTTGGGA
TTCGCTGTCG CAGGCCGGCA TGGTGCCGCC GGCCCGGGTC CTGGCGGCTG GGAACGCGAC
GAAAGCAGCC GTGCACCCGC AGGATTGCAT CCAGGGGCCC CGGCAGGTAA CACGGGGGTG
TCCGGCGGCA AGTAG
 
Protein sequence
MENLDADKVI LGGTPVDLMD PEPALELILA RAGQRGLPPL GVASVNLDHL HHFGAGGRWE 
GTLHADPSST VDWLYLLDGA PLVSQSQRLT GHRWPRLAGS DLASPLLDRA EQLGLRVGFL
GGSSENQQLL AQKIAEEHPR LQVAGMWSPD REELASACDS ERIAAAIAAA GTQILYVGLG
KPRQELWIDH YAALTGAAVL LAFGAAVDFL AGRVHRAPQW ASDHGLEWGY RLALEPKRLA
SRYLVDGPPA YIKLRTASCT VPHARLESAP SAPPAPRTPG RFTGPEGAAD AAVVVVTFNS
AKGVGPLLAS LREETADLTL RVLVADNSSS DGTLALVRSA HPDVIAFATG GNLGYSGGIN
AAMRQAGDSA TVVVLNPDVT VERGSLKTMM HRLRSSRAGA VVPRLLDENG ATSPSLYREP
SLANATGDAL LGRRFPDRPG WLAGTDYNPE SYAHPHTVHW ATGAALMVRR SLADSLAWDE
SYFLYSEEVD FFRGLRTMGE TIWYEPAAAM THAGAGSGAS PRLNALLAVN RVRYIRKYHS
SAYAKVFHGA VILSELLRCW KPDRRGVLRT VLDEGSWTDL PGATRDPDPA HFPGGSVIIP
AHNEASVIAR TLAPLAPLAA AGQIEVVVVC NGCSDNTAAI ARGFAGVTVL EIGRPSKSAA
LNAGDAAATK WPRLYLDADV QISRHAVRDV LTALEAGGPL AARPAVQFDL QDAHPLIHSY
YRTRLRLPSA RNRLWAGGVY GLSEQGRKRF QEFPDLTADD LFVDRLFEPS EKAVLDVDPV
VIRPPRTPRD QVAVLHRVYR GNAEQNGDAG QHSTARQTLA EVLRSVRGPL SAAHAAVYLG
FAVAGRHGAA GPGPGGWERD ESSRAPAGLH PGAPAGNTGV SGGK