Gene Arth_3206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_3206 
Symbol 
ID4444196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3612878 
End bp3614323 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content58% 
IMG OID639691030 
Productundecaprenyl-phosphate galactose phosphotransferase 
Protein accessionYP_832682 
Protein GI116671749 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAGAGA CGCACGCGCA TGACGCTTGG CGGCGCAGGT ACTCTCGGCG GTTGAAACTT 
GTTGATGCTT TTGTAATCGT TTGGGCAGTA GCTGGGGCGT TCGCGGTCCG TTTCGGTTTA
TCTGAGGTCC CTAACGGCAA TGATCGCGAC ATCGATTACG CGGTGCTGTC GGGGGCACTC
ATTGTCGCTT GGTGGTTCAT GCTTGAGTTC TGGGGCTCCC GCGATTCTCG GGTATTAGGC
TCCGGTTCCG AAGAATACAA AAGAGTCCTA GCCAGTTCAG CTTGGCTGTT TGGGTTTGTA
GCCGTGGTGT CATACGCCCT AAGAATCGAT ACGGCGCGAG GATTTGTGGG TCTGGCCTTT
CCAGCCGGGG CGCTTGGCCT GCTAGCAGCG CGGTGGCTGG TCCGCCAGCA CCTGAGCCTC
GAACGCAAGC ACGGCAAGAG TAACTCCCGT GTGCTGATTA TTGGGGGACC GCACTCGGCT
TCGCACTTAG TGCGTTCCCT AAGCAGTGCA CCAGCTGCAG GATATATGCC TGTTGCGGCA
CACTTGCCAG GAGCGACAGG AACAGCAGGG CTTTCCGGGC TCACAGTGCC CGTGACGGGT
TTAGACGCCG ACTTTGACAG TATTCTTGGC GTGATATTGG CCACGAACGT TGATGCCGTT
GCCATCTCGG CTGGCGTCAA CATGCATCCG CAAGATCTTC GAAGGCTAGG GTGGGAACTA
GCCGCGCGAG ACATCGGCAT GATCTTGGCG CCTGCCCTGA CCGACATTGC TGGACCTCGT
ATCCATACCC AGCCTGTCGC AGGTTTACCT CTGATCCATG TGTCCACGCC TAAGCTCACA
GGCGGGAAGA AAGTGGCCAA GCGGGCGTTC GATATAGTAG TTGCGGGTCT GCTGGTTGCC
TGCCTCGCTC CGCTGTTTCT CGTGTTGGCT GTACTCGTCC GCTTTACGGA TCCTGGCCCT
GTGTTCTATC GCCAAGAACG AATTGGTCTC CGCGGCACGA CTTTCCACAT GCTGAAGTTC
CGGTCTATGA AAGTGGACGC TGACGCCCAG TTGGGCGAGT TACTAGCAGC ACAAGGCTCC
GCTGATACAC CTCTTTTCAA GGTTGAAAAT GACCCGAGGA TCACACCCCT GGGACGGGTC
TTGCGAAAGT ACTCTCTGGA TGAACTGCCC CAGCTACTCA ATGTGCTGGG CGGCAGCATG
AGCCTTGTCG GCCCGAGGCC GCAGCGCGAA GGCGAAGTTG CCCTCTATGA CGACGCGGCC
CATCGGCGGC TCTACGTTAG TCCTGGCATG AGCGGCCTTT GGCAGGTCAG TGGGCGCTCC
AATCTTAGCT GGGAGGAGAG CATCCGGCTC GACCTCTACT ATGTGGAAAA CTGGTCGCTC
ATGGGTGACG TAGTCATTCT CTTCAAGACT TTCAAAGCCG TATTTGCAAG CACGGGCGCG
GTTTGA
 
Protein sequence
MEETHAHDAW RRRYSRRLKL VDAFVIVWAV AGAFAVRFGL SEVPNGNDRD IDYAVLSGAL 
IVAWWFMLEF WGSRDSRVLG SGSEEYKRVL ASSAWLFGFV AVVSYALRID TARGFVGLAF
PAGALGLLAA RWLVRQHLSL ERKHGKSNSR VLIIGGPHSA SHLVRSLSSA PAAGYMPVAA
HLPGATGTAG LSGLTVPVTG LDADFDSILG VILATNVDAV AISAGVNMHP QDLRRLGWEL
AARDIGMILA PALTDIAGPR IHTQPVAGLP LIHVSTPKLT GGKKVAKRAF DIVVAGLLVA
CLAPLFLVLA VLVRFTDPGP VFYRQERIGL RGTTFHMLKF RSMKVDADAQ LGELLAAQGS
ADTPLFKVEN DPRITPLGRV LRKYSLDELP QLLNVLGGSM SLVGPRPQRE GEVALYDDAA
HRRLYVSPGM SGLWQVSGRS NLSWEESIRL DLYYVENWSL MGDVVILFKT FKAVFASTGA
V