Gene Arth_4061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_4061 
Symbol 
ID4447792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp4583846 
End bp4585414 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content65% 
IMG OID639691892 
Productundecaprenyl-phosphate galactose phosphotransferase 
Protein accessionYP_833536 
Protein GI116672603 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAAC TCGGTAAAAC AAGCCCGTTC GTGCTGAAGT TCATTAGTCC CGGAGTGAGT 
TCCGGGACGG TCGCCGGCCC GCCGCGTGCC GCCGTCGGGC ATTCCCTCGG AGGCACCGAG
GGGCAGCTGC CCGGGCGCCG GCTCTCCGGC CGAGAATGGG CGCGCTTCCT TGCCCGCTTC
CTCCGCCTGA CCGACACCCT GGTAGTCGTT TTGTCGGTGC TGATCGGATA CGTGATCCGG
TTCGACGGCG GGCACTCCTT TTCCGAAAGC GGCGACAATG CGTTGGGCGC CGTCCTGGCC
GCAGGCCTCG TGGTGTTATG GACCGGCGCC CTCCACTTTT ACCGGACCAG GGATCCCAAG
GTGCTCGGAG TCGGTGCCAG CGAATACAAA CGCGTCGCCG TTGCAACCAC CCGGGTGTTT
GGCCTGCTGG CCGTGTTCCT GGTGGTATTC CGGCTGGAAC CGGCCTGGGC CTTTTATGTC
GCATCGTTTC CGCTGGGGCT CCTGGCCCTG GTGGGGAACC GCTGGTTCGC CCGCCGCTGG
TACAACCGGA AACAGTCCGA CGGTCAGTTC CTGGTGAGGG CGATAGTCAT TGGACAGCCC
GAAGATGTCC GCTACGTCCT CGGCCGGATC GCCAAGAAGA CGGGTGCAGC ATACGAAATC
CTTGGGGTGT CGCTCCCTGG TGCCCGCCGG GGAATGATGC TCGACGTCGA CGGCCGCCGT
GTTCCAGTGT TGTCCTCCAC CGACGACGTT GTCCGGACCG TGGGAAGGTA CGGCGCCGAA
GCCGTCATCG TGGCGGGACC GGTGCCGGGA GGCAACCAGT ACATCCGGGA GCTGGGGTGG
CGGCTGGAGG AGTATTCCGC CGAACTGGTC CTCGCTTCAA CACTCACGAA TGTCGCCGGG
CCGCGGATCC ACTGGCGGCC GGTGGAGGGA CTGCCTCTGA TGCACGTGGA CGTGCCGCAG
TACAGCGGCG GCAAGCATGT TCTCAAGCGG CTGCTGGACA TTGTGGCCGC CTCCTTGGCC
CTCATCCTGC TGTCGCCGCT GTTCCTTGTC CTGGCGGTCA TCGTCAAGCG GAACGGCCCC
GGACCCGTGT TCTTCAAGCA GGAGCGCGTC GGCAAAGCAG GCCGCCCGTT CAGGATGATC
AAGTTCCGTT CCATGGTCAC CGACGCGGAA GCGTCACTCG CGGCACTGAT GGCCCGCAAC
GAGGGCGCGG GCGTGCTGTT CAAGATGCAG AACGATCCGC GGGTGACCAG CTGCGGACGA
TGGATGCGCC GCTACTCACT CGATGAACTT CCACAGTTCT GGAACGTGCT CACCGGCGAA
ATGAGCCTGG TGGGCCCCCG GCCACCGCTG CAGCGCGAAG TGGAGGCCTA CGAACGGCAC
ACCCACCGGC GGCTGCTCAT CAAGCCCGGA ATCACCGGAC TCTGGCAGGT CAACGGACGC
TCGGACCTGC CCTGGGATGA GGCCGTGCGC CTCGATCTCT ACTACGTCGA GAACTGGTCG
ATCATGGGGG ACGTCATCAT CATGTGGCGG ACCTTCAAAG CAATGTGCAT GCCTGCGGGC
GCGTATTAA
 
Protein sequence
MSQLGKTSPF VLKFISPGVS SGTVAGPPRA AVGHSLGGTE GQLPGRRLSG REWARFLARF 
LRLTDTLVVV LSVLIGYVIR FDGGHSFSES GDNALGAVLA AGLVVLWTGA LHFYRTRDPK
VLGVGASEYK RVAVATTRVF GLLAVFLVVF RLEPAWAFYV ASFPLGLLAL VGNRWFARRW
YNRKQSDGQF LVRAIVIGQP EDVRYVLGRI AKKTGAAYEI LGVSLPGARR GMMLDVDGRR
VPVLSSTDDV VRTVGRYGAE AVIVAGPVPG GNQYIRELGW RLEEYSAELV LASTLTNVAG
PRIHWRPVEG LPLMHVDVPQ YSGGKHVLKR LLDIVAASLA LILLSPLFLV LAVIVKRNGP
GPVFFKQERV GKAGRPFRMI KFRSMVTDAE ASLAALMARN EGAGVLFKMQ NDPRVTSCGR
WMRRYSLDEL PQFWNVLTGE MSLVGPRPPL QREVEAYERH THRRLLIKPG ITGLWQVNGR
SDLPWDEAVR LDLYYVENWS IMGDVIIMWR TFKAMCMPAG AY