Gene RoseRS_3578 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3578 
Symbol 
ID5210556 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp4474200 
End bp4475738 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content60% 
IMG OID640597172 
Productundecaprenyl-phosphate galactose phosphotransferase 
Protein accessionYP_001277884 
Protein GI148657679 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGCG TCGTTCACAA GGAGCCGACG GTGAACAAAT CAACAGAAGC GACGACAACT 
CCATCCGTCG CTCACCCTCC GCTCCCGATC CGTCGTCGCC GGATCGCGCC TTCAGGCGTG
CTGATGGCAA TAGTCGATAC CTGCCTTATC TTGCTTGGGT TTGCTCTTGC GTACTGGATG
CGCTATGTCA TCGACTGGCC CCCGCCATTC GACCAGCTGG TGCGCGAAGT TCAGGCGCAA
AACTTTGTGC CGCTCAGCGC ATTCGCTCCC TTCGCCATCC TCCTGACCGC GTTGCTGATC
GTTCAGTTCG CCATGCGGGG GCTGTACCGC ATGCCGCGTA CCGCCGGAGT GCTCGACCAT
GCCAGCATCA TCGTCGGATC GACGACGACC GGCATTGCGA TTCTGATTGT AGTTGTCTTT
CTCTATAAGC CATCGGAGTT CTACTCGCGC CTGATCTTTG CTTTTGCGCT GGTGTCGATC
AGCACACTGC TGGTCGGCGG GCGCGCAGTG CTGATCGGTC TGCGCCGCTG GCGCTGGGTA
CGCGGCATCG ACCGTGAACG GGTGCTGGTC GTCGGCAACA CCGGTCTGGG GCGTGAGGTG
ATGGAAAGCC TGGTGGCGCA ACCCGATCTG GGGTATGCGC TCGTGGGCTT TCTCGATGAT
CGTGAGACAC CGCTCAACCG TCGCACGGTT CACTTTCGGC GCCTTGGTCC GATCAGCGAT
CTTGACGTAT GTCTGCGCGG TGGCGATATC GATCTGGTAA TCCTGGCGCT CCCGTTCTGG
GAACATCATC GCCTGCCGGA ACTGGTCGAC ATATGTCGCT ATGCGGGCGT CGAGTTCCGG
GTTGTGCCCG ATCTGTACCA GTTGAGTTTT GACCGGATCG ATATCGGCAA CCTGAGCGGC
ATTCCGTTGA TCGGCTTGAA AGAAGTCTCG CTGCGTGGCT GGAACCTGGT CGTCAAACGG
ACGATGGACC TGGCGCTGAC ATTGCTGGCG TTGCCCATCG TGTTCCCGCT GGGCGTGATG
CTGGCGATCA TTGTGCGGCT CGACTCGCCA GGACCGGCGA TTTTCCGGCA GCGCCGGATC
GGGCGTGATG GGCGCCCCTT CATCTGTTAC AAGTTCCGCA CGATGGTGGT CGATGCAGAG
GAACGGAAGG CTGAACTGGC TGCCCTGAAC GAAGCCGATG GTCCGCTCTT CAAAATCCGC
AACGACCCGC GGATGACCCG CGTCGGGCGG TTTTTGCGGC GTTATAGTCT GGACGAACTG
CCGCAACTGT GGAACATCCT GCGCGGCGAT ATGAGTTGGG TTGGTCCACG TCCGGCAACT
CCGGAGGAAG TTGCACAGTA CGAAGACTGG CACTATCGCC GCCTGACGGT TGTGCCTGGA
TTGACCGGTC TGTCGCAGGT GTTGGGGCGC AGCGATATTT CATTCGACGA AACGGTGCGC
CTCGATATTT TCTACACCGA AAACTGGACC CCCGGCATGG ATCTGCGTAT TCTGCTGCAA
ACGATCCCCG TGGTTATCTC CGGGCGCGGG GCATATTGA
 
Protein sequence
MNRVVHKEPT VNKSTEATTT PSVAHPPLPI RRRRIAPSGV LMAIVDTCLI LLGFALAYWM 
RYVIDWPPPF DQLVREVQAQ NFVPLSAFAP FAILLTALLI VQFAMRGLYR MPRTAGVLDH
ASIIVGSTTT GIAILIVVVF LYKPSEFYSR LIFAFALVSI STLLVGGRAV LIGLRRWRWV
RGIDRERVLV VGNTGLGREV MESLVAQPDL GYALVGFLDD RETPLNRRTV HFRRLGPISD
LDVCLRGGDI DLVILALPFW EHHRLPELVD ICRYAGVEFR VVPDLYQLSF DRIDIGNLSG
IPLIGLKEVS LRGWNLVVKR TMDLALTLLA LPIVFPLGVM LAIIVRLDSP GPAIFRQRRI
GRDGRPFICY KFRTMVVDAE ERKAELAALN EADGPLFKIR NDPRMTRVGR FLRRYSLDEL
PQLWNILRGD MSWVGPRPAT PEEVAQYEDW HYRRLTVVPG LTGLSQVLGR SDISFDETVR
LDIFYTENWT PGMDLRILLQ TIPVVISGRG AY