Gene Rcas_4266 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4266 
Symbol 
ID5541777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5506695 
End bp5508233 
Gene Length1539 bp 
Protein Length512 aa 
Translation table11 
GC content60% 
IMG OID640896373 
Productundecaprenyl-phosphate galactose phosphotransferase 
Protein accessionYP_001434311 
Protein GI156744182 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03023] Undecaprenyl-phosphate glucose phosphotransferase
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.164636 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.415736 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCGTG TTGTTCAGAA GGAGCCGACG GTGAAAAAAA CAACTGAGGC GACGACCGCC 
GCATCCGTCG CTCGTGCTCC TTTTCCAATC CGTCGTAGAC GAAACGCGCC TTCCGGCGCG
CTGCTCCCCC TCTTCGACAT CTGCCTGATT CTCGCCGGTT TTGCCATCGC CTACTGGATG
CGCTATGAAC TCGATTGGCC TCCGCCGTTC GATCAACTGG TGCGCGAGGT GCAGGCGCAG
AACTTTGTGC CGCTCAGCGC CTTTGCGCCA TTTGCGCTCC TGCTGGCTGC GCTCCTGATG
GTTCAGTTCG CCATGCGCGG GCTGTATCGC CTGCCGCGCA CCGCCGGCGT GCTCGACCAC
AGCAGCATCA TCGTCGGCTC AACCACAACC GGTATCGCCA TTCTGATCGT TGTGGTCTTT
CTGTATAAGC CTTCGGAATT CTACTCGCGC TTGATCTTTG CATTTGCCTG GGGAACCATT
ATTGCGCTCC TCGTCGGATG GCGCGCCGTG TTGATCAGCA TACGCCGCTG GCGCTGGGTG
CGCGGCATCG ACCGTGAACG GGTGCTGGTG GTCGGCAACA CCGGTCTGGG GCGCGAGGTG
ATGGAGAGCC TGGTGGCGCA ACCCGATCTG GGGTATGCGC TCGTCGGTTT TCTCGATGAT
CGGGAGCGGG CGCCCAACCG GCGAACCTTG CATTTTCGAC AGATTGGACG AATCAGCGAT
CTCGAAACCT GTCTGCGCGG CGGGGATATC GATCTGGTCA TCCTGGCGTT GCCGTTTTGG
GAGCATCATC GCCTGCCCGA CCTGGTGGCA ACCTGTCGCT ACGCAGGGGT CGAGTTCTGT
GTCGTTCCCG ATCTCTACGA GTTGAGTTTC GACCGCATCG ATATCGGCAA CCTGGGCGGT
ATTCCGCTGA TTGGCTTGAA GGCGGTCTCG CTGCGCGGCT GGAACCTGGT GGTCAAACGA
GCCATGGATC TGGCATTGAC GCTGCTGACG TTGCCGCTGG TGATCCCACT GGGAGTGGCG
ATTGCGATCA TCGTGCGCCT CGACTCGCCT GGATCGGCGA TTTTCCGGCA GCGTCGGATC
GGGCGTGATG GACGCCCGTT CATCTGTTAT AAGTTTCGCA CGATGGTGAT CGATGCCGAG
GAGCGCAAAG CCGAACTCGC TGCGTTGAAT GAAGCCGATG GTCCACTCTT CAAAATGCGG
AACGACCCGC GGATGACCCG CGTCGGGCGC GTGCTGCGAC GTTACAGCCT GGATGAACTG
CCGCAGTTGT GGAATATTCT GCGCGGTGAA ATGAGTTGGG TGGGTCCGCG TCCGGCAACG
CCGGAAGAAG TCGCGCAGTA TGAAGACTGG CATTACCGCC GGTTAACGGT TGTGCCCGGT
CTGACGGGAC TATCGCAGGT GTTGGGGCGC AGTGATATTT CGTTCGACGA AATGGTGCGC
CTCGACATCT TTTACACTGA AAACTGGACA CCCGGCATGG ATCTGCGTAT TCTGCTGCAA
ACGATTCCGG TCGTTATCTC CGGGCGTGGG GCGTATTGA
 
Protein sequence
MNRVVQKEPT VKKTTEATTA ASVARAPFPI RRRRNAPSGA LLPLFDICLI LAGFAIAYWM 
RYELDWPPPF DQLVREVQAQ NFVPLSAFAP FALLLAALLM VQFAMRGLYR LPRTAGVLDH
SSIIVGSTTT GIAILIVVVF LYKPSEFYSR LIFAFAWGTI IALLVGWRAV LISIRRWRWV
RGIDRERVLV VGNTGLGREV MESLVAQPDL GYALVGFLDD RERAPNRRTL HFRQIGRISD
LETCLRGGDI DLVILALPFW EHHRLPDLVA TCRYAGVEFC VVPDLYELSF DRIDIGNLGG
IPLIGLKAVS LRGWNLVVKR AMDLALTLLT LPLVIPLGVA IAIIVRLDSP GSAIFRQRRI
GRDGRPFICY KFRTMVIDAE ERKAELAALN EADGPLFKMR NDPRMTRVGR VLRRYSLDEL
PQLWNILRGE MSWVGPRPAT PEEVAQYEDW HYRRLTVVPG LTGLSQVLGR SDISFDEMVR
LDIFYTENWT PGMDLRILLQ TIPVVISGRG AY