Gene Rcas_3974 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3974 
Symbol 
ID5541480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5180220 
End bp5181686 
Gene Length1467 bp 
Protein Length488 aa 
Translation table11 
GC content55% 
IMG OID640896082 
Productundecaprenyl-phosphate galactose phosphotransferase 
Protein accessionYP_001434025 
Protein GI156743896 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2148] Sugar transferases involved in lipopolysaccharide synthesis 
TIGRFAM ID[TIGR03022] Undecaprenyl-phosphate galactose phosphotransferase, WbaP
[TIGR03025] exopolysaccharide biosynthesis polyprenyl glycosylphosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0593846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00174462 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGATCT CAAAAGGAAT TGCAGCGCAC GCAAAGCCAG GACATATCGT TCGACGCGCC 
GATCGCTGGC TGTTGAACGG TGTGCTGGTT ATCAATGATA CCCTGGCGGT GCTTATCAGT
TTTGCAATCG CCTATGCGGT GCGTTTCTGG AGCGACCTGC CGATTTTCGA GGAAGGGTGG
GTCAATCCTG ATTTCTATGC GATGGTTGTG CTGGCGATGA CGCCGGTGTA TCTGGGATTG
TTTGCGGCGT ATGGTCTGTA CAATCCGGTG AATCTGCTGG GTGGTGCGAC TGAATATGCC
CGGATGTTCA ATGCTGTCAC CACTGGGGTC TTACTTGTCA TCATCGTCAG TTTCCTGGTT
CCCAACTTTA TTGTTGCGCG TGGATTTCTT ATTCTGTCGT GGGGTTTGCT GGTCATCTTC
GGCATTACCG GTCGTTTTGC GGTGCGCCGA TTTGTTTACG CTCAGCGTCA GGCAGGGCGG
TTTGTCAACC ACACGCTGAT CATCGGCGCG AATCCCGAAG GATTGGCGAT TGTCGAACAA
TTACGGTCGG CAAAAACCTG TGGGATGCGC ATCGTTGGCT TCGTCGATGA CTACCTGCCG
GTGGGCAGTG AACCAATCCC GGGTGTTCCG GTGTTGAGTG CGTCTACTGC GTTTGCGGAA
CAGATCCGGC AGCACGACAT CGATACCGTG ATTGTCGCCA ATACGGCAAT GATGCGAGAG
CAACTGCTCT CGCTCTACAG CACCCTCGAT ACCTTTCAGG ACGTTGAAGT GCGCCTGGCG
TCGGGTCTGT TCGAACTCCT GACGACCGGA GTGCGCGTTC GTGAAGAAGG TTTCGTTCCC
CTGCTGGTGC TCAACAAGAC GCGAATCACC GGCGTGCATC TGATTGCCAA GACCATACTC
GATTACACTC TGGCTGCCAC AGCAGTGATC TTCCTCATTC CTTTCTTCCT TGTTGTCGCT
TATCTGATCA AGCGCGACTC GCCCGGTCCG GTCATCTACC GGCGGCGGGT TGTCGGTCAG
GGGCGTCGTG AGTTCGATGC CCTCAAACTG CGCACCATGC ACATTGATGG CGATCGGTTG
CTGACGCCGG AGCAGAAACG CGAACTCGAA GAACACGGTA AGTTGAAGGA CGATCCGCGT
GTCACGAGAA TTGGCGCATT TCTGCGTAAG TATAGCCTCG ATGAGTTGCC GCAGTTGTTC
AATGTGCTGC GCGGTGAAAT GAGCCTGATT GGTCCGCGTA TGATCACCCG CCAGGAACTT
GAAAAATTCG GCAAATGGCA GCACAACCTC TCGACAGTGA AACCCGGTCT GACCGGCTTG
TGGCAGGTGA GCGGGCGGAG CGATCTGTCA TATGAGGATC GCGTGCGTCT GGATATGCAC
TATATCCGCA ACCATACGAT CTGGCTTGAT CTCCAGATCC TGTTTCAGAC CATACCGGCA
ATATTGACCG GTCGTGGCGC CTACTGA
 
Protein sequence
MAISKGIAAH AKPGHIVRRA DRWLLNGVLV INDTLAVLIS FAIAYAVRFW SDLPIFEEGW 
VNPDFYAMVV LAMTPVYLGL FAAYGLYNPV NLLGGATEYA RMFNAVTTGV LLVIIVSFLV
PNFIVARGFL ILSWGLLVIF GITGRFAVRR FVYAQRQAGR FVNHTLIIGA NPEGLAIVEQ
LRSAKTCGMR IVGFVDDYLP VGSEPIPGVP VLSASTAFAE QIRQHDIDTV IVANTAMMRE
QLLSLYSTLD TFQDVEVRLA SGLFELLTTG VRVREEGFVP LLVLNKTRIT GVHLIAKTIL
DYTLAATAVI FLIPFFLVVA YLIKRDSPGP VIYRRRVVGQ GRREFDALKL RTMHIDGDRL
LTPEQKRELE EHGKLKDDPR VTRIGAFLRK YSLDELPQLF NVLRGEMSLI GPRMITRQEL
EKFGKWQHNL STVKPGLTGL WQVSGRSDLS YEDRVRLDMH YIRNHTIWLD LQILFQTIPA
ILTGRGAY