Gene Rcas_1885 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1885 
Symbol 
ID5539363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2421536 
End bp2422603 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content63% 
IMG OID640894022 
Productgalactose-1-phosphate uridylyltransferase 
Protein accessionYP_001431993 
Protein GI156741864 
COG category[C] Energy production and conversion 
COG ID[COG1085] Galactose-1-phosphate uridylyltransferase 
TIGRFAM ID[TIGR00209] galactose-1-phosphate uridylyltransferase, family 1 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00396912 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGTTCTCGC TCACCGATCA TCCGCACCGA CGCTACAACC CGCTGACACG CGAGTGGGTG 
CTGGTTTCGC CGCACCGCAC CAAACGCCCC TGGCAGGGGC AGGTCGAACA ACCGCCGCCG
GAACAGCGCC CGGCGTATGA TCCCAACTGC TACCTCTGTC CCGGAAACGC CAGAGCAAAT
GGCGAAGTCA ACCCACCGTA TGAAAGCACG TTTGTGTTCA CCAACGATTT CTCGGCGCTG
CTGCCGGATA TTCCGTCCGG TGAATATGCG TCGAGCGCCG GATCGCCGGA CGATGCCGCG
CCGCCACTCC TTTACGCCCG CAGTGAGCGC GGCATCTGTC GGGTGGTATG CTTCTCGCCG
CGCCACGATC TGACACTGGC GGAGATGGAC GTGCCAGACC TTGCGCGCGT GGTCGATGTG
TGGGCGCAGG AGTATGAAGC AATCGGCGCG TTGCCCTTCA TCGGATATGT GCAGATCTTC
GAGAATCGCG GCGCGATGAT GGGCGCGAGC AACCCGCATC CCCACGGGCA GATCTGGGCT
ACCGAGCGCA TGCCATCGCT GATTGTGCGC GAAGACGCTG CCCAGTGCGA CCACCTGGCG
GCGACCGGAC GTTCATTGCT GGCGGATTAC CTGGCGCTCG AACTCGAACG CGAAGAGCGC
ATCGTCTGCG CCAATGACCA TTTCGTTGCG CTGGTTCCAT TCTGGGCAGT CTGGCCCTTC
GAGACCATAA TCATCAGCCG CCGACACGTT GGCGCGCTCA GCGATTTGAC AGCAGACGAA
CGCATGGGTC TGGCGGATGT GCTGAAACGC CTGACCACGC GCTACGATAA TCTTTTCCAG
GTGTCGTTTC CCTATTCACT TGGGTTTCAC CAACGCCCTA CCGACGGCGC GCCCCACCCG
GCATGGCATC TCCATGCGCA CGCCTACCCG CCGCTGCTGC GCTCGGCGAC GGTGCGAAAA
TTCATGGTCG GCTTCGAGTT GCTTGCCGAA GCGCAACGTG ACATCACGCC CGAACAGGCG
GCTGAGCGCC TGCGCGCGTT GCCGGAACGC CACTATCGAT CCGTGTAG
 
Protein sequence
MFSLTDHPHR RYNPLTREWV LVSPHRTKRP WQGQVEQPPP EQRPAYDPNC YLCPGNARAN 
GEVNPPYEST FVFTNDFSAL LPDIPSGEYA SSAGSPDDAA PPLLYARSER GICRVVCFSP
RHDLTLAEMD VPDLARVVDV WAQEYEAIGA LPFIGYVQIF ENRGAMMGAS NPHPHGQIWA
TERMPSLIVR EDAAQCDHLA ATGRSLLADY LALELEREER IVCANDHFVA LVPFWAVWPF
ETIIISRRHV GALSDLTADE RMGLADVLKR LTTRYDNLFQ VSFPYSLGFH QRPTDGAPHP
AWHLHAHAYP PLLRSATVRK FMVGFELLAE AQRDITPEQA AERLRALPER HYRSV