Gene Rcas_0458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0458 
Symbol 
ID5537921 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp583327 
End bp584670 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content60% 
IMG OID640892621 
Productglycosyl transferase group 1 
Protein accessionYP_001430607 
Protein GI156740478 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTG TTCTGCCGGT TCATCATTTT TTGCCGCGCT ACACCGCCGG CGCTGAGTTG 
TACACCTATC GCCTGGCGCG CTGGTTGCGC CACCATGGGC ATGATGTGGA AGTCGTCGCT
GTGGAGTCGG TCGATCATGA GAAACCGGGC GTCTTCGATG TAACGTCAGA CACGTTCGAT
GGCATTCCGG TGTATCGGTT GCATTTCAAT CGGTATGCGC AGGAGCGACA TTGGGAATAC
GATAATCCGC TCATCGGCGA ATGGTTTGCG CGTTACCTCG AACAGGAGCG CCCCGATTTG
GTTCACTTTC AGGCTGGCTA TCTGATGGGT ATCGCGCCGC TGCGAGCCGC CGTTGCCGCC
GGTCTGCCGA CTGTGCTGAC CCTGCACGAT TACTGGTTTA TCTGCCCGCG ATGGACCCTG
CTGCGCGGCG ATGGCGCGCT GTGCCGTGCC GTTCCCGATG ATCCTGCCGA ATGTGCCTGG
TGTCTGCGCC TCGACAGGCG CCGCTTCCGC CTGCCGGAGC GGGTGACAGG CGGCAGGTTC
GGGGAGATCA TGCGTCGCGC TGGCTATGAT CCGGGACGCG AGTCTATTGC GTTGCGTCGT
AAGGCGTTGC TCTCCTCGCT GGCGCTACCG GACGCCGTCG TCGCTCCTTC CCGTTTTCTG
GCGCAACAGG TGCGCGCTTA CGTGCAACCA GAACGTCTGC ACGTCTCGCG TCTTGGTCTG
GACCTGACAC TCTTCTCAGA CGTGCAGCGC GATAAAGAAC ATCGGACTCT CCGGATCGGA
TTCATTGGAC AGATCGCGCC ACACAAGGGG GTCCACCTGC TCATTGCCGC GTTTCAGAAG
GTGCAGGCGC GCTTCGGCCG CATCGAGTTG CACCTCTACG GCGGATTGAC CGCGAATCCG
GGGTATGTTG CCCGGCTGCG CCGTATGGCA GCAAAGGACC CGCGCATTCA TTTCCACGGC
AGAATCGAAA ACACGAGCGT TCCTGCGACG CTGGCGAGTC TCGATGCAAT CGTGGTTCCA
TCAATATGGT ACGAGAACTC GCCATTGGCG ATTATGGAGG CGCACGCTGC GGGAACGCCG
GTGGTAACTG CCGACATCGG CGGTATGGCT GAACTGGTGC GCGATGGCGT CGATGGTCTG
CACTTCCGCT TCAACGACGC TACCGACCTG GCGCATGTTT TGCAGCGATT GGTTGATGAG
CCTGATCTGT TGTCCCGCTT GCGATCCGGT ATTCAGCAAC CGCGTAGCAT CGATGAGGAG
ATGGTTCAGG TTCTCGCTAT CTACGATGAT GCCATCGCTC GCCGTAGCGC TCCGGTCGTG
TGTAGCAGTG AGCAGAACGA TTGA
 
Protein sequence
MKIVLPVHHF LPRYTAGAEL YTYRLARWLR HHGHDVEVVA VESVDHEKPG VFDVTSDTFD 
GIPVYRLHFN RYAQERHWEY DNPLIGEWFA RYLEQERPDL VHFQAGYLMG IAPLRAAVAA
GLPTVLTLHD YWFICPRWTL LRGDGALCRA VPDDPAECAW CLRLDRRRFR LPERVTGGRF
GEIMRRAGYD PGRESIALRR KALLSSLALP DAVVAPSRFL AQQVRAYVQP ERLHVSRLGL
DLTLFSDVQR DKEHRTLRIG FIGQIAPHKG VHLLIAAFQK VQARFGRIEL HLYGGLTANP
GYVARLRRMA AKDPRIHFHG RIENTSVPAT LASLDAIVVP SIWYENSPLA IMEAHAAGTP
VVTADIGGMA ELVRDGVDGL HFRFNDATDL AHVLQRLVDE PDLLSRLRSG IQQPRSIDEE
MVQVLAIYDD AIARRSAPVV CSSEQND