Gene Rcas_1988 details


Gene Information

Locus tag: Rcas_1988
Symbol:
ID: 5539466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2552774
End bp: 2553898
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 65%
IMG OID: 640894123
Product: glycosyl transferase group 1
Protein accession: YP_001432094
Protein GI: 156741965
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
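
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record, and the function names are illustrative only.

```python
# Values copied from the gene record above.
start_bp = 2552774
end_bp = 2553898
gene_length_bp = 1125
protein_length_aa = 374


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)              # True
print(expected_protein_length(gene_length_bp) == protein_length_aa)  # True
```

Under these assumptions the fields agree: 2553898 - 2552774 + 1 = 1125 bp, and 1125 / 3 = 375 codons, that is, 374 amino acids plus the stop codon.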


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.590311
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.0366847
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCGCA CATCTATCGT GCCGACGCTG TCGGATCACT GCTCCACCCC TCATCTTTCC 
GTCGTCTGGC AGTCGCGCTG GGGCATCCCC ATCGGATACG CTGTCTCTTC CGAGGCGCTG
GCGCTCGCAC TGGCTGCGCG CGGCGTCGAG TTGTCGTACC GCCCGACGCC GTGGCATATG
CCGGGAACAA TCCGCTACCC GGCGCTGCTG GCTGCCGCAG CGCGCGCGCC GCGCGACGAC
GTGCCGCAGG TCAGCTACGA CCAGGCAGAC CTGTTCTACA CCCGTCACGC GGGGTACAAA
ATCGGTTACA CCATGCTGGA AGTCGATGGA CTGCCGCGTG AGTGGGTCGC GGCGTGCAAT
GCCATGGACG AGGTCTGGAC GCCAAGCCGC TGGGGTGCGA CGGTCTTTGC CAATGCCGGA
GTGACGCGCC CGATCTATGT CATGCCGCTG GGGTATGATC CGGTGTGTTT TCGACCGGAT
GGACCGGCGC GCCGGATTGC GGAACGCTTC ACGTTTCTCT CGGTCTTTGA GTGGGGCGAA
CGCAAAGCGC CGGACATCCT GCTGCGCGCC TATGCAGCGT CGTTCACGCG ACGCGATGAC
GTGGCGCTGC TGTTGCGGGT GAACAATTTT GACGCAGAGG TCGATGTTGC GCGACAGATC
GCCGCGCTGC GCCTTCCCGC CGATGCTCCG CCGATTGCCC TGCTCTACAA CCGTTACATC
AGCGACGAGA GCCTGGGAGC GCTCTACCGC AGCGCCGATT GTTTTGTGCT GCCGACGCGC
GGTGAAGGAT GGGGGCTGCC GATCCTCGAA GCGATGGCGT GTGGGTTGCC GGTGATTGCC
ACCGACTGGA GCGGGCAAAC CGAATTCTTC CACGGCGGAG TCGGGTACCC GGTGCGGGTG
CGGCGGCTGG TTGCCGCCGA CGCCAAATGC CCCTACTACC TGGGCTGGCG CTGGGCGGAG
CCGGACATCG AGCACCTGAT CGCGCTGATG CGCCATGTGT ACGAGCATCC CAACGAAGCG
CGGGTGGTTG GCGCGCGCGC GGCGCAAGAA GCGGCAACGC GCTGGACGTG GGCGCACGCA
GCGGAGCGGA TTCACAGGCG GTTAATGGAT GCGACTGATG GGTAA
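
A GC content figure such as the one in the Gene Information fields can be recomputed from a nucleotide sequence in this 10-base block layout. The record does not say how its value was derived or rounded, so the following is only a minimal sketch; `gc_percent` is a hypothetical helper, not part of any database tool.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters (such as the spaces between
    10-base blocks) are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence above: 3 G + 3 C out of 10 bases.
print(gc_percent("ATGAGCCGCA"))  # 60.0
```

Passing the full gene sequence as one string (line breaks and spaces included) gives the whole-gene percentage for comparison with the reported value.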
 
Protein sequence
MSRTSIVPTL SDHCSTPHLS VVWQSRWGIP IGYAVSSEAL ALALAARGVE LSYRPTPWHM 
PGTIRYPALL AAAARAPRDD VPQVSYDQAD LFYTRHAGYK IGYTMLEVDG LPREWVAACN
AMDEVWTPSR WGATVFANAG VTRPIYVMPL GYDPVCFRPD GPARRIAERF TFLSVFEWGE
RKAPDILLRA YAASFTRRDD VALLLRVNNF DAEVDVARQI AALRLPADAP PIALLYNRYI
SDESLGALYR SADCFVLPTR GEGWGLPILE AMACGLPVIA TDWSGQTEFF HGGVGYPVRV
RRLVAADAKC PYYLGWRWAE PDIEHLIALM RHVYEHPNEA RVVGARAAQE AATRWTWAHA
AERIHRRLMD ATDG