Gene Rcas_1627 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1627 
Symbol 
ID5539103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2100952 
End bp2102250 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content61% 
IMG OID640893764 
Productglycosyl transferase group 1 
Protein accessionYP_001431737 
Protein GI156741608 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03087] sugar transferase, PEP-CTERM/EpsH1 system associated 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.154338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.324472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATTC TCCTGCTGAC CCAGATCGTT CCCTACCCGC CCGACAGCGG TCCGAAAGTC 
AAGACGTACC ACCTGCTGCG ACATCTGGCA GCGCGCTACA GGGTCACACT CGTCTGCTTC
ACGCGCAATG CCCAGGAAGA AGCCGATGCC AACATCCTGC GCAACCTGGT TGCCGAAGTT
CACACCGTAC CGCTTAGGCG ATCCACTGTT CGGAATGCGC TGACTCTGGC GGGCAGCCTG
GTGCGCGGAC GCTCATGGAT CATCGAACGG GATGATAGCG CCACTATGCA TCGGTTGCTG
ACACGCCTGG TGCGCGAAGC GGAGATCGCC GGACGACCGT ATGACCTGGT GCATGCGGAT
CAACTCAACA TGGCGCAGTT TGCCGAACCG TTGCCGTTGC CGCGCCTGCT CGACGAACAC
AACGCCGTGT GGACAGTCTT TCGCCGGGTA GCGGCACAGG AACGGGGTCT CAAGCGGCTC
CTCTGGGAAC GTGAGTGGCG TCAGTTGCGC ACCTACGAGG GACGGGTCTG CCGCGAGTTC
GAGGCGGTCA CAGCCGTCAG TCACGAGGAT CGTCAGGCAT TAATCGACGC GATGGGGATA
GAGCGCCATA TTCCGGTCAT TCCGATTGCC GTCGATGCCG AACGGGAACA GCCAATTGCG
CGTCAGCCGG ATGCGCGTGG CATCCTCAGC CTGGCGACGA TGATGTGGCC CCCGAATGTC
GATGGCGTGC TCTGGTTTGC CCGCAGTATC TACCCGCTGA TCAAACAACA GGTTGAAGGA
GTGCGCTTCT TTATCGTCGG GCAGCGCCCC GTACCAGAAG TGCGCGCACT GCCAGAACAA
GATCCGACGA TTGAGGTGAC CGGCTACGTG CCAGACCCCA CGCCATACAT CGCCGCCTCT
GCCTGCCTGA TCGTTCCCTT GCGCAGCGGC GGCGGCATGC GCGTGAAAAT CCTCGAAGCG
CTGGCGCGCG GCATTCCGGT CGTTTCGACC ACAATTGGCT ACGAAGGGAT CGACCTGGTT
CCCGGCGAAC ATCTGCTGGT CGGCGACACA CCAGAGGCGT TCGCCGATGC AGTCGTTCGC
CTGCTGCGCG ACCCGGACTT CGGCGCGCAA CTGGCGGCAT CCGGGCGTCG GCGCCTGCTC
GAACGCTACG ACTGGCGCGC TGTGTGCCCG GCAATGGATC GGGTGTATGA GCGGATGAAG
AACCAAGAAC CAAGAACCAA GAACCAAGAA CCAAGAACCA AGAACCAAGA ACCGGAAACT
GAGAACTGCA AGCCTATCCA AAGCGCCGAA GAGGGCTGA
 
Protein sequence
MNILLLTQIV PYPPDSGPKV KTYHLLRHLA ARYRVTLVCF TRNAQEEADA NILRNLVAEV 
HTVPLRRSTV RNALTLAGSL VRGRSWIIER DDSATMHRLL TRLVREAEIA GRPYDLVHAD
QLNMAQFAEP LPLPRLLDEH NAVWTVFRRV AAQERGLKRL LWEREWRQLR TYEGRVCREF
EAVTAVSHED RQALIDAMGI ERHIPVIPIA VDAEREQPIA RQPDARGILS LATMMWPPNV
DGVLWFARSI YPLIKQQVEG VRFFIVGQRP VPEVRALPEQ DPTIEVTGYV PDPTPYIAAS
ACLIVPLRSG GGMRVKILEA LARGIPVVST TIGYEGIDLV PGEHLLVGDT PEAFADAVVR
LLRDPDFGAQ LAASGRRRLL ERYDWRAVCP AMDRVYERMK NQEPRTKNQE PRTKNQEPET
ENCKPIQSAE EG