Gene Rcas_1488 details

Gene Information

Locus tag: Rcas_1488
Symbol:
ID: 5538963
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1902527
End bp: 1903810
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 65%
IMG OID: 640893626
Product: glycosyl transferase group 1
Protein accession: YP_001431600
Protein GI: 156741471
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
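
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(1902527, 1903810)
print(length)                     # 1284
print(protein_length_aa(length))  # 427
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.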


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0380983
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.669661
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACATCC TGCTTCCAAC CGACGTTTTC CCCCCCGGCT GCGGCGGTGC AGGCTGGAGC 
GCACACGCGC TGGCGCTGGC GTTGATCACG CGCGGTCACA CCGTGACTGC CATTGTCCCG
CGCGAAGGAG ACCGTGGACT TGTCACCGGC GAAACGCGCG GCGTACCGAC GGTTTCTTTC
GGTTATCGCG CGCCGCGCAT CCCCTTCGTG CGCAACTACG CGCGCAACGA GCGGTTGTGG
CCCCGCCTGG CGCGGGTGAT GGTCGAAGCC AGCGCAACAC ACACGTCACA TCATCCGGCA
ATGATCATTC ATGCCCAACA CGTTCAGGTT GCGCCTGCCG CTGTGATGGC CGGGCAGCGG
CTTGGCGCAC CGGTGGTCAT CAGTGTGCGC GATCACTGGC CCTGGGATTA TTTTGCAACC
GGGTTGCACG GCGACCGCAT CCCCCATCCC CGTCAGACAT GGGCGTCGCT GGCAGCCGAT
CTGCCCGCTC GCCTGGGACC TTTGCGCGGC GCGGCGGCGC TGCCTGCCAT CCCCTATATG
CTGGCGCACC TGGCGCGCCG CCGCGCCGTC CTGCGCCAGG CAGACGCCGT GATTGCCGGC
AGTCGCTACA TCGCCGGACG CCTGGTGGAC CTGGTTGAAC CTCAACGACT GCACATTATT
CCGAATATTG TCGATCTGGC GGCAATTGAT GCCGTGATCG CCACTCCCTC GCACCTGGTT
GCGCCCGATG AGCGCTTCGT CCTCTATGTC GGCAAACTCG AACGCAATAA AGGAGCGCAT
CTGCTCGGTG AAATTGTCCG ACAGACAGGC GCAGCGCTCC ACCGCTACAC CCTCGTGATC
GCGGGTAGCG GCCCGTTGCG GACGGAACTG GAGCAAACGG TGCGCGCCGT TGGCATGCGC
GCCCGATTTC TCGACTGGAT CGACCACGAC GAGGTGTTAC GTTTGATGGC GCGATGTGAT
CTGTTGCTCT TCCCCTCGGC GTGGGGCGAG CCATTGAGCC GCGTGCTGCT CGAAGCATGC
GCCTGCGGCG CGCCTATCCT GGCAATGCCC ACCGGCGGCA CACCGGACAT TATTCTCGAT
GGAGAAAGCG GCGCACTCGC GGCAACAGTG CCTGGTTTTG CGCGTCGTCT GACCGAACTG
CTCGAACGAC CGGTCGAGCG CCAGGCGCTT GGCGCCGGAG CACGCCGCCT GGCGGCGCGT
CGCTTCGCCC CCGATATCGT TGCCGGGCAG GTGGAACGTC TCTATCAGTC GCTTGTAGCA
CCGAAGCAGT ATGCAGCGCA GTAG
 
Protein sequence
MHILLPTDVF PPGCGGAGWS AHALALALIT RGHTVTAIVP REGDRGLVTG ETRGVPTVSF 
GYRAPRIPFV RNYARNERLW PRLARVMVEA SATHTSHHPA MIIHAQHVQV APAAVMAGQR
LGAPVVISVR DHWPWDYFAT GLHGDRIPHP RQTWASLAAD LPARLGPLRG AAALPAIPYM
LAHLARRRAV LRQADAVIAG SRYIAGRLVD LVEPQRLHII PNIVDLAAID AVIATPSHLV
APDERFVLYV GKLERNKGAH LLGEIVRQTG AALHRYTLVI AGSGPLRTEL EQTVRAVGMR
ARFLDWIDHD EVLRLMARCD LLLFPSAWGE PLSRVLLEAC ACGAPILAMP TGGTPDIILD
GESGALAATV PGFARRLTEL LERPVERQAL GAGARRLAAR RFAPDIVAGQ VERLYQSLVA
PKQYAAQ
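
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and checked. The helper names are hypothetical. The start-codon test accepts only ATG, which is a simplification, since translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if seq has whole codons, starts with ATG and ends with a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First line of the gene sequence, copied from the record above.
first_line = "ATGCACATCC TGCTTCCAAC CGACGTTTTC CCCCCCGGCT GCGGCGGTGC AGGCTGGAGC"
fragment = clean_sequence(first_line)
print(len(fragment), round(gc_percent(fragment), 1))
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` gives a value to compare with the GC content field, and `looks_like_complete_cds` gives a quick sanity check on the reading frame.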