Gene Rcas_0496 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0496 
Symbol 
ID5537959 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp637727 
End bp638956 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content68% 
IMG OID640892658 
Productglycosyl transferase group 1 
Protein accessionYP_001430644 
Protein GI156740515 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTGG TTGTGTCGCT GGAACATCAC TTTGATCGCA CGCCGGACGG CGTGGTATGG 
ACGCAGACGC AGTTTCCGTA CCGCTTCTGG CAGCGGTATC TGGAAGTCTT CGATCAGGTG
CGCGTCGTTG CGCGGGTGCG CGATGCGCAG CGGGCAGCCC CCGACTGGCA GCGCGCCGAT
GGGCGGCGCG TCTCCTTTGC CGCCATCCCG GAGTACCTGG GTCCGCAGCA ATACCTGCTG
CGCCTGCCGC AGATTCGCGC GGCGGCACGC CGCGCGATCG GCAACGAGGA TGCCGTCATT
CTGCGGGTCA GTTCGCAGAT CGCCGGTCTG CTCCACCCTT CCCTGCGGCG TGAAGGACGG
CCGTATGGCG TTGAAGTGGT CAGCGATCCG TATGACGTGT TTGCGCCCGG TGCGGTCCGG
CATCCGCTGC GCCCGTTCTT CCGCTGGCTG TTTCCGTATC GGCTACGGCA ACAGTGCGCG
AACGCCACGG CGGCATGCTA TGTCACCGAG CAGGCGCTCC AACGGCGCTA TCCGTGCCCA
TCGTTTGCGG TGGCGGCGTC GGATGTCGAG TTGCCCGAAG AGGCGATTGT CCCGGCGCCG
CGCCCGCTGC ACGCCCCCGG CGCCCCTTGT CGCCTGATCT TCGTCGGCAC ACTGGGGCAG
TTGTACAAAG CGCCGGATGC GGTGATCGAC GCAGTGGCGG CATGCGCGCG CGCAGGCGTC
GATCTGTCTC TCACGCTGGT CGGCGACGGC AAACATCGTT CGGAGATGGA AGAGCGCGTC
CGCGCGCTGG GCATTGCAGA CCGGGTGACC TTTCGCGGGC AGGTGACGAC GGGCGCAGCG
GTGCGCGCCG AACTCGACGC GGCGGATCTG TTCGTGCTGC CGTCGCGGCA GGAAGGCATG
CCGCGCGCGA TGATCGAGGC GATGGCGCGC GCGCTTCCCT GCATTGGCTC GACAGTCGGC
GGCATCCCCG AACTGTTGCC GCCGGAAGAC CTCGTTCCTC CCGGCGATGC CCTGGCGCTG
GCGCAGGTTA TCCGTGAGAT GGTCGCCAGC CCGGAGCGGA TGGCACGCGC CTCAGCGCGC
AACCTCGAAC GCGCGCGGTC ATTCGCCGAG TCCCGGTTGC GCGAGCAACG ACTGGCGTTC
TTCCAGCGGG TGCGCGAACA GACCGAAGCC TGGCTCGCCG ACCGTCACGC AGCGACCGTA
TGGACGGCGG GACGTCGCAC CCTGCCGTGA
 
Protein sequence
MRVVVSLEHH FDRTPDGVVW TQTQFPYRFW QRYLEVFDQV RVVARVRDAQ RAAPDWQRAD 
GRRVSFAAIP EYLGPQQYLL RLPQIRAAAR RAIGNEDAVI LRVSSQIAGL LHPSLRREGR
PYGVEVVSDP YDVFAPGAVR HPLRPFFRWL FPYRLRQQCA NATAACYVTE QALQRRYPCP
SFAVAASDVE LPEEAIVPAP RPLHAPGAPC RLIFVGTLGQ LYKAPDAVID AVAACARAGV
DLSLTLVGDG KHRSEMEERV RALGIADRVT FRGQVTTGAA VRAELDAADL FVLPSRQEGM
PRAMIEAMAR ALPCIGSTVG GIPELLPPED LVPPGDALAL AQVIREMVAS PERMARASAR
NLERARSFAE SRLREQRLAF FQRVREQTEA WLADRHAATV WTAGRRTLP