Gene Rcas_1350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_1350 
Symbol 
ID5538822 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp1727541 
End bp1728761 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content63% 
IMG OID640893487 
Productglycosyl transferase group 1 
Protein accessionYP_001431464 
Protein GI156741335 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCAAT CATTACGCAT TGTCCACATC GTCGGTAGCG CCTTTGCCGG TGGATGGGCG 
TTCCATCCGC TCTGCCGATT GCGCGATGCC GGGCACAGGG TACACCTGAT TTGCCCGGAG
GACGGTCCTC TGCCAGAACG GACGCGCGCT GCCGGCATCC CAACTCATAT CATTCCCTTT
CCGCGCCGCA TCCGGCATAT GCACTCAGCC GCAGCATATG TCGCGCGTGT GGCGGCATGG
CTGCGCCGGG AACAGATCGA TGTCGTTCAC AATCATCTAG CGCCTGCCAA TGTGTGGGGG
CGCCTGGCGG CGTATGCGGC CGGCGTTCCG GTGCGCCTGA CGCAGTGGCC CGGTCCGTTG
CCGCTTGAAA TTCCCGCCTC GCGCCGGATC GAAATGGCGC TGGCGCGTCT CGATAGCGCG
ATTATCGCGT CCAGCACCGC TACACAGCGC ATTTACGAGG CATGTGGCGT CATGCGCGAC
CGAATTCGCC TGATCTACTA CGGGTTTCCG TTCGAGCCGT TCGATCCGAC GATTGACGGC
AGTCCGATCC GGCGCGAGTT CAACATTGCG CCCGATGTGC CACTGGTGAC GATGGTCGCC
TATATGTACT CACCGTTGCA AGAACGCTCG CTCCGGGGAT TGAACGTCTT CGGCGGCATT
GGACTGAAGG GACACGAAAT CCTGATCGCC GCCGCAGCGC GTGTCCGTGA GGTCAATCCC
GCCGTGCGCT TTCTGATCGT CGGTGATTCG CTGGCGCCCG GCGAAGCGGA ACGCTATAAG
CGCAGACTTC ATCAGATGGT CGCCGATCTG GAATTGCAGC AGACGGTCAT CTTCGCCGGG
AAACGCACCG ATATTCCGTC AATTCTGGCG GCAGGCGATG TGGCAGCGGT TCCGTCGCTG
TCCGAGAACG TTGGCGGCGC GGTCGAACCG TTGCTCATGG AGCGTCCGGT GGTTGCCAGC
GCAGTCGGCG GGTTGCCCGA CGTGGTGCGC GACGGTGAAA CCGGCTACCT TGTGCCACCG
CGCGATCCGG GCGCGCTGGC GGACGCGTTG CTGCGCATGC TGGCGCTGCC TCCGGCTGCC
CGTCACGCCA TGGGGCGGAG AGGGCGCGCG ATGGTGCGGG AACTGTTCGA TCTGCCCACA
ACAGTGCGAC AAACCGAGCA ACTGTACTAC GACATGCTGA ACGCTACGGG CATGGGGGGT
CTACGATGGG CGCGCCGCTG A
 
Protein sequence
MSQSLRIVHI VGSAFAGGWA FHPLCRLRDA GHRVHLICPE DGPLPERTRA AGIPTHIIPF 
PRRIRHMHSA AAYVARVAAW LRREQIDVVH NHLAPANVWG RLAAYAAGVP VRLTQWPGPL
PLEIPASRRI EMALARLDSA IIASSTATQR IYEACGVMRD RIRLIYYGFP FEPFDPTIDG
SPIRREFNIA PDVPLVTMVA YMYSPLQERS LRGLNVFGGI GLKGHEILIA AAARVREVNP
AVRFLIVGDS LAPGEAERYK RRLHQMVADL ELQQTVIFAG KRTDIPSILA AGDVAAVPSL
SENVGGAVEP LLMERPVVAS AVGGLPDVVR DGETGYLVPP RDPGALADAL LRMLALPPAA
RHAMGRRGRA MVRELFDLPT TVRQTEQLYY DMLNATGMGG LRWARR