Gene Rcas_3648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3648 
Symbol 
ID5541150 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4774557 
End bp4776773 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content60% 
IMG OID640895768 
Productglycosyl transferase family protein 
Protein accessionYP_001433715 
Protein GI156743586 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase
[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0250737 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0159543 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATCTCT GGCTGCTCAC ATCACAATTT CCTCCGGCAG TGATCGGCGG CATTGCGCGC 
TATGTTGCCA ATGCCGCCGA TATGTTTGCC CGCGCCGGTC ATCGGGTGAC GGTGGTGACG
GTCGATCAGA CTGAAGGCGA GGAGGTTCTG CCATCGGGGG TGCGGGTCCT TCGCTTCGTG
CCGCGAGGTC GGGCGTTCGC CCCATATGTT GCCAAGGACG CGACCGATCA CAATCCGGCG
TTCCCGTACA ATATCATGGC GTATCCGATT GCACTGAGTT ATGAACTGGC GGAACGGGTT
TCGTCGTATG TGCAGCGCGA TGGCGTGCTT CCCGATATGA TCGAATGCCA GGAATATCTG
GCGCTGCCGT ATTACCTGAT CCAGCGCCGC CTGGTCGAGC GCCATCCGCT CGGTGACGTT
CCGCTGCTGT TGCACCTTCA CAGTCCCGAC TTTTGCATTA CGCGCGCCAA CCGCCAACCG
CGCTATCGTT TGCCGGGGTA CTGGATCGGG CGGATGGAAC GGTTCTGTAT GCATGCGGCG
GATGCCATGC TGTCACCGAG TTTCTTTCTT GCGAACCAGA TCGCCGGAGA GATGATGCAG
CTCTCGCACG CAATTGAGGT CGTTCCGTAC CCGTACCTAC CCTTGCCGGC GCTTGAAACT
ATGCCGGAGC GCGGCGATCT TGTGTTCTTT GGGCGGCTGG AGCCGCGGAA GGGGGTCATG
GAGTTTGTTG GAGTGTGCGA TCACCTCTGG TCACGTGGCT ACGATTTTCG GCTGACGCTC
ATAGGTGACG ACACGCCCTT CGGTCCGCGT GGAATGTCGG TTGGTGCGTG GCTACGCTGG
CGGTATGGTC GCCGGATTGA TGAAGGGCGC CTGATCATGA CCGGCGCATC GCTTCCGCCA
GAAGACCTCT GGCGGCGACT GGAAAAGGCA TGGGCGGTGG TTGTGCCTTC GACATGGGAG
AATTATCCGA ATGTCTGCAT CGAAGCGATG GCGTTGGGGA AGCTGGTGAT TGCCTCGACT
TCCGGCGGTC AGGCGGAAAT GATCGGCGCC GACGGGGATT GCGGGATACT GTTCCGTTGG
GATCAGCCGG GCGATTGCGT TGCTGCCGTG GAACGGGCGT TGGCGCTGTC GGTCGATGAG
GTGCGGGCAA TCGGTGCGCG CGCGCGGGAT CGCATCACGG CGCTGACATC GTTTGAGGCG
GTGCTGCCGC GACGGATGGC GCATTTCGAG CAGATCACGA CGTGTATGCG ACCTCGCCGA
TCTTTTCCGG GTCTGATGCC GGATGTTCCT GCACCAATGC GCACGGTGAC GCCCGACACA
GTTCCGGGGC TGCTTTCGGT GGTTGTGCCC TATCACAACC TTGGAGCATA CCTTGCCGAA
ACGATTGCCA GCATCGTCGC TTCTTCCTAT CGTCCGCTGG ACATCGTGAT CGTCGATGAC
GGAAGCGACG ATCCTGCAAG TGTGGCGGCG CTCGAACAGG TCGGTCAGAT GCATCCCGAT
CTGGTGCGGA TTGTCCGGTG TGAGCGCGGC GGGTTGGCGC GCGCGCGCAA TCGCGGCGCT
CAGGCGGCGC GTGGTGAGTT CCTGGCGTTT GTCGATGCGG ATGATCTGGT CGAGCCTTCC
TTTTTTGAGC GCGCCATCGA TGTGCTGCAA CGGTACGATA ACGTCAGTTT CGTCTATTCC
TGGGTGCGCT TCTTCGGCGA GTCCGATGCG TGCTGGCCCA CCTGGAATGT TGAGTTCCCC
TATCTTCTGG CGCACAATAT GTTGACGGCG TTTGTCGTCG TGCGGCGGAG CGATTTTCTT
GCCTGGGGTC AGAACGATCC ATCGCTCAGC GATGCGCTCG AAGATTATGA TGCCTGGATC
TCGATGGTTG AGCAGGGGTG CATTGGGGTC AGTCTCCCCG ACCCTCTGGT GCGCTACCGG
ATGCGCAGCG ACTCGATGTA TCACTCGTTG AGCGACGCGC AGATTCTGGA GATGTACGAC
CGGATCGTGG CTCGCCATCG TTCTGTGTAC GAACGCTATG GCGCGGAACT CTTTGCGTTG
CAGAACGAAA ACGGACCAGG ATGGCGCTGG AACCATCCGG CGACCGATCC GCCCGATGTT
GTGCAGCATC AGCAGATTGC CGCACTGCAA CATCGAGTCG CTCGACTCGA ACGCTTGCTG
GCGATACCAC TCAGAGTGCG CCGAACGTTG CGTGACCTGT GGAGACGTCA GAGGTGA
 
Protein sequence
MNLWLLTSQF PPAVIGGIAR YVANAADMFA RAGHRVTVVT VDQTEGEEVL PSGVRVLRFV 
PRGRAFAPYV AKDATDHNPA FPYNIMAYPI ALSYELAERV SSYVQRDGVL PDMIECQEYL
ALPYYLIQRR LVERHPLGDV PLLLHLHSPD FCITRANRQP RYRLPGYWIG RMERFCMHAA
DAMLSPSFFL ANQIAGEMMQ LSHAIEVVPY PYLPLPALET MPERGDLVFF GRLEPRKGVM
EFVGVCDHLW SRGYDFRLTL IGDDTPFGPR GMSVGAWLRW RYGRRIDEGR LIMTGASLPP
EDLWRRLEKA WAVVVPSTWE NYPNVCIEAM ALGKLVIAST SGGQAEMIGA DGDCGILFRW
DQPGDCVAAV ERALALSVDE VRAIGARARD RITALTSFEA VLPRRMAHFE QITTCMRPRR
SFPGLMPDVP APMRTVTPDT VPGLLSVVVP YHNLGAYLAE TIASIVASSY RPLDIVIVDD
GSDDPASVAA LEQVGQMHPD LVRIVRCERG GLARARNRGA QAARGEFLAF VDADDLVEPS
FFERAIDVLQ RYDNVSFVYS WVRFFGESDA CWPTWNVEFP YLLAHNMLTA FVVVRRSDFL
AWGQNDPSLS DALEDYDAWI SMVEQGCIGV SLPDPLVRYR MRSDSMYHSL SDAQILEMYD
RIVARHRSVY ERYGAELFAL QNENGPGWRW NHPATDPPDV VQHQQIAALQ HRVARLERLL
AIPLRVRRTL RDLWRRQR