Gene Rcas_3637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3637 
Symbol 
ID5541139 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4755711 
End bp4757981 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content65% 
IMG OID640895757 
Productglycosyl transferase group 1 
Protein accessionYP_001433704 
Protein GI156743575 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase
[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.871876 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00349398 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCCCTCC CTCCCTTTCC CCTTTCCCCT TCGCTTCCCG CCTACGGCTA TGCGCCGGTC 
GATCCGGCGG CGACGCCGGT GGTGAGTATT GTCACGCCAT GTTACAACTC TGGCGCGATC
TTCCTCGATA CCGCGCGCTC GGTGCTGCGT CAGTCGTTGC AGCAGTGGGA ATGGATCATT
GTCAACGATG GGTCGGACGA TGCGGCCACG CTGCGCGTCC TGGCGTTGCT CCGCGCCGTG
AATGACCCGC GCATTCGCGT GGTGGACCGG TCCCGCTGCG GTCTTTCGGC GGCGCGCAAT
GCGGGTGTCG CCGTCAGTCG CGCACCGTTG CTCTTCTTCC TCGATAGTGA TGACCTGCTG
GCGCCGACGG CGCTCGAACA GTGCGCCTGG GCGCTGGCGT CGCGTCCGCA GAGCGCTGCC
GTGGCGACCC GGTGCGCGAC GTTCGGCGCG GCGCAGCAGG AGACGCGCCG CGGGTTTGCG
ACGCGCCATA TGTTTCCGCA CGATAATCCG CTGACAGTCA GTGTTCTGCT GCGCCGTTCG
GCGTTCGACC GCGTCGGCGG GTTCGATGAG CGTTTCCGCG ATGGGTTGGA GGATTATGAG
TTCTGGGTGC GCCTGGCAGC TGCCGGGTTG TGGGGACACG ATATTGCGGA GGCGCTGGTG
TGGCTGCGCC GCAAGGCGCC AGAGACGTAT CGCGGGTATC GCTGGACGTT TCGTGAGGAT
CGGGGGGCAA TGGCGCGTAT GCGCAATGAT CTGCGCACGC GCTATCCGCA GGTGTTCCGC
GATGGTCCGC CGCGCGTGTC TGGTGAGCCG TCCCCCATTC TGCAACCGCA CGCGCTGATC
GATCCTGAGC CGCCGTTCGC CAATCGTCTG CAACCGGTCG GAGCGCGGCG GGTGCTCATG
CTGGCGCCCT GGGTCGAGAT CGGCGGCGCG GATCGCTTTA CCATCGACCT GGCAGCCGGT
CTGCGCGCGC GTGGGTGTCG CGTGTCGGTG TGCCTGTCGC GTCCGTCGGC CAACCCCTGG
CTGAATGAGT TGCGCGGCGC TGCCGATGAG GTCTTTAATC TGCCGACGTT CCTTGCGCCT
GCCGATTATC CGCGCTTTCT GCGCTATCTG ATCGAGTCGC GCAGCATTAC GACGGTGGTG
GTGCAGAACG ATCTGTTTGC TTATCGGTTG CTGCCTTTCC TGCGCGCCTG GTGTCCTGAT
GTGCCGATTG TCGATTTTTT GCACATCGAA CAGGAGCACT ACCACGGCGG CGTGCCGCGC
GCAGCGCTGG AGTATGATCC GGTCATCGAT CTTCATATCA CATCATCGCA CCACCTGCGG
CAGTGGATGA TCGAGCGCAG CGCCGATCCG GTGCGGGTTG ATGTTTGTTA TATCAATGTG
GATACACAGC GCTGGCAACC CGATCCGGCG GCGCGCGCGC GCGTTCGCGC GGAACTGGGA
GCGCCCCCTG ATGTGCCGGT GATCCTGTTT GTCGGACGGT TGGCGCCTCA GAAGCGTCCC
CGACTGGTGG CAGAGATCGC GCGCGCGCTG ATGGAACAAG ATGTTCCCGG CATGTTTCTG
GTGATCGGCG ATGGTCCCGA CATGGGCTGG ATGCGGCGGT TTGTGCAGCG GCATCGTCTG
GAACGCCGGG TGCGGTTGTT GGGGTCGGCG CCATCGGCGC AGGTGCGGGA GATAATGGCA
GCCGCCGATA TTCTACTCTT GCCGTCGGAA CACGAAGGCA TTGCGTTTGT GCTCTTCGAG
GCGATGGCGA TGGGGCTGGC GCCGGTTGCC GCCGATGTCG GCGGGCAGCG TGAGTTGGTG
ACGCCCGACT GCGGTGTGCT CGTTCCGCTG GCGGGAGATC AGGTTGCGCA GTATGTCGAA
GCGCTGCAAC GCCTGATCGC CGATCCGCAG CGGCGCGCGG CGATGGGGCA GTCGGCGCGC
GCGCGGGTTG TGGCGCATTT TGACCAGCAG CAGATGATCG ACCGCATGCT GGAACTCTTC
GAGCAGGCGG CGACCCTGGC GCGCGATGCG CCGCGCCCGT CGGTGGATCG CGGTCTTGGT
CTGGCGACGG CGTCCCTGGC GATTGAGTAT TTTCAGTTCC GCGAGGCGCT GCTCCGGCTG
GCGCCGGTGC GTTGGGCGCG CGCGGCGCGC TGGTCCTCTG CTTGGGAAAC GGTGCGGCGG
ATTGCAGAGG TGCGCACCCT GCTCGACCGC CTTGATCGCC GGATATATGT GCTGCGTCGT
GAGGTCATGT GGCGAATCAA GCGCGCGCTG GGGAAGGAGT ATAATCAGTG A
 
Protein sequence
MPLPPFPLSP SLPAYGYAPV DPAATPVVSI VTPCYNSGAI FLDTARSVLR QSLQQWEWII 
VNDGSDDAAT LRVLALLRAV NDPRIRVVDR SRCGLSAARN AGVAVSRAPL LFFLDSDDLL
APTALEQCAW ALASRPQSAA VATRCATFGA AQQETRRGFA TRHMFPHDNP LTVSVLLRRS
AFDRVGGFDE RFRDGLEDYE FWVRLAAAGL WGHDIAEALV WLRRKAPETY RGYRWTFRED
RGAMARMRND LRTRYPQVFR DGPPRVSGEP SPILQPHALI DPEPPFANRL QPVGARRVLM
LAPWVEIGGA DRFTIDLAAG LRARGCRVSV CLSRPSANPW LNELRGAADE VFNLPTFLAP
ADYPRFLRYL IESRSITTVV VQNDLFAYRL LPFLRAWCPD VPIVDFLHIE QEHYHGGVPR
AALEYDPVID LHITSSHHLR QWMIERSADP VRVDVCYINV DTQRWQPDPA ARARVRAELG
APPDVPVILF VGRLAPQKRP RLVAEIARAL MEQDVPGMFL VIGDGPDMGW MRRFVQRHRL
ERRVRLLGSA PSAQVREIMA AADILLLPSE HEGIAFVLFE AMAMGLAPVA ADVGGQRELV
TPDCGVLVPL AGDQVAQYVE ALQRLIADPQ RRAAMGQSAR ARVVAHFDQQ QMIDRMLELF
EQAATLARDA PRPSVDRGLG LATASLAIEY FQFREALLRL APVRWARAAR WSSAWETVRR
IAEVRTLLDR LDRRIYVLRR EVMWRIKRAL GKEYNQ