Gene Rcas_3097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3097 
Symbol 
ID5540593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4014024 
End bp4015196 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content62% 
IMG OID640895216 
Productglycosyl transferase group 1 
Protein accessionYP_001433169 
Protein GI156743040 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.367624 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATTG GCGTCGCCAT GATCCTGCAC GATTTCTACC CATCCATCGG CGGGGCGCAA 
ACGCATACGC TGGCGCTCAG TCGGGCGCTG CGTGCGCGCG GCATCGATGC GATAGCGGTG
ACGCGCCCGT ATCCTGGAAC GCTGGCATAT GAAGAGGTTC AGGGCATTCC GACCTATCGC
GTTGGGATGC ACGGCGGGCG CGTGCTTGCC GGGGTGAGTT ACCTGGCCGC CGGTCTTGCG
CTGCTCATAC GTGAACGAAA CCGCTACCAG ATTCTGCACT GCCATCAAAT GATTTCACCG
ATGACGCTGG CGCTGATGGC GCGCGCGCTG CCCGAAAAAC GACTGGTTAT CAATCCGCAT
GGGCGCGGTC CGCGTGGTGA TGTGGCAAAA CTGACCAGGT TGCGCCCGCT AACGGGGAAA
CTGCGCGTTG CAGCGGCGCT GCGCTGGGGT GATGCCTTTG TTGCGATTTC CCGCGATATT
CACGATGAAT TGTGCGCGAT GGGCGTTCAG GAAGAGCGCA TATGGGATAT TGCCAATGGC
GTTGATGTGG AACGTTTTGC GCCGGCCTCA CTCGACGAGC GGACGGAACT GCGGCGCCGG
CTTGGTCTGC CGGACGGAAG ACTGGTCGTC TTCGTTGGGC GGTTGACGGT CGCCAAAGCG
CTCGATGTGC TGCTGAACGC CTGGGCGCAA CGTGATACGA CACTGGCGGA CGCGCGGCTG
ATCATTGTGG GGGATGGCGA GTTGCGCAAT GACCTCATGC GTCAGGCGCG CGATCTGGGT
GTCGAGCAGT CCGTGATGTT CGCCGGCGCA ACCAATGATA CGGCAGCATA TCTGCGCGCA
TGTGACGCAT TCGTGCTGTC TTCGCGCACA GAAGGGATGC CGGTCGCGCT GCTCGAAGCG
ATGGCATGTG GTCTGCCGTG CGTCGCCACG TGCGTTGGCG GTTCGATGGA GATCATTGAG
GATGGGGTGA ACGGGTGCCT GGTGATGCCG GAAGATGCCG GTGCGCTGGC GCGGGCAGTG
GCGCAAGCGC TTGCAACGCC AGAGTGGGGC GTCCATGCGC GGCGGCATAT TCAGGAGCGA
TACGCTATCG ATACAGTGGC ACAACGCTAT GTGGCGCTGT ACGAACGCCT CGTGAACGGT
AGGAGTGCGG GTGCTGTGCG CACTCCTGCG TAA
 
Protein sequence
MPIGVAMILH DFYPSIGGAQ THTLALSRAL RARGIDAIAV TRPYPGTLAY EEVQGIPTYR 
VGMHGGRVLA GVSYLAAGLA LLIRERNRYQ ILHCHQMISP MTLALMARAL PEKRLVINPH
GRGPRGDVAK LTRLRPLTGK LRVAAALRWG DAFVAISRDI HDELCAMGVQ EERIWDIANG
VDVERFAPAS LDERTELRRR LGLPDGRLVV FVGRLTVAKA LDVLLNAWAQ RDTTLADARL
IIVGDGELRN DLMRQARDLG VEQSVMFAGA TNDTAAYLRA CDAFVLSSRT EGMPVALLEA
MACGLPCVAT CVGGSMEIIE DGVNGCLVMP EDAGALARAV AQALATPEWG VHARRHIQER
YAIDTVAQRY VALYERLVNG RSAGAVRTPA