Gene Rcas_3106 details


Gene Information

Locus tag: Rcas_3106
Symbol:
ID: 5540602
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4024369
End bp: 4025517
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 62%
IMG OID: 640895225
Product: glycosyl transferase group 1
Protein accession: YP_001433178
Protein GI: 156743049
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
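
The coordinate and length fields in this record can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 4024369, End bp 4025517.
length = gene_length(4024369, 4025517)
print(length)                  # 1149
print(protein_length(length))  # 382
```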


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.277853
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGACAG ATTTGCCCGG GTACCCGCTG CGTGTCTTGC ACGTTCGTCC GCGTTTGGGA 
ATCGGCGGTG CAACCGAGTA TCTCATCCGG CTGGCGGAGA GCCAGGCGAA CGCGGGGTAT
CACGTGGTGA TTGCGTCGGG AGGCGGCGAC TGGCTCAGGC GCATTGCAGG GTTTGCGCGC
TCCTATGATC GGCTGCCGCT GACGCCATAT CTTGGGTCCG GGAAGCGGAC GCCGAATCTT
CCGGGGTTAC TGGCATCGGG CCTCCAACTG GCGCGCATTA TTCGCGCCGA GCAGATCGAT
CTGGTCAATA CGCACCATCG CTTTGCGGCG CTGGCGGCCA GGCTGGCGTC GCGGCTGACC
GGCACGCCGG TCGTGACAAC GTTGCAAGAA GTGCCCTGGC GGAATCGCGG TCTGACACGA
TTTTCCCTGG GAACGCAGGC TATCACGATG AGTGCAATGA TGAAACGGTT CGTCATCGAC
GTGTGCGATA TTGCGCCGGA TCGGGTCACG GTGATTCCTA TCGGCATCGA CATCCCGGCG
CCGCTCTCCA TAGATCGTCG CCGCCAGTTG CTGGCGGAAC TGCGTCTCGA TGGCGCTGCG
CCGATCATCG TGAGCGTCGG GCGCTTGGTG TCGCGCAAAG GGCATATGTA TCTGATACGG
GCGTTGCCTG AGGTGATTCG GCGCTATCCC GACGTGCAGG TGGTGCTGGT GGGCGATGGC
GAGGAGCGCG CAACGCTCGA ACGAGAAGCG CAGGCGTTGG GTGTCGCCGA CAGGGTGACG
TTTGCCGGTG CGCGGAGCGA TGCGGTCGAT CTGATGGCGC TGGCTGATTT TACCGCGCTT
CCATCGCTCG AAGAAGAGTT TGGGATTGTC ATTACCGAGT CGTTTTCGTG CGGCAAGCCG
GTGGTGGCCA CCACAGTCGG CGGCATTCCC GAGCATGTGC GCTCGATGGA AAATGGCATA
CTCGTGCCGC CGCGCGACAG CCGCGCGCTG GCGGAGGCGA TCATCTTTTT GCTCGACCAT
CCGAATATGG TGCGTCAGTT TGGCGACTGC GCTCGGCGCA TGGTTGAGCA GCAGTATACC
CGGCAACGTT TTCTGGAACG CACAGAGGCG GTCTATCGCG CGGCGCAGAT GCGGGAGGTT
GGGCGATGA
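
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction that can be compared with the GC content field of the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first block of the sequence.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First ten bases of the gene sequence: 4 G + 1 C out of 10.
print(gc_fraction("ATGGTGACAG"))  # 0.5
```

Passing the full gene sequence to `gc_fraction` gives the value to compare with the reported GC content.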
 
Protein sequence
MVTDLPGYPL RVLHVRPRLG IGGATEYLIR LAESQANAGY HVVIASGGGD WLRRIAGFAR 
SYDRLPLTPY LGSGKRTPNL PGLLASGLQL ARIIRAEQID LVNTHHRFAA LAARLASRLT
GTPVVTTLQE VPWRNRGLTR FSLGTQAITM SAMMKRFVID VCDIAPDRVT VIPIGIDIPA
PLSIDRRRQL LAELRLDGAA PIIVSVGRLV SRKGHMYLIR ALPEVIRRYP DVQVVLVGDG
EERATLEREA QALGVADRVT FAGARSDAVD LMALADFTAL PSLEEEFGIV ITESFSCGKP
VVATTVGGIP EHVRSMENGI LVPPRDSRAL AEAIIFLLDH PNMVRQFGDC ARRMVEQQYT
RQRFLERTEA VYRAAQMREV GR
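
The protein sequence follows from the gene sequence through the translation table named in the record. A minimal sketch of that translation is given below. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard code (the tables differ in permitted start codons), and as a simplification it does not convert alternative start codons such as GTG or TTG to M, which does not matter here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG order (first, second, third base), '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGGTGACAG ATTTGCCCGG GTACCCGCTG"))  # MVTDLPGYPL
```

The output matches the first ten residues of the protein sequence, and translating the final block `GGGCGATGA` yields `GR` followed by the stop codon.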