Gene Rcas_3107 details

Gene Information

Locus tag: Rcas_3107
Symbol:
ID: 5540603
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4025517
End bp: 4026797
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 62%
IMG OID: 640895226
Product: glycosyl transferase group 1
Protein accession: YP_001433179
Protein GI: 156743050
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
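
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(4025517, 4026797)
print(length, protein_length_aa(length))  # prints "1281 426"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed above.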


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTTCGA TGTCACCGCG CCCGCGCACA GTTGCCTATA CGATGTCGCG CTTCCCCAAG 
CTGACCGAGA CGTTCATTCT CATCGAAATG CTCGAACTCG AACGTCAGGG AGTGCGGATC
GAAATCTTTC CGCTTATTCG GGAAAAGGCG CCGGTGCAGC ATGCCGATGC GCAGGAGATG
GTCGAGCGGG CGCACTACTG TCGGTTGTTG TCGCGCCCGA CGCTCGATGC GCAGGTCTAC
TGGCTGATGC GCCGTCCGGT CGCGTATCTG CGCGCCTGGT GGCGCGCGGT GCGCGGCAAT
CTGGCATCGC CCAAATTTCT GTCGCGCGCG CTGGTGGTCG TTCCGAAAGC GGCGTATGCA
GCGCGCCGCA TGGTCGATCT GGAGGTCGAT CATCTCCACG CGCACTATGC GACGCATCCG
GCGTTGCTCG CGTATGTGGT CAATCTGCTG ACCGGTATTC CCTACAGTTT TACGGTGCAT
GCCCACGATC TCTACGTCGA GCGCTCCATG CTGCGGGAGA AGGTCGCCGC CGCCCGTTTT
GTCGTGGCGA TTTCGGAGTT CAATCGCCGA ATGCTGATTG ATCTGTACGG ATCCGTCGCC
CAAGAGCGCG TCGTGGTGGT GCATTCCGGG ATCGATCCGA CCCTCTTTCG CCCGCGCGAG
CGGCGCGACT CCGGCGCGGT TTTTACCATC GTGTGCGTGG GCAGCCTGTC CGGCTACAAA
GGGCAGCGCT ATCTGATCGA TGCCTGTGAT TTGTTGCGCA AACGTGGGAT GGCATTTCAG
TGCCTGCTGG TTGGCGAGGG TGAGAATCGA CCGCTTCTTG AAGCGCAGAT TGAGCGCCTG
GGTCTGGAGC GACACGTTCG GCTGCTCGGT GCGCAACCAC GTCATCGGGT GAGCGACATT
CTGCAACAGG CGGATGTGAT GGTCTTGCCG AGTGTTGTGA TGCCGAACGG CAAGATGGAA
GGCATTCCCG TGGCGCTGAT GGAGGCGCTG GCATCTGAGG TTCCTGTGGT GGCAACGGCG
ATCTCCGGCA TTCCAGAACT GGTGCGCGAT GGCGAAACCG GTCTGCTCGT GCCGCAACGT
CATGCGGCGG CGCTGGCGGA TGCTCTGGCG CGCCTGTACG CCGACCGGGA CCTGGGGCGA
CGGCTGGCGG CAACCGGCAG GCAACTGGTG CTGCGCGAAT TCAATCTCGA ACGTAGTGCG
GAGCGATTGC GCATGCTGTT CGAGCGCGAC CGACGATCAT CGGAGCAGGG TTTGACTGTG
CAATCTGTTG AGGCGTCGTG A
 
Protein sequence
MSSMSPRPRT VAYTMSRFPK LTETFILIEM LELERQGVRI EIFPLIREKA PVQHADAQEM 
VERAHYCRLL SRPTLDAQVY WLMRRPVAYL RAWWRAVRGN LASPKFLSRA LVVVPKAAYA
ARRMVDLEVD HLHAHYATHP ALLAYVVNLL TGIPYSFTVH AHDLYVERSM LREKVAAARF
VVAISEFNRR MLIDLYGSVA QERVVVVHSG IDPTLFRPRE RRDSGAVFTI VCVGSLSGYK
GQRYLIDACD LLRKRGMAFQ CLLVGEGENR PLLEAQIERL GLERHVRLLG AQPRHRVSDI
LQQADVMVLP SVVMPNGKME GIPVALMEAL ASEVPVVATA ISGIPELVRD GETGLLVPQR
HAAALADALA RLYADRDLGR RLAATGRQLV LREFNLERSA ERLRMLFERD RRSSEQGLTV
QSVEAS
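
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how the two can be cross-checked: it strips the spacing and translates codons with the standard amino acid assignments, which translation table 11 shares with the standard code. The names `CODON_TABLE`, `clean`, and `translate` are hypothetical helpers. The sketch does not handle the alternative start codons that table 11 permits, which does not matter here because this gene begins with ATG.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 and last 21 nucleotides of the gene sequence above.
print(translate("ATGAGTTCGA TGTCACCGCG CCCGCGCACA"))  # prints "MSSMSPRPRT"
print(translate("CAATCTGTTG AGGCGTCGTG A"))           # prints "QSVEAS*"
```

Both excerpts match the corresponding ends of the protein sequence, with the trailing `*` marking the TGA stop codon that is not counted in the protein length.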