Gene Rcas_3100 details


Gene Information

Locus tag: Rcas_3100
Symbol:
ID: 5540596
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 4017335
End bp: 4018312
Gene Length: 978 bp
Protein Length: 325 aa
Translation table: 11
GC content: 58%
IMG OID: 640895219
Product: glycosyl transferase family protein
Protein accession: YP_001433172
Protein GI: 156743043
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
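
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, not part of the original record; the function names are hypothetical, and it assumes inclusive 1-based coordinates and a CDS that ends in a single stop codon not counted in the protein length.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(4017335, 4018312)
print(length)                  # 978
print(protein_length(length))  # 325
```

Both values agree with the Gene Length and Protein Length fields in the record.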


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0346657
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGAAC CGCGGGTCAG TGTTGTCATT ACGGCATACA ATGCCGCCGA ATACCTCAGT 
GCTGCAATCG AGAGCGTGCT GGCACAATCG CATCCCGCAG ACGACGTCGT TGTGGTTGAT
GATGGGTCAA CCGACGCGAG TGCGGCGGTT GCGCAATCCT ATGCGCATCG TGGCGTTCGC
CTGATCCGGC AGGATAATCA GGGTCCCGGC GCTGCGCGCA ATCGCGGGAT ACGCGAGACG
AGCGGTGAGT TGGTGGCTTT TCTCGATGGC GATGATCTCT GGTTGCCGAA CAAACTTGAG
CGCCAGTTGG CGTATCTGGT AGCGCATCCC GAAACGGTCA TGGTCAGTTG CCTGCGTTGG
CGCTGGGATC AGACGACCGG CGAGCGACAC ATTGAGTATT TTGGCGTGCC GCCAGGACGC
ATCCTGGCAC ATGAGAATGT GGTGCGCAAT GTTATTGGCA ACCCATCGAT GACGCTTATC
CGGCGTTCTG TGTTCGATGC GGTCGGGATG TTCGATACGC AACTGCGCTG GGGGCAGGAT
TGGGATTTGT TTATTCGCAT TGCATCCTAC GGTCCCGTCG GTTTTGTGGA GGAGCCGCTC
ATGATCTATC GCTGGCATCC TGGCGGCATC TCGCACCATC GGGGCATCGA ACGGTTGGAT
ATGTTTCAGT CTATCGCATG TCGGGGTATT GCCCGTATTC AACCGGCGTG GCGTCGTCCG
CTTTTGCTCG CGCGACGGTT GAGTTGGGAT CAGTGTGATC GCGCGGCATA TGCATCGCAG
GTCGGGTTGT CACGGGCGCG TCGCGTATGG CACGCGGCGC TTGGATTGGC ATTATTCCCG
TTTGAACGTC CGGTGAGCAA AACAAAGCGG CTGCTGCGCT CAATCTTTGG TGATGACTCC
TACGCCGCTG TTGGGCGCAG GCTACGGTCG GTCGTGTACC GTCCCATCGG TGAACGGGAA
GGTCTTGAAC GGTTTTGA
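
The GC content reported for this gene is the percentage of G and C bases in the gene sequence. As a minimal sketch, assuming the sequence is pasted as text in the space-separated 10-base blocks shown above, it could be recomputed as follows; `clean_sequence` and `gc_content` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(seq_text: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks."""
    return "".join(seq_text.split()).upper()


def gc_content(seq_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First line of the gene sequence only; paste all lines to check the whole gene.
first_line = "ATGACAGAAC CGCGGGTCAG TGTTGTCATT ACGGCATACA ATGCCGCCGA ATACCTCAGT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_content(first_line)))
```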
 
Protein sequence
MTEPRVSVVI TAYNAAEYLS AAIESVLAQS HPADDVVVVD DGSTDASAAV AQSYAHRGVR 
LIRQDNQGPG AARNRGIRET SGELVAFLDG DDLWLPNKLE RQLAYLVAHP ETVMVSCLRW
RWDQTTGERH IEYFGVPPGR ILAHENVVRN VIGNPSMTLI RRSVFDAVGM FDTQLRWGQD
WDLFIRIASY GPVGFVEEPL MIYRWHPGGI SHHRGIERLD MFQSIACRGI ARIQPAWRRP
LLLARRLSWD QCDRAAYASQ VGLSRARRVW HAALGLALFP FERPVSKTKR LLRSIFGDDS
YAAVGRRLRS VVYRPIGERE GLERF
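
The protein sequence is the translation of the gene sequence under translation table 11 (the bacterial, archaeal and plant plastid code). Table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons, which does not matter here because the gene begins with ATG. The following is a minimal sketch of that translation in Python; the `translate` helper is hypothetical, and it assumes an in-frame sequence containing only the bases A, C, G and T.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = "".join(seq_text.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGACAGAAC CGCGGGTCAG TGTTGTCATT"))  # MTEPRVSVVI
```

Applied to the last line of the gene sequence, which is in frame, the same function yields GLERF, the final residues of the protein.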