Gene Rcas_1158 details


Gene Information

Locus tag: Rcas_1158
Symbol:
ID: 5538624
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 1499938
End bp: 1501149
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 63%
IMG OID: 640893290
Product: glycosyl transferase family protein
Protein accession: YP_001431273
Protein GI: 156741144
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:
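
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, not stated in the record) and that the final codon is a stop codon that does not contribute an amino acid. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1499938
end_bp = 1501149

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon per 3 bp; the last codon is assumed to be the stop codon,
# which is not counted as an amino acid.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1212 0 403
```

Under these assumptions the result agrees with the listed gene length (1212 bp) and protein length (403 aa).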


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCAGGCAT ACCTGGAAGG GTTGGCGCCG GTGTTCGATA TTCGCCTGAT CGACGTTTGG 
CCCTGGTTGC CGCTGGCGCT GTTCAGCGCA TTACTGGCTT TCTGGTACAT CCAGGATGTC
ATCGCCACCC TTCGCGCGCC GCATCTCAAC CCGCATCCCG ATCTCCCTGA CACTGGTCCC
CTGGTGACGG TTATCATTCC TGCGCGCAAT GAGGCTGCGC GCATCGGCGC CTGTCTTGAA
GGTCTGGCGC GGCAGTCGTA CCGCTCCTTT GAAGTGATTG TCGTCGATGA CGATTCGAGC
GATGGCACCG CCGATGTCGT GCGCCGGTTC GCGGCGCGTC TTCCGGCGCT CACGATCCTT
TCCTCGAAAG GTTTGCCGCA CCATTGGGCG GGCAAGTGCT GGGCATGCTG GCAGGGGGCG
AATCGGGCGC GCGGCGACTG GTTCCTCTTC CTCGATGCTG ATGTTGCGCC GCAACCAGAA
TTGCTGGCAG CGCTCGTCGA GCGCGCGACC GCCGGACGCG ATATGATCAC GCTGGTGCCG
CTCATTCACC TGACCTCTTT CGCCGAACGT CTGGTGTTGC CGCCGTTCAT CGGGTTGATA
TCCATGATCT ATCCATTCGA TCGGGTGAAC GATCCATCGT CGCCGCTGGC ATTCGCCATT
GGTCAGTGCA TTATGGTGCG GCGCGATGTC TATGCTGCTG TCGATGGGCA TCGCGCGGTG
CGCGGGAGCG TGCTGGAGGA TATGGACCTG GCGCGGCTGG TAAAGCAGTC GGGCTATGCA
CTGGATGCCG CCATCGCGCC CGACCTGCTC GATGTGCGCA TGTACAATGG CTGGGAAACG
CTCACCGAAG GCTTGAAGAA AAATGCCGTC GCCGGGTATC GGAGCGGCGG CGTTCGTTCC
GGGTTGATGG GGATGCGCCT GGGGTTGATG GCGATCATGC CATGGAACAT GCTGATCGCC
GGGCAGGCGC TGCCGCTGAT CGGCGGTGAT CCTGCGCTGG CGCAGGCGCT GATGCTGGCA
GGCGCCGCGC TGCTGATTAT CAGCGCCCTC TGCTGGGGAG CGGTTGTTCG TTATCGCTTC
CGCATCTCGC CGCTCTGGGG CACTCTCTAC GGACCGGGAA CGGCGGTTTA TTTTGCGCTG
GCAGCGCAGG CGCTGTTCCA GATCCGCAGC GGCAAAGGGG TCGAGTGGAA AGGGCGCATG
TTCCCACGCT GA
 
Protein sequence
MQAYLEGLAP VFDIRLIDVW PWLPLALFSA LLAFWYIQDV IATLRAPHLN PHPDLPDTGP 
LVTVIIPARN EAARIGACLE GLARQSYRSF EVIVVDDDSS DGTADVVRRF AARLPALTIL
SSKGLPHHWA GKCWACWQGA NRARGDWFLF LDADVAPQPE LLAALVERAT AGRDMITLVP
LIHLTSFAER LVLPPFIGLI SMIYPFDRVN DPSSPLAFAI GQCIMVRRDV YAAVDGHRAV
RGSVLEDMDL ARLVKQSGYA LDAAIAPDLL DVRMYNGWET LTEGLKKNAV AGYRSGGVRS
GLMGMRLGLM AIMPWNMLIA GQALPLIGGD PALAQALMLA GAALLIISAL CWGAVVRYRF
RISPLWGTLY GPGTAVYFAL AAQALFQIRS GKGVEWKGRM FPR
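
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11 uses the standard codon assignments and that an alternative start codon such as TTG is read as methionine at the first position. The function name `translate_table11` and the helper names are hypothetical, and only the first and last few codons of the gene are used as input here.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)

# Start codons recognised by translation table 11 (bacterial/archaeal).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiating codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break  # stop codon: end of the protein
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_table11("TTGCAGGCAT ACCTGGAAGG GTTGGCGCCG"))  # MQAYLEGLAP
# Last 12 bases of the gene sequence above (not at a start position).
print(translate_table11("TTCCCACGCT GA"))  # FPR
```

Both outputs match the corresponding ends of the protein sequence listed above. A maintained library would normally be preferable to a hand-written table for real work.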