Gene Rcas_3088 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3088 
Symbol 
ID5540584 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4002527 
End bp4003747 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content56% 
IMG OID640895207 
ProductC-methyltransferase 
Protein accessionYP_001433160 
Protein GI156743031 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCATT CGACTACGCG CCACGGTGAC GATCAGCCGA CAGCCGATGC TCGCTTCTGT 
CCGGTTTGCG GATCAGCGGA TGTTCATCTC TTTATGAATG TTTCGCAGGC GCCGGTCTAT
TGTAATGTGC TGTGGGAAAC GCGCGAAGCC GCACTACGCG CGCCGAAAGG CAATATTACG
CTTGGTTATT GCGCCGAGTG CAGCCATATA TACAACTACG CCTTTGATTC GTCAAAAATG
GAGTATTCAC AGGAATATGA AAATTCACTC CACTTCTCCG GGCGTTTCCA GCAGTATGCG
ACGGAATTGA CGGAACGATT GGTTGAACGG TATGATCTGT ATGACAGGGA TATCATCGAG
ATTGGCTGCG GCAAGGGTGA TTTCTTGCGT CAGATCTGTC GTGCTGGCAA TAATCGCGGC
ATTGGATTCG ACAAAAGTTT TGTGCCCGAT CCGGCGCGCG ATGCGCTTGA CCCAAATGTG
CGCTTTGTGG TGGATTTCTT TGGGGAATCA TATGCGCACG AACCGGCCGA TCTGATCGTA
TGTCGCCACG TGCTCGAACA TATCGAGCGT CCGCGCGTGT TTATCGAGAG CCTGCGCCGC
GTGATCGGCG ATCATCGTCA GCCAACGGTG TACTTTGAGG TGCCAAATGC GCTCTGGATG
CTGCGCGACC TGGGCATTTG GGACATTATC TACGAGCATT GTTCGTACTT CAGCCCGGCG
TCGTTGACGC ATCTGTTCGA GACATCCGGC TTCGAGACGC TTGATGTGCG CGAAGCGTTC
GGCGGGCAGT TCCTGTCCAT CGAGGCGCGA CCAACATCAC ATAGCGTTTC GCCGACGGCG
CAGGCGCGGC TTGATGGTGA ACGCATGTGG CATAATGTTG CAGCGTTTGG CGACAACTAT
GGCGCAAAAG TGGAGTATTG GCGCGCGCGA CTCGGTCATC TGGCGCGGCA ATCCAGGCGC
GTGGTGGTCT GGGGCGCCGG ATCGAAGGGA GTGACGTTTC TGAACGTCTT CCGCGATCTG
AACGCTATTG AATACGTGGT CGATATCAAT CCGCGCAAAC AGGGGAAATA TGTCGCCGGC
AGCGGGCAAC AAGTCGTTGA ACCGGCATTT CTGCGCTCGT ATCAGCCCGA TGCGGTGATC
GTCATGAATG CGCTGTATGT CGATGAGATT GGGCGGACGC TCGATGCGCT CGGCGTCACT
GCCGCCGTCG AAAGCGCGTA A
 
Protein sequence
MSHSTTRHGD DQPTADARFC PVCGSADVHL FMNVSQAPVY CNVLWETREA ALRAPKGNIT 
LGYCAECSHI YNYAFDSSKM EYSQEYENSL HFSGRFQQYA TELTERLVER YDLYDRDIIE
IGCGKGDFLR QICRAGNNRG IGFDKSFVPD PARDALDPNV RFVVDFFGES YAHEPADLIV
CRHVLEHIER PRVFIESLRR VIGDHRQPTV YFEVPNALWM LRDLGIWDII YEHCSYFSPA
SLTHLFETSG FETLDVREAF GGQFLSIEAR PTSHSVSPTA QARLDGERMW HNVAAFGDNY
GAKVEYWRAR LGHLARQSRR VVVWGAGSKG VTFLNVFRDL NAIEYVVDIN PRKQGKYVAG
SGQQVVEPAF LRSYQPDAVI VMNALYVDEI GRTLDALGVT AAVESA