Gene Rcas_0073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0073 
Symbol 
ID5537532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp90026 
End bp91207 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content42% 
IMG OID640892238 
Producthypothetical protein 
Protein accessionYP_001430228 
Protein GI156740099 
COG category[R] General function prediction only 
COG ID[COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000146563 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000120063 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGGCTGGAT TAAAGCGGGA CATTAAGAGC TTATTCTATA TCACCCATAT CAACAACCTG 
GAGTCCATCT TTCAGCACGG CATCCTCTCC CATGCTCGTG TTGAAGAAAT GGGTATTCCG
TATACACCGA TATATGATGC TCAGATTGTT TCCAATCGTC GTGAGCGATC AACTCCAGAT
GGCAAAAGTC TTTGGAGATT TGCGAATCTC TACTTTCAAC CTCGCAATCC TATGTTATAT
AGGGTTATCA ACAAAATAGA TAAGAAAGAG GTGGCTATTA TTGGAGTCAG GCCAGAAGTC
CTAAGTTATC CGGGCTGCTA CATTTCAACA GGAAATGCAG CAAGTTCTCC TTCAGAGATT
TTACCTGTAA AGGAAGGAAT TAAAGCGATT TTTGAAATGT GGAAAATCAT TCATAATGAA
TGGTGGAATC CAGATGACGG ATCGAAACGA AAAATCATGG CTGAATGTCT GGTTCCTGAT
TTTGTTTCTC CTGATATGAT ACAAACAATA TATGTCGTCA ATCATGATGT GGCAAAGGAT
GTAAGGAGTC GGATTTCAGG AGCAAGAGTT CCCGTAGTCC CCGAACCCTC CATGTTCTTC
CAGCCTACCA GGCGAGAACG TGTTACCGGG AACCTGTTTT TGATAGAAGG TGATTTATTC
TTTTCAACAA TGCAGACTAT AACGATTAGT GTTAACACGA TGGGCATTAT GGGTAAAGGT
CTGGCATCGA GAGCAAAATA TCAATTTCCC GATGTATATG TAGTATATCA GGATGCATGC
CGCAGCAAGA AACTGCAAAT AGGCAAACCT TACCTGTATA AGCGGGAATC TTTCGTTGAT
GAACAGTTAG CCGATGATCC AGGATCATTA ACTAAACCTA ACTCAAATAA ATGGTTCCTC
CTTTTTGCGA CTAAGAGAAA TTGGCGAGAA AACTCAGATA TTGTTGGAAT CGAGAATGGA
TTGAAATGGA TCCAGCATAA CTACAAATCG GAGGGTATTG CGTCCCTGGC AGTTCCAGCG
TTAGGATGTG GATTAGGTGG ATTAGATTGG CGCGATGTTG GTCCACTGAT GTGTCAGTAT
CTTTCAGTGT TGGACATACC GGTAGCTATA TATTTGCCCA GGGAACGCGA AACTCCCGAG
GAATTTCTTT CGCGCGACTT TTTGTTGGGC ACAAAAGAAT GA
 
Protein sequence
MAGLKRDIKS LFYITHINNL ESIFQHGILS HARVEEMGIP YTPIYDAQIV SNRRERSTPD 
GKSLWRFANL YFQPRNPMLY RVINKIDKKE VAIIGVRPEV LSYPGCYIST GNAASSPSEI
LPVKEGIKAI FEMWKIIHNE WWNPDDGSKR KIMAECLVPD FVSPDMIQTI YVVNHDVAKD
VRSRISGARV PVVPEPSMFF QPTRRERVTG NLFLIEGDLF FSTMQTITIS VNTMGIMGKG
LASRAKYQFP DVYVVYQDAC RSKKLQIGKP YLYKRESFVD EQLADDPGSL TKPNSNKWFL
LFATKRNWRE NSDIVGIENG LKWIQHNYKS EGIASLAVPA LGCGLGGLDW RDVGPLMCQY
LSVLDIPVAI YLPRERETPE EFLSRDFLLG TKE