Gene Rcas_2010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_2010 
Symbol 
ID5539488 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp2576295 
End bp2577407 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content53% 
IMG OID640894145 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_001432116 
Protein GI156741987 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.393522 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTCG TCACGCCAGA TTCATGCCGC CTCCCCATCA ATGCAATTAT CCAGGGTGAT 
TGCATTCAGG TTTTGCAGAT GTTCCCCGAG CAGTCGGTAG ACCTGATTTT CGCCGATCCC
CCCTACAATC TTCAGTTGCG TCACGCATTG CTCCGTCCCG ATCAAACGGT CGTGGACGGG
GTTGATGATG CCTGGGATCG GTTTGAAGAT GTTCAGGAGT ATGATGCATT CACCAGAGCC
TGGCTTGGCG CTTGTCGCAG GGTGCTCAAG GACGATGGCA CGATCTGGGT GATCGGCACG
TACCATAATA TCTTTCGTGT CGGCGCGATC ATGATGGATC TGGGGTACTG GATATTGAAC
GATGTTATCT GGCACAAAAC AAATCCGATG CCCAATTTCC GCGGCGTTCG CTTCCAGAAC
GCAACCGAAA CATTGATATG GGCGAAGAAG TCTGCCGATC AGAAGAAATA TACGTTCAAT
TACCACGCAA TGAAGCATCT GAATGAAGAG AAGCAGATGC AGAATGTCTG GCATCTTCCG
CTCTGCACCG GCGCGGAACG GGTGAAGATC AATGGCAAGA AGGCGCACTC GACGCAAAAA
CCGGAAGCGC TGTTGTACCG GGTGATACTG TCATCGAGCA ATCCTGGCGA TCTGGTCCTT
GATCCATTTT TCGGTTCGGG CACAACTGGA GTAGTCGCGC GCAGACTGAA GCGACATTAT
ATTGGCATCG AGTTGGATCC GGCGTATGTC GAAATCGCCA GAACACGGAT CGAGAAGACA
CCAGTATCCG TATGCGATGA TGCGATGCTC GCAACGCGAT CGAAACGGGA CATGCCGCGT
GTCGGTTTTG GTCAACTTGT GGAGGCGCAG TATCTTCGCG TCGGTCAGAA TCTCTACTCA
AGCGATCGAA ACGTTGTGGC AATCGTCCGC GCGGACTCAC AACTGCAATG GGGCAATATC
ACCAGCTCCA TCCACAGAAT AGCCGCGCTT GCTCAGCATA AGCCCGCCTT CAATGGCTGG
GAGTACTGGC ACTATGAGGA TCAGGCGGGT CGTCTTGTCA GCATCGATTC TCTTCGTGAA
CAATATCGGT TCGACCAGGG CGTAGCGGAT TGA
 
Protein sequence
MQVVTPDSCR LPINAIIQGD CIQVLQMFPE QSVDLIFADP PYNLQLRHAL LRPDQTVVDG 
VDDAWDRFED VQEYDAFTRA WLGACRRVLK DDGTIWVIGT YHNIFRVGAI MMDLGYWILN
DVIWHKTNPM PNFRGVRFQN ATETLIWAKK SADQKKYTFN YHAMKHLNEE KQMQNVWHLP
LCTGAERVKI NGKKAHSTQK PEALLYRVIL SSSNPGDLVL DPFFGSGTTG VVARRLKRHY
IGIELDPAYV EIARTRIEKT PVSVCDDAML ATRSKRDMPR VGFGQLVEAQ YLRVGQNLYS
SDRNVVAIVR ADSQLQWGNI TSSIHRIAAL AQHKPAFNGW EYWHYEDQAG RLVSIDSLRE
QYRFDQGVAD