Gene Rcas_3593 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_3593 
Symbol 
ID5541094 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp4689215 
End bp4690333 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content64% 
IMG OID640895712 
ProductDNA methylase N-4/N-6 domain-containing protein 
Protein accessionYP_001433660 
Protein GI156743531 
COG category[L] Replication, recombination and repair 
COG ID[COG0863] DNA modification methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.316349 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGTC CGCTTCTTCC GAACCGTTCG CCTGCTGAGC GCGCCATTCC CGATGACGTG 
CTGCGCCTGG CACGCACCCG CTTCACGCAT GCGCTCGCTC GACTGCGCGA ATCTGCCTCG
ATTGCCGAGT TGCGCACCGC GCTGGATGCG CTCTGGCGCG AGATCGATGA CCCGTCCGGT
GCGTTGCTCG TCGCTGCCGA TGATGCGGAG ACCACCGCCT TCAGCAAACG GCGGTTACGC
GACGCATTCG AGCAGATTGC GCGCGCCTAC ACCCTCGAAC GCGCCCGCTA CTATCTCGAC
CGTCTGGCGC GCGCCGCGGG TGAGTCCCGT ACCGGCGCGA TCAACGAGAT CGACCTCAAC
CGCTGGAAAG AATACGACGA TGTTTTGACC GATAGTTTAT GGCTGTTTGA CCGGCGCGCC
GCCGGCGGCG CGCACCATGC GGGGTTCTGG GGCAATTTTG TGCCGCAGAT CCCCTATCAA
CTGATGCTGC GGTATACCCG TCGCGGCGAC CTGGTCCTCG ACCCGTTCGC CGGTTCCGGT
ACCACGCTGA TCGAAGCGCA GCGTCTGGGT CGATTGGCGA TTGGCGTGGA ACTGAACCCG
GCCGTGGCGC AACAGACGCG CGCGACGCTG GCGCGCGAAT CCGACGTTCG TTCGGCGCTG
TGCGCGCTTG AGGTCGGCGA TAGCGCCGCC TTCGATTGGC GCGCGACGCT GGAACGCTAT
GGCGTTCGCT CGGCGCAACT TGCCATTCTG CATCCGCCGT ACCACGACAT CATTCGCTTC
AGCGACGACC CGCGCGACCT GGCGAATGCG CCGTCGGTCG ACGCCTTTCT GTCGCGTCTT
GGCGCGGTCG TGGCGCAGGT TAAAGCGGCG CTCGACGCCG GACGCTACCT GGCGCTGGTG
CTCGGCGACA AATATGCCAA CGGCGAGTGG GTCCCGCTCG GATTTCTTGG CATGCAGGAA
GTGCTGCGCC ACGGATTCAC CCTCAAGAGC ATTGTCGTCA AGAACTTCGA GCAAACGACC
GGGAAGCGCG GTCAGCACGA ACTCTGGCGC TATCGCGCGC TGGTTGGCGG ATTCTATGTC
TTCAAGCATG AGTACATTTT CATCTTCCGG AACGCCTGA
 
Protein sequence
MARPLLPNRS PAERAIPDDV LRLARTRFTH ALARLRESAS IAELRTALDA LWREIDDPSG 
ALLVAADDAE TTAFSKRRLR DAFEQIARAY TLERARYYLD RLARAAGESR TGAINEIDLN
RWKEYDDVLT DSLWLFDRRA AGGAHHAGFW GNFVPQIPYQ LMLRYTRRGD LVLDPFAGSG
TTLIEAQRLG RLAIGVELNP AVAQQTRATL ARESDVRSAL CALEVGDSAA FDWRATLERY
GVRSAQLAIL HPPYHDIIRF SDDPRDLANA PSVDAFLSRL GAVVAQVKAA LDAGRYLALV
LGDKYANGEW VPLGFLGMQE VLRHGFTLKS IVVKNFEQTT GKRGQHELWR YRALVGGFYV
FKHEYIFIFR NA