Gene Rcas_2231 details


Gene Information

Locus tag: Rcas_2231
Symbol:
ID: 5539712
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2879315
End bp: 2880424
Gene Length: 1110 bp
Protein Length: 369 aa
Translation table: 11
GC content: 45%
IMG OID: 640894364
Product: DNA methylase N-4/N-6 domain-containing protein
Protein accession: YP_001432332
Protein GI: 156742203
COG category: [L] Replication, recombination and repair
COG ID: [COG0863] DNA modification methylase
TIGRFAM ID:
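
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 2879315
end_bp = 2880424
protein_length_aa = 369


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one for the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                           # 1110
print(expected_protein_length(length_bp))  # 369
```

Both results agree with the Gene Length and Protein Length entries in the table.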


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0921492
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000652955
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCCACTAC CTTTAAATCA GATTATTGAA GGTGATTGTG TGGAAATACT GAATACGTTA 
CCAGAAACAT CCATTGACCT TATTTTTGCC GATCCTCCCT ATCATTTACA GTTACAAAAC
GAACTTCATC GACCAAATAT GACGAAAGTG GACGCTGTCG ATGACGACTG GGACAAGTTC
GAGTCGATGC AAGCGTATGA TGAATTTACT CGAACGTGGT TAACGGCGTG TAAGCGGGTC
TTGAAACCAA CCGGCACCAT CTGGGTTATC GGAACGTACC ATAATATCTT TCGTGTTGGG
GCCATGATGC AGGATTTAGG GTTCTGGATC CTCAATGATG TTATCTGGAT AAAACTAAAT
CCGATGCCTA ATTTTCGTGG TGTCCGTTTT ACCAATGCCC ATGAAACCCT CATTTGGGCA
AGTACCGGTA AAGATGCAAC ATATACGTTC AACTATTACG CGATGAAAGG GTTGAACGAT
GAAAAGCAAA TGCGTTCTGA CTGGTGGCTT TTACCGTTAG CGACGGGATC GGAACGGGTA
AAAAATGAAC ATGGCGATAA AGCCCATTCC ACCCAGAAGC CGGAGGCGTT ACTGTATCGG
GTGATTTTGT CATCCAGCAA TCCCGGTGAT GTGGTGCTTG ACCCATTTTT TGGAAGTGGA
ACAACGGGTG TTGTCGCGAA ACGTTTGCAT CGAAATTGGA TTGGAATAGA AAAGGAGAAA
CGATATGTCC AGATTGCGCA AAAGCGCATT GACGCAATGC AGCCAGAGAT GTTTGACGCT
GCGACGTTTG ATGTAAAGAG CAAAGCCAAA TCTGCTCCTA AAGTGGAGTT TTCGGTTCTG
GTCGAACATG GGTATGTACA ACCTGGGCAA CGATTGTTTT TTGGAAAAGA CAAAACGAAA
GTGGCCACAA TCAAGCCTGA TGCTCGGCTC CGTACTGCGG ACGGCTTCGA AGGCAGCATC
CATCAGGCAG GTAGCCATTA CATGAACAAT GCGCCCTGTA ATGGGTGGGA GCATTGGTTT
ATCGAAGTTG ATGGTCAAAT GATCAGTCTT GACGAAGTGA GAGAAAAGTT TCGGGTAGAC
AAGGGGCTTT ACAATGAACG ATCAGGTTAA
 
Protein sequence
MPLPLNQIIE GDCVEILNTL PETSIDLIFA DPPYHLQLQN ELHRPNMTKV DAVDDDWDKF 
ESMQAYDEFT RTWLTACKRV LKPTGTIWVI GTYHNIFRVG AMMQDLGFWI LNDVIWIKLN
PMPNFRGVRF TNAHETLIWA STGKDATYTF NYYAMKGLND EKQMRSDWWL LPLATGSERV
KNEHGDKAHS TQKPEALLYR VILSSSNPGD VVLDPFFGSG TTGVVAKRLH RNWIGIEKEK
RYVQIAQKRI DAMQPEMFDA ATFDVKSKAK SAPKVEFSVL VEHGYVQPGQ RLFFGKDKTK
VATIKPDARL RTADGFEGSI HQAGSHYMNN APCNGWEHWF IEVDGQMISL DEVREKFRVD
KGLYNERSG
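
The gene and protein sequences above are printed in blocks of ten characters, so they need to be joined before any computation. The sketch below is a hedged illustration, not the database's own method: it strips the block spacing, computes a GC fraction, and translates codons with a hand-built table. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in the permitted start codons; the helper names (`clean`, `gc_content`, `translate`) are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGCCACTAC CTTTAAATCA GATTATTGAA"
print(translate(first_blocks))  # MPLPLNQIIE
print(gc_content(first_blocks))
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence to `gc_content` gives the value to compare against the GC content entry in the Gene Information section.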