Gene Rcas_0407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0407 
Symbol 
ID5537869 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp514608 
End bp516038 
Gene Length1431 bp 
Protein Length476 aa 
Translation table11 
GC content47% 
IMG OID640892570 
Productsite-specific DNA-methyltransferase (cytosine-specific) 
Protein accessionYP_001430557 
Protein GI156740428 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.144423 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAATC CAGCCAAGGT TTCTGACATT TTGAAAGGCG AGCAGAATAT AGGCCTGAGT 
AAAATAGCCC ACTTGGATAA ACAGCTATTT CAACGTTTCA AAAGTAAATT CGCCGTTCAG
CCCTCACTCT CTCGGCTTCT GGTCAGTTTT CAAGCGAACA AAACCAGGCC TGTTTATCGC
TGGTACAAGT TCAAAGAAGC GTTCTCGGCT TCTTTAGTTG AGCATCTATT TCATAAGTAT
GGCATTACCG CAGGCAGAAT CTTGGATCCT TTTGCGGGCA GTGGGACAGC TTTATTTGCC
GCAAGTGCGA TGGGCATAGA TGCGGACGGT ATCGAACTAT TACCCATCGG TCACGAAATC
ATTACGGCCA AGCGAATTCT GGATGCAGAA TTTACATCCG AAGATTTCGA GAGATTGCGC
CGGTGGTCGG AGCTAAGAGT CTGGGAGCAA TCAGAAACGC GCGTTCCCTT GCCAGAGCTG
CGAATCACAC AAGGAGCTTA TCCAGGAAAA ACAAAAGAAG CCATTGAGAA ATATATAGGC
GCGTGCCAAC AAGAGAACAG CCGCGTCCAG GCTGTCTTGC GCTTTGCTTT ACTTTGCGTG
TTAGAGTCTA TCAGTTTTAC TCGAAAGGAT GGTCAATACC TTCGGTGGGA CTATCGTTCT
GGACGTACAC ACGGTAAGAA GATTTTCGAT AAGGGTGAAA TTCCAGAGTT CGGGCAAGCC
ATCAGCGAAA AGCTAAACGA GATTTTAGAA GATGCATCGC CCGTTCACCA AACAACTCTG
TTCCCTATTG AAAAACTCCA AGGCCAAATC TGTTTGTATG ATGGCTCATG TCTTCAAGTA
TTGCCCCGAT TATCCGATAA TGCCTACGAT GCCATTATGA CCTCACCGCC TTACTGCAAC
CGTTATGATT ATACGCGCAC GTATGCGCTG GAATTGGCGT TATTGGGCAC AGGTGAACAA
GGACTATTAC GACTCCGCCA AGAGATGCTC AGTTGCACGG TGGAGAACCG TGCAAAAGAC
CTGTTAAGTA TCAATCCGTT ATGGACAACG GCGCTGGCTG CCGCTGATGA GCAGGATCTG
TTACAGGCCA TCCTGACCTA CTTGGAAGAC CAAAAAGCGC AAAGAGCCTT GAACAACAAT
GGTATTCCCA GGATGGTCGG GGGCTATTTC TATGAAATGG CTTGCGTTAT TGCAGAATGT
GCGCGTGTTC TAAAGCCGAA CGCTCCCTTG TTTATGGTCA ATGATAATGT CCGTTATGCA
GGAGCGAGCA TTTCCGTAGA TATGATTCTC TCTGATATTG CAGAGAAATT AGGCTTTCAA
GTTGACCATA TCCTCGTTTT GCCAAACGGC AAGGGAAATA GTAGCCAGCA GATGGGGGAA
CATGGACGCG AACCCCTGCG CAAGTGTATT TACGTCTGGA GAAAATCGTG A
 
Protein sequence
MLNPAKVSDI LKGEQNIGLS KIAHLDKQLF QRFKSKFAVQ PSLSRLLVSF QANKTRPVYR 
WYKFKEAFSA SLVEHLFHKY GITAGRILDP FAGSGTALFA ASAMGIDADG IELLPIGHEI
ITAKRILDAE FTSEDFERLR RWSELRVWEQ SETRVPLPEL RITQGAYPGK TKEAIEKYIG
ACQQENSRVQ AVLRFALLCV LESISFTRKD GQYLRWDYRS GRTHGKKIFD KGEIPEFGQA
ISEKLNEILE DASPVHQTTL FPIEKLQGQI CLYDGSCLQV LPRLSDNAYD AIMTSPPYCN
RYDYTRTYAL ELALLGTGEQ GLLRLRQEML SCTVENRAKD LLSINPLWTT ALAAADEQDL
LQAILTYLED QKAQRALNNN GIPRMVGGYF YEMACVIAEC ARVLKPNAPL FMVNDNVRYA
GASISVDMIL SDIAEKLGFQ VDHILVLPNG KGNSSQQMGE HGREPLRKCI YVWRKS