Gene Rcas_1939 details


Gene Information

Locus tag: Rcas_1939
Symbol:
ID: 5539417
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2482983
End bp: 2484110
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 41%
IMG OID: 640894075
Product: DNA methylase N-4/N-6 domain-containing protein
Protein accession: YP_001432046
Protein GI: 156741917
COG category: [L] Replication, recombination and repair
COG ID: [COG1041] Predicted DNA modification methylase
TIGRFAM ID:
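
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2482983
END_BP = 2484110


def gene_length_bp(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1128, matching the listed gene length
print(protein_length_aa(length))  # 375, matching the listed protein length
```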


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000888828
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000001156
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTGAGC GCATGCTCCA AAATCTTCCC CTGTTCGAAG ATGACTATTT CGGATCACCA 
GATGAGATGC TCCACCTGAA CGAAGCGATT CTGGTAGATA TTATCCAAAA ATGTCAATCT
ATCAACTCAA TGCTCAGTTT ACGCCATGCT ATTACAGAAA TTAAGCGTCG CATCTGTGAT
CCTAACGACG AGTTACGAAA TAATAGCGAT GGCACACGAA CTGCTGATTT CAGCAAAAAC
TACCTAATTT CCGAACTTGA CCAAATATCT GACTCTCTAA CAATTGATCG AGCACAATAC
TATGTTGAAC GTTTACTTAA GAGCATTCTC GAGGTAAAAA CGGTTTCCAT TAACGACATA
AACCTCAATC GGTGGAAAGA GTACGATGAT ATATATACTG ATAGTTTGTG GCTTATTGAT
CGGCGTGATA GTTCTGGAGT TCACACAGCA GGTTATTGGG GTAACTTCGT CCCCCAAATC
CCCTATCAGA TGATGCGTCG CTATACTAAG AAGGGTGATT GGGTCTTAGA CACCTTCGCA
GGATCAGGAA CGACGCTCAT TGAAGGACAG CGATTAGGAA GGAACACCAT CGGTGTCGAA
TTACAACCTC AAATGGTTGA ACATGCGCGT CGTCTTATTT CATCTGAACC CAATAGATAC
AATGTTGTTA TTGATGTTAT CAATGATGAT AGTATGAATA TCGATTACGG TGCGGTATTA
CAAAAATACG GAGCGAAGTC CGTACAACTG GTTATTATGC ACCCACCTTA TTTCGATATC
ATTAAATTTA GTCATGACCC ACGCGATCTT TCGAATGCGC CATCAGTTGA AAGGTTCTTG
GAGATGATGG GAACACTTGT TGACAGGATC AATCCAATAC TCGACAAGGA ACGATATCTC
GTGCTAGTCA TAGGTGATAA ATACGTAAAG GGTGAATGGA TACCTCTTGG CTTCCAAACC
ATGAGTGAAG TCATGAAGCG CGGTTTTTCG CTCAAAAGTA TCATTATCAA GAACTTCGAG
GATACATCCG CAAAACGCAA TCAAAAAGAG TTATGGCGCT ATCGCGCACT TGCTGGAGGT
TTCTATATAT TTAAACACGA ATATATCTTC CTCTTCAAAA AACGGTGA
 
Protein sequence
MSERMLQNLP LFEDDYFGSP DEMLHLNEAI LVDIIQKCQS INSMLSLRHA ITEIKRRICD 
PNDELRNNSD GTRTADFSKN YLISELDQIS DSLTIDRAQY YVERLLKSIL EVKTVSINDI
NLNRWKEYDD IYTDSLWLID RRDSSGVHTA GYWGNFVPQI PYQMMRRYTK KGDWVLDTFA
GSGTTLIEGQ RLGRNTIGVE LQPQMVEHAR RLISSEPNRY NVVIDVINDD SMNIDYGAVL
QKYGAKSVQL VIMHPPYFDI IKFSHDPRDL SNAPSVERFL EMMGTLVDRI NPILDKERYL
VLVIGDKYVK GEWIPLGFQT MSEVMKRGFS LKSIIIKNFE DTSAKRNQKE LWRYRALAGG
FYIFKHEYIF LFKKR
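
The gene and protein sequences above can be checked against each other by translating the DNA. The sketch below is a minimal illustration: it builds a codon table from the compact amino-acid string used for NCBI translation table 11 (the codon-to-residue assignments are the same as in the standard code; start-codon handling is ignored here) and translates only the first and last lines of the gene sequence. The names `translate`, `FIRST_LINE`, and `LAST_LINE` are illustrative.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence in this record. The last line
# starts at base 1081, so it is in frame with the start codon.
FIRST_LINE = "ATGAGTGAGC GCATGCTCCA AAATCTTCCC CTGTTCGAAG ATGACTATTT CGGATCACCA"
LAST_LINE = "TTCTATATAT TTAAACACGA ATATATCTTC CTCTTCAAAA AACGGTGA"

print(translate(FIRST_LINE))  # MSERMLQNLPLFEDDYFGSP
print(translate(LAST_LINE))   # FYIFKHEYIFLFKKR
```

Both outputs agree with the start and end of the protein sequence listed above.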