Gene Rcas_2226 details


Gene Information

Locus tag: Rcas_2226
Symbol:
ID: 5539707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus castenholzii DSM 13941
Kingdom: Bacteria
Replicon accession: NC_009767
Strand:
Start bp: 2873088
End bp: 2875601
Gene Length: 2514 bp
Protein Length: 837 aa
Translation table: 11
GC content: 57%
IMG OID: 640894359
Product: putative uncharacterized restriction enzyme
Protein accession: YP_001432327
Protein GI: 156742198
COG category:
COG ID:
TIGRFAM ID:
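
The start, end, gene length, and protein length fields above are arithmetically related. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(2873088, 2875601)
print(length)                  # 2514
print(protein_length(length))  # 837
```

Both results match the Gene Length and Protein Length fields in the record.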


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000214566
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000161131
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGAGTAACT TCTTCCCCCC GCGCCCCAAG GTCGAGCCGA AAATCTACGC CTACAAGGAC 
ACCAACCCCG AATACGAGGG CTTGCTCAAG ATCGGCTACA CCACCAAGAG CGTGCAGGAG
CGGGTGGCGC AGCAGTATCC CACCCTGCGG CCCGGCGGCA AGCTGCCGTA TTGCATCGTG
CTGGAAGAGC CGGCTATCCG CAACGATGGC ACAGTCTTCA CCGATCGCGA CGTGCACCGC
ATGCTGCGCA TCAACGGCAT CCGCAACGAG GGCGGCGAAT GGTTCCGCTG CACCGTCGAA
CAGGTCAGGG CGGCGATTAA TGCCGTGCGA GCTGGGCAAC TCTTGGAAGA GCAGCGCTCC
CTCAACTTCA CGATGCGGCC AGAGCAGGAG GCAGCGGTGG CAAAGACCAT GGCCTACTTC
CAAAGCTACC GCCGGGAAAA CGGCAAGCCG CCGCATTTCC TCTGGAACTG CAAGATGCGC
TTCGGCAAGA CCTTCGCCGC CTATCAACTG GCCAAACGCA TGGGCTGGAA AAAGGTTCTC
GTGCTGACCT TCAAGCCAGC GGTGCAGAGC GCCTGGGAAG AGGATTTGCG CTGCCACGTG
GATTTTCAGG GCTGGCAGTT CATCAAGCCC GGCGGGCTGA CCTACGAGCA GGCAGATAAA
AACAAGCCTA TCGTCTGCTT TGGCTCGTTT CAGGACTATC TCGGCCGCAA TCCGAAGACG
GGCGGCATCA AAGCCAAGAA CGAATGGGTG CATGCCACCC ACTGGCACTG CGTCATCTTT
GATGAATACC ACTTCGGCGC TTGGCGCGAG AAGGCCAAAG ACCTGTTTGA GGGCGAGGAT
GAAGCGGAAC GGAAAGCCTC GGAAGGGGAA GCCATTGATT ACTTCGATGA GGATATCCTG
CCCATCACCT CCGATCACTA CCTGTACCTT TCCGGCACAC CGTTCCGGGC CATTGCCACC
GGCGAGTTCA TCGAAGAGCA AATTTACAAC TGGACATATT CTGATGAACA GAAGGCAAAA
GAGGAGTGGG ACGACAGCAA CGGCCCCAAC CCCTATGCCG CATTGCCGCG CATGGTGCTG
ATGACCTATC AACTGCCAGA CGCCATTCGC GAAGTGGCGA TGCAGGGAGA GTTCAACGAG
TTCGATCTGA ACGTGTTCTT TTCGGCCGAG GGCGTGGGCG ACAAGGCGCG GTTCAAATAC
GAAGACGAGG TGCAGAAGTG GTTGGACCTC ATCCGTGGGG CATACCTGCC TACAAACCTG
GATAACCTCC GTTTGGGAGC GCAGAAGCCG CCGCTCCCCT ACTCGGATGT GCGGCTGCTG
AATGTGCTCA CCCATACGGT CTGGTTTTTG CCCAGCGTGG CCGCCTGCTA CGCCATGCGC
AATCTCTTAG CCAAACCACA CAACAAGTTC TATCACGATT ACAAGGCCAT CGTGGCTGCC
GGCGCCGCTG CGGGCATCGG CGTCAATGCC CTCCCGCCGG TGCTGGACGC CATGGGCGAT
CCGCTCAAGA CGAAAACCAT CACCCTCACC TGCGGCAAAC TGACCACCGG CGTCACGGTT
CGGCCCTGGA CGGGCATCTT CATGCTGCGC AACACATCTA GCCCAGAAAC CTACTTCCAG
GCCGCCTTCC GCGTGCAGAG TCCATGGACG ATTCAGAATC CTGACGGAAC CTCACCCAAC
GCGGAGCTGA TTCTCAAAGA AGAGTGCTAT GTGTTCGATT TTGCGCCGGA TCGCGCCCTG
CGGCAGATTG CCGACTACAG TTGCCGCCTG AATGTGAATG AGGATGACCC GGAAAAGAAA
GTGGAGGAAT TTATCCACTT CCTGCCGGTG CTGGCGTATG ATGGGAGTTC AATGCGACAG
ATTGATGCCG CCGGCGTGCT GGAAATGGCC ATGAGCGGCA CCACCGCCAC CCTGCTGGCC
CGCCGCTGGG AAAGCGCGCT ATTGGTGAAT GTGGACAATG ACACTCTGCG GCGGCTGATA
ACCAACGAAC AGGCGATGCA GGCGCTGATG AACATTGAGG GCTTCCGCAA CCTCAATCAG
GAGATTGAAA CCATCATCAA CAAGTCGGAG GCCGTGAAAC GGCTGAAAAA AGAGGCCAAC
GACCGAGCGC TTTCCGCTCA AGAAAAGCGT GAGCTGACCG AAGAGGAGAA GGAATTCAAG
AGCAAGCGCA AACAGATTCA GGAAAAGCTC ATCAAGCTGG CGACGCGCAT CCCCATTTTT
ATGTATTTGA CCGACCATCG GGAACGAACG CTGCGCGATG TGATCACCCA ATTGGAACCG
GGATTGTTCA AGAAAGTTAC GGGGCTGACG GTGAAAGACT TTGAACTCCT CGTCAGCCTC
GGCGTCTTCA ACAGCGCGCT GATGAACGAT GCAGTGTACA AGTTCAAGCG GTATGAAGAC
CCAAGTCTGG TTTATACGGG TATTAACCGT CACGAAGGAG AAGATGTCGG ACTGTACGAT
ACCGTCTTGC GCCGCGCGGA ATATGAGGCG ACATTGGTGC ATGAGGAAAG ATAG
 
Protein sequence
MSNFFPPRPK VEPKIYAYKD TNPEYEGLLK IGYTTKSVQE RVAQQYPTLR PGGKLPYCIV 
LEEPAIRNDG TVFTDRDVHR MLRINGIRNE GGEWFRCTVE QVRAAINAVR AGQLLEEQRS
LNFTMRPEQE AAVAKTMAYF QSYRRENGKP PHFLWNCKMR FGKTFAAYQL AKRMGWKKVL
VLTFKPAVQS AWEEDLRCHV DFQGWQFIKP GGLTYEQADK NKPIVCFGSF QDYLGRNPKT
GGIKAKNEWV HATHWHCVIF DEYHFGAWRE KAKDLFEGED EAERKASEGE AIDYFDEDIL
PITSDHYLYL SGTPFRAIAT GEFIEEQIYN WTYSDEQKAK EEWDDSNGPN PYAALPRMVL
MTYQLPDAIR EVAMQGEFNE FDLNVFFSAE GVGDKARFKY EDEVQKWLDL IRGAYLPTNL
DNLRLGAQKP PLPYSDVRLL NVLTHTVWFL PSVAACYAMR NLLAKPHNKF YHDYKAIVAA
GAAAGIGVNA LPPVLDAMGD PLKTKTITLT CGKLTTGVTV RPWTGIFMLR NTSSPETYFQ
AAFRVQSPWT IQNPDGTSPN AELILKEECY VFDFAPDRAL RQIADYSCRL NVNEDDPEKK
VEEFIHFLPV LAYDGSSMRQ IDAAGVLEMA MSGTTATLLA RRWESALLVN VDNDTLRRLI
TNEQAMQALM NIEGFRNLNQ EIETIINKSE AVKRLKKEAN DRALSAQEKR ELTEEEKEFK
SKRKQIQEKL IKLATRIPIF MYLTDHRERT LRDVITQLEP GLFKKVTGLT VKDFELLVSL
GVFNSALMND AVYKFKRYED PSLVYTGINR HEGEDVGLYD TVLRRAEYEA TLVHEER