Gene Rcas_0399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_0399 
Symbol 
ID5537861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp505058 
End bp508372 
Gene Length3315 bp 
Protein Length1104 aa 
Translation table11 
GC content59% 
IMG OID640892562 
Producttype III restriction protein res subunit 
Protein accessionYP_001430549 
Protein GI156740420 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00303076 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGAACACTG CATCATCATT CACCGTTCAT CATTCATCAT TCCAACAACT CGAAAACCGC 
CTGGTCCTGC TGGCCTGGCT CAATAGCCTC TTTGGCTACA CGAGCAACCG CGCGTTGCTG
GAAGACTGCA AGAGCGTGGA TGAAGGTTAT GGCCCAGATG GGCGCAGTTT TCTTTACCAT
CACCTGGTTG CCCGCGGCAG TCAGGTTAAA ATCTCCAACG ACGACCTGGC GCGCTACGAC
GAAAACATCC GTCTGCACCT GGAGAAGATC AACCGTGGCC GCATTGAACG CATCACGTTG
CGCTACTTTC AATACCTGGC CGCCTTGTAC ACGGAAATCT ATCTCGACCG TCTTTTCAAC
CACCGTGGAG GGGCGGACGG CCGTCCGCCC CTACTGGCCG ACCTCAATGC CTTCGCCACA
ATGCGGAATG ATGAACGCGG AATGCGGAAT GATGAACGCG GAATGCGGAA TGATGAACGC
GGAATGCGGA ATGATGAACG CGGAATGCGG AATGATGAAC GCGGAATGCA GCGCGATTCA
TCACTCATCA TTCAGGATTC AGATTTGACG AAACTTGCCT ACTGGATGGC CACCGGCAGC
GGCAAGACGC TCATCCTGCA CCTGAACTAC CACCAGTTTC TGCACTACAA CCGCGAACCG
ATAGACAACA TCCTGCTCAT CACGCCCAAC GAAGGACTGA GCGAGCAGCA CCTGGCCGAA
CTGGCCGCCT CAGGCATCCC GGCGCGGCGC TTCGACGTGA ATGCCAGCAG CCTCTGGACT
GGCGGTCGGG ACATCGTGCA GGTCATTGAG ATCACCAAAC TTGTGGAGGA AAAGCGCGGC
GGCGGGGTGA GCGTGCCGGT GGAGGCCTTC GAGGGACGCA ACCTCATCTT CGTGGACGAG
GGACACAAGG GCGCAGGCGG CGAGGCCTGG CGCAAGTACC GCGAGGCGCT CGGCGCGACC
GGTTTCACCT TAGAATACAG CGCCACATTT GGGCAGGCGC TCTCGGCGGC GCGTAACGAC
CCGCTCACCG CCGAATACGG CAAGTCCATC CTCTTTGACT ATTCTTACAA GTATTTTTAC
AGCGATGGGT TTGGCAAAGA CTTTCGCATC CTGAACCTGC GCGATGAAAC CCGGTCCGAC
CAGACGGAAA TGCTCCTGAT GGGCAATCTG CTCTCTTTCT ACGAGCAGGT GCGGCTCTAC
GAGGAGCAAA CCGACGCCCT GCGTCCATAC AACCTGGAAA AGCCGTTATG GGTGTTTGTT
GGGAGCACGG TCAACGCCGT TTATACCGAG AACAAACAGC CGCGCAGCGA TGTGCTGACC
GTCGTGCGTT TCCTGCATCA CGCCCTGAGC GACCGCGACT GGGCGATCAA GACCATCAAA
GCCGTTCTCG GCGGCAAGAC CGGATTGGTC TCGCCGGATG GACAAGATGT ATTTGCGGGC
AGGTTCGGCT ACCTGCGCAA GACCGGTGCG ACGCCGGAGC AGCTCTATGA CGATATCCTT
CGGCGCATTT TCAACGCTCC GGCCGGCGGC GGCTTGCACC TGTGCGAGAT CAAGAGCAGC
CCTGGCGAGC TGGGACTCAA AGCGGCCAAT GCCGGGGACT ATTTCGGCTT GATTTACATC
GGCGACACAT CGGCTTTCAA GAAACTGGCG GAAGAGGAAG CCGGCGACAT TGCGCTGGAA
GGAGACGCCA TAGCCCACTC GCTCTTCGAG CGCATCAACC GCCCGGATTC CGGTCTGCAC
GTGCTCATCG GCGCCAAGAA GTTCATGGAA GGCTGGAACT CCTGGCGCGT GGCCAGCATG
GGCCTCTTGA ACATCGGGCG GCAGGAAGGT TCACAAATCA TCCAACTCTT CGGGCGCGGC
GTCCGCCTGA AGGGCAAAGC CATGAGCCTC AAACGCAGCG CCGCGCTGGA GGGCAGCCAT
CCCCCACACC TGCGCCTGCT GGAAACGCTC AACATCTTTG CCGTGCGCGC CAACTATATG
GCGCAATTCC GCGACTATTT GGAACGCGAG GGGGTGGAGG TCGAACCGCC CATCGAGCTG
CCGCTCTTTG TCTGGGCCAA TGAACAATTC CTCAAGCAGG GTCTGGTCGT GCCGCGCCCG
CCGGAGGTCC GCGATTTTAC AGCGGAAACC GCGCTTCTGC TTGAACCGGA TTCTCGCATC
CGCGTGCATG TGGATCTGTC GGTCAAGGTG CAGGCGATCG AGAGCACGCG CCTGGGCCTG
CACACCGCCG ACGTGCGCGG CGGTCGGGCG CAGGCGATTC CCGCCGAAAG CCTGGACTGG
GTGGACTGGC AGCAGGCCTA TCTCGACGTG CTGGCTTACA AAGCGCGCAA AGGATGGAGC
AATCTGGTTA TCCAACCGGA GACGCTTCGG CAGATTGTCG AACTAAAGGA TAAGGGTGTC
CCGGTTTGCA CGGTAATTGC CGACGAGCGG GTAATTCGGC CGCAATCGTT CAGAGACCGC
GCCCTGCTTC AGCAAGTCGT GACGCAACTC CTTTGCCGCT ACGTGGATCG CTTGTATCAG
ACGCGCCGCG AGCAATGGGA TGAGCAAACC ATGGTCTATC GGCCGCTTGA CGAAAACGAT
CCCAACCTGG GTTTCAGGCC GCCAGGCGTG AACGAAAAAA GGGCGGGCTA TGTGATCAGA
GTGCCGCGTT CGGAGCAGCA GTTGGTTGAA GCGGTGCGCA CGTTACTCGA GGAGCAGGAA
CGCCTGTATC AGCAAGAAAA CGCCGGCCTG CCGCGCATCC ATTTCGACCG TCACCTCTAT
CAACCCCTGC TGGTAGATAT GCCTGGGAAG GCGCAGATTG CCCCGCCTGG GCTGAAGGAG
AGCGAAGCGC GCTTTGTGTG CGATCTGCGC GACTACTGGG ACGCAGAAAA GAACAATTCG
CTGGAGGGTA AAGAGGTTTT CCTGCTGCGC AACCTGAGCC GCGGCTATGG CATAGGCTTT
TTTGAGGAAC GCGGCTTTTA TCCGGATTTC ATCCTGTGGG TGGTGGATCA GGCAACAAAA
GCCCAGCGTA TTGTCTTCAT TGAGCCGCAC GGCATGGTGC ATGCCAAAGC CTACATCCAC
GACGAAAAGG CGCGTCTGCA CGAGCGGCTG CCCGTGCTGG CAGATGAAAT CGGGAAACGG
AGCGGCCGGA CCGACATCAC GCTGGACGCC TTTATTGTAT CGGCTACGTC GTACGACGAC
CTGCATCAGC ACTATGACGA CGGAACCTGG GATCGCGCCA GGTTTGCCGA AAGACATATC
CTTTTCCAAG AGCGTGGGCA GAATTACGAT TACCTGAGGA TACTGTTCGG GCAGCTAACC
AGCGCTTCCA CCTGA
 
Protein sequence
MNTASSFTVH HSSFQQLENR LVLLAWLNSL FGYTSNRALL EDCKSVDEGY GPDGRSFLYH 
HLVARGSQVK ISNDDLARYD ENIRLHLEKI NRGRIERITL RYFQYLAALY TEIYLDRLFN
HRGGADGRPP LLADLNAFAT MRNDERGMRN DERGMRNDER GMRNDERGMR NDERGMQRDS
SLIIQDSDLT KLAYWMATGS GKTLILHLNY HQFLHYNREP IDNILLITPN EGLSEQHLAE
LAASGIPARR FDVNASSLWT GGRDIVQVIE ITKLVEEKRG GGVSVPVEAF EGRNLIFVDE
GHKGAGGEAW RKYREALGAT GFTLEYSATF GQALSAARND PLTAEYGKSI LFDYSYKYFY
SDGFGKDFRI LNLRDETRSD QTEMLLMGNL LSFYEQVRLY EEQTDALRPY NLEKPLWVFV
GSTVNAVYTE NKQPRSDVLT VVRFLHHALS DRDWAIKTIK AVLGGKTGLV SPDGQDVFAG
RFGYLRKTGA TPEQLYDDIL RRIFNAPAGG GLHLCEIKSS PGELGLKAAN AGDYFGLIYI
GDTSAFKKLA EEEAGDIALE GDAIAHSLFE RINRPDSGLH VLIGAKKFME GWNSWRVASM
GLLNIGRQEG SQIIQLFGRG VRLKGKAMSL KRSAALEGSH PPHLRLLETL NIFAVRANYM
AQFRDYLERE GVEVEPPIEL PLFVWANEQF LKQGLVVPRP PEVRDFTAET ALLLEPDSRI
RVHVDLSVKV QAIESTRLGL HTADVRGGRA QAIPAESLDW VDWQQAYLDV LAYKARKGWS
NLVIQPETLR QIVELKDKGV PVCTVIADER VIRPQSFRDR ALLQQVVTQL LCRYVDRLYQ
TRREQWDEQT MVYRPLDEND PNLGFRPPGV NEKRAGYVIR VPRSEQQLVE AVRTLLEEQE
RLYQQENAGL PRIHFDRHLY QPLLVDMPGK AQIAPPGLKE SEARFVCDLR DYWDAEKNNS
LEGKEVFLLR NLSRGYGIGF FEERGFYPDF ILWVVDQATK AQRIVFIEPH GMVHAKAYIH
DEKARLHERL PVLADEIGKR SGRTDITLDA FIVSATSYDD LHQHYDDGTW DRARFAERHI
LFQERGQNYD YLRILFGQLT SAST