Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_0399 |
Symbol | |
ID | 5537861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 505058 |
End bp | 508372 |
Gene Length | 3315 bp |
Protein Length | 1104 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640892562 |
Product | type III restriction protein res subunit |
Protein accession | YP_001430549 |
Protein GI | 156740420 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00303076 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAACACTG CATCATCATT CACCGTTCAT CATTCATCAT TCCAACAACT CGAAAACCGC CTGGTCCTGC TGGCCTGGCT CAATAGCCTC TTTGGCTACA CGAGCAACCG CGCGTTGCTG GAAGACTGCA AGAGCGTGGA TGAAGGTTAT GGCCCAGATG GGCGCAGTTT TCTTTACCAT CACCTGGTTG CCCGCGGCAG TCAGGTTAAA ATCTCCAACG ACGACCTGGC GCGCTACGAC GAAAACATCC GTCTGCACCT GGAGAAGATC AACCGTGGCC GCATTGAACG CATCACGTTG CGCTACTTTC AATACCTGGC CGCCTTGTAC ACGGAAATCT ATCTCGACCG TCTTTTCAAC CACCGTGGAG GGGCGGACGG CCGTCCGCCC CTACTGGCCG ACCTCAATGC CTTCGCCACA ATGCGGAATG ATGAACGCGG AATGCGGAAT GATGAACGCG GAATGCGGAA TGATGAACGC GGAATGCGGA ATGATGAACG CGGAATGCGG AATGATGAAC GCGGAATGCA GCGCGATTCA TCACTCATCA TTCAGGATTC AGATTTGACG AAACTTGCCT ACTGGATGGC CACCGGCAGC GGCAAGACGC TCATCCTGCA CCTGAACTAC CACCAGTTTC TGCACTACAA CCGCGAACCG ATAGACAACA TCCTGCTCAT CACGCCCAAC GAAGGACTGA GCGAGCAGCA CCTGGCCGAA CTGGCCGCCT CAGGCATCCC GGCGCGGCGC TTCGACGTGA ATGCCAGCAG CCTCTGGACT GGCGGTCGGG ACATCGTGCA GGTCATTGAG ATCACCAAAC TTGTGGAGGA AAAGCGCGGC GGCGGGGTGA GCGTGCCGGT GGAGGCCTTC GAGGGACGCA ACCTCATCTT CGTGGACGAG GGACACAAGG GCGCAGGCGG CGAGGCCTGG CGCAAGTACC GCGAGGCGCT CGGCGCGACC GGTTTCACCT TAGAATACAG CGCCACATTT GGGCAGGCGC TCTCGGCGGC GCGTAACGAC CCGCTCACCG CCGAATACGG CAAGTCCATC CTCTTTGACT ATTCTTACAA GTATTTTTAC AGCGATGGGT TTGGCAAAGA CTTTCGCATC CTGAACCTGC GCGATGAAAC CCGGTCCGAC CAGACGGAAA TGCTCCTGAT GGGCAATCTG CTCTCTTTCT ACGAGCAGGT GCGGCTCTAC GAGGAGCAAA CCGACGCCCT GCGTCCATAC AACCTGGAAA AGCCGTTATG GGTGTTTGTT GGGAGCACGG TCAACGCCGT TTATACCGAG AACAAACAGC CGCGCAGCGA TGTGCTGACC GTCGTGCGTT TCCTGCATCA CGCCCTGAGC GACCGCGACT GGGCGATCAA GACCATCAAA GCCGTTCTCG GCGGCAAGAC CGGATTGGTC TCGCCGGATG GACAAGATGT ATTTGCGGGC AGGTTCGGCT ACCTGCGCAA GACCGGTGCG ACGCCGGAGC AGCTCTATGA CGATATCCTT CGGCGCATTT TCAACGCTCC GGCCGGCGGC GGCTTGCACC TGTGCGAGAT CAAGAGCAGC CCTGGCGAGC TGGGACTCAA AGCGGCCAAT GCCGGGGACT ATTTCGGCTT GATTTACATC GGCGACACAT CGGCTTTCAA GAAACTGGCG GAAGAGGAAG CCGGCGACAT TGCGCTGGAA GGAGACGCCA TAGCCCACTC GCTCTTCGAG CGCATCAACC GCCCGGATTC CGGTCTGCAC GTGCTCATCG GCGCCAAGAA GTTCATGGAA GGCTGGAACT CCTGGCGCGT GGCCAGCATG GGCCTCTTGA ACATCGGGCG GCAGGAAGGT TCACAAATCA TCCAACTCTT CGGGCGCGGC GTCCGCCTGA AGGGCAAAGC CATGAGCCTC AAACGCAGCG CCGCGCTGGA GGGCAGCCAT CCCCCACACC TGCGCCTGCT GGAAACGCTC AACATCTTTG CCGTGCGCGC CAACTATATG GCGCAATTCC GCGACTATTT GGAACGCGAG GGGGTGGAGG TCGAACCGCC CATCGAGCTG CCGCTCTTTG TCTGGGCCAA TGAACAATTC CTCAAGCAGG GTCTGGTCGT GCCGCGCCCG CCGGAGGTCC GCGATTTTAC AGCGGAAACC GCGCTTCTGC TTGAACCGGA TTCTCGCATC CGCGTGCATG TGGATCTGTC GGTCAAGGTG CAGGCGATCG AGAGCACGCG CCTGGGCCTG CACACCGCCG ACGTGCGCGG CGGTCGGGCG CAGGCGATTC CCGCCGAAAG CCTGGACTGG GTGGACTGGC AGCAGGCCTA TCTCGACGTG CTGGCTTACA AAGCGCGCAA AGGATGGAGC AATCTGGTTA TCCAACCGGA GACGCTTCGG CAGATTGTCG AACTAAAGGA TAAGGGTGTC CCGGTTTGCA CGGTAATTGC CGACGAGCGG GTAATTCGGC CGCAATCGTT CAGAGACCGC GCCCTGCTTC AGCAAGTCGT GACGCAACTC CTTTGCCGCT ACGTGGATCG CTTGTATCAG ACGCGCCGCG AGCAATGGGA TGAGCAAACC ATGGTCTATC GGCCGCTTGA CGAAAACGAT CCCAACCTGG GTTTCAGGCC GCCAGGCGTG AACGAAAAAA GGGCGGGCTA TGTGATCAGA GTGCCGCGTT CGGAGCAGCA GTTGGTTGAA GCGGTGCGCA CGTTACTCGA GGAGCAGGAA CGCCTGTATC AGCAAGAAAA CGCCGGCCTG CCGCGCATCC ATTTCGACCG TCACCTCTAT CAACCCCTGC TGGTAGATAT GCCTGGGAAG GCGCAGATTG CCCCGCCTGG GCTGAAGGAG AGCGAAGCGC GCTTTGTGTG CGATCTGCGC GACTACTGGG ACGCAGAAAA GAACAATTCG CTGGAGGGTA AAGAGGTTTT CCTGCTGCGC AACCTGAGCC GCGGCTATGG CATAGGCTTT TTTGAGGAAC GCGGCTTTTA TCCGGATTTC ATCCTGTGGG TGGTGGATCA GGCAACAAAA GCCCAGCGTA TTGTCTTCAT TGAGCCGCAC GGCATGGTGC ATGCCAAAGC CTACATCCAC GACGAAAAGG CGCGTCTGCA CGAGCGGCTG CCCGTGCTGG CAGATGAAAT CGGGAAACGG AGCGGCCGGA CCGACATCAC GCTGGACGCC TTTATTGTAT CGGCTACGTC GTACGACGAC CTGCATCAGC ACTATGACGA CGGAACCTGG GATCGCGCCA GGTTTGCCGA AAGACATATC CTTTTCCAAG AGCGTGGGCA GAATTACGAT TACCTGAGGA TACTGTTCGG GCAGCTAACC AGCGCTTCCA CCTGA
|
Protein sequence | MNTASSFTVH HSSFQQLENR LVLLAWLNSL FGYTSNRALL EDCKSVDEGY GPDGRSFLYH HLVARGSQVK ISNDDLARYD ENIRLHLEKI NRGRIERITL RYFQYLAALY TEIYLDRLFN HRGGADGRPP LLADLNAFAT MRNDERGMRN DERGMRNDER GMRNDERGMR NDERGMQRDS SLIIQDSDLT KLAYWMATGS GKTLILHLNY HQFLHYNREP IDNILLITPN EGLSEQHLAE LAASGIPARR FDVNASSLWT GGRDIVQVIE ITKLVEEKRG GGVSVPVEAF EGRNLIFVDE GHKGAGGEAW RKYREALGAT GFTLEYSATF GQALSAARND PLTAEYGKSI LFDYSYKYFY SDGFGKDFRI LNLRDETRSD QTEMLLMGNL LSFYEQVRLY EEQTDALRPY NLEKPLWVFV GSTVNAVYTE NKQPRSDVLT VVRFLHHALS DRDWAIKTIK AVLGGKTGLV SPDGQDVFAG RFGYLRKTGA TPEQLYDDIL RRIFNAPAGG GLHLCEIKSS PGELGLKAAN AGDYFGLIYI GDTSAFKKLA EEEAGDIALE GDAIAHSLFE RINRPDSGLH VLIGAKKFME GWNSWRVASM GLLNIGRQEG SQIIQLFGRG VRLKGKAMSL KRSAALEGSH PPHLRLLETL NIFAVRANYM AQFRDYLERE GVEVEPPIEL PLFVWANEQF LKQGLVVPRP PEVRDFTAET ALLLEPDSRI RVHVDLSVKV QAIESTRLGL HTADVRGGRA QAIPAESLDW VDWQQAYLDV LAYKARKGWS NLVIQPETLR QIVELKDKGV PVCTVIADER VIRPQSFRDR ALLQQVVTQL LCRYVDRLYQ TRREQWDEQT MVYRPLDEND PNLGFRPPGV NEKRAGYVIR VPRSEQQLVE AVRTLLEEQE RLYQQENAGL PRIHFDRHLY QPLLVDMPGK AQIAPPGLKE SEARFVCDLR DYWDAEKNNS LEGKEVFLLR NLSRGYGIGF FEERGFYPDF ILWVVDQATK AQRIVFIEPH GMVHAKAYIH DEKARLHERL PVLADEIGKR SGRTDITLDA FIVSATSYDD LHQHYDDGTW DRARFAERHI LFQERGQNYD YLRILFGQLT SAST
|
| |