Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_4093 |
Symbol | |
ID | 5541604 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5303902 |
End bp | 5306970 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640896205 |
Product | SMC domain-containing protein |
Protein accession | YP_001434143 |
Protein GI | 156744014 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0419] ATPase involved in DNA repair |
TIGRFAM ID | [TIGR00618] exonuclease SbcC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.240758 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.708538 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCCCTC GAACGCTGAC ACTCCAGAAC TTCATGTGCT ACCGCGAAGG GTTGCCGCCG CTGGTGCTCG ACGGCATCTC GATTGCGTGT CTGGCGGGTG ACAATGGCGC CGGCAAATCG GCGCTGCTCG ATGCGATCAC CTGGGCATTA TGGGGTGAAG CGCGTCTGAA GAGCGACGAT GATCTGGTAG CGCTTGGCGC CACCGAGATG ATGGTGGACC TGGAGTTCAC CCTCGATGGG CAGGATTATC GGGTGATCCG CCGCCGTATC CGCGGCAAGC GCGGCGGTCA GAGCCAGCTC GACTTCCAGG TGCGCGATGA GAACGGCTGG CGCTCGCTGA CCCCAGGTGG CATTCGTGAA ACGCAGCAGT TGATCATCCG CACGCTGCGC ATGGATTATG AGGTCTTCGC CAATTCGGCG TATCTCCGCC AGGGACACGC CGATGAGTTC ACCCGCAAAG AACCGGCGAA ACGGAAGCAG GTGCTGGCGG ATATTCTTGG TTTGAGCGTG TATGAGGACC TGGAAAGTCG GGCGAAGGAG CGCGCGCGCG CCATCGAAGG GCAGATCCGC GGTCTCGAAG GTCAGATCGG CGAATTGCGC CGCCAGGCGG AACGACGCGA CGTGCTCGTG GGGTTCGTGC GCGACGCTGA ACAGCGTGTC GCAGATGCGC GGCAGCGCAT CACAGAAGCG GAACAGGCGT TCCAAGCGGC GGTTGCCAAA GTGCAGGAAC TCGAAACAGT GCGCACTATC CGCGATAACC GTCAGGAGCA GATTCATCAG CGCCGCGCCG AAAGGGATGC GCAGAAGCAG TGGCTGGATC GGCAGATGGA CATCCGGGAA CGCGCTGAGG GCTGGATTGC GCGCCGTGCC GAGATTGAGG AAGGCATTCG CGCGCTGCGC GCTGCCGAAG CAGAGCGTGA TCGCCTCGCC GCGCTGCGCG ATGAGTATGA TCGGCTTCAG CAGCGTCGCG CGACCCTCGT GCAGGCGCTT GCCGAAGCCG AACACGCTCT CCGTGCCGAT CTGCGCGTTG CTGAGACGCA GGTGCAGACG ATGCGCGAAC GTGCTGCTCG CCGCCCGAAA CTGGTGGCGG AACGGGAGCG CCTGGCGGCA CAGTTGACGG ATCAGACGCC GATGATAGAG GCGTTGTCGG CTGCGCGCAC CCGCCGTACT GACCTGACCG ACCGCCTCCG TCGCGTCAAT GAGCTGCTGC GCCGCCGCAC GGAGCTGGAA GGGGATATCA AACTGAAACA CGACTCGCTG GTCGCCACGC GCGAGGAGCA GAAGCGAATT CTGCGCACGC TGGCAGATCA ACTGAAACAC GAAACGCGCT GGCGCGCCGA ACTTGCCGAA GCGCTCGCGG AACGCACCCG GATCAACGAA GAAAAAGACT GCCTTGAAAT ACTCCGCAAC GATGAGCGCG TCCTTGCCGA GCAGGTTGGC GGCATCCGCG CTGAATGCGA GACGGTTCGA CGACAGGGTG AGCAGATCAA CGAAAAACTA CGCTTGCTTG GTCCTGATGT GACGGTGTGC CCGCTCTGCA AGAGCGAACT CGGTCACGAC GGCATTGTCC ACATCCAGGC GGAATACGAG CGTGAGCGTC AGGCGCTGCG CCAGTGGTAT GCCGCTGCCA AACGCGATGC CGATCAGCTC GAAGCGCAGC TCAAACGCCT GCGCAACGAC ATTCGTGCTG CCGAGAACCG TATTGCTGCG CTTCCCGATC TTCAGGGGCG CATTGCGCGT CTGGAAAGCG ACCTTGCCAG ATGCGACACA CTTCGCCAGC AACAGATCGA GGCGCAGCGC CTGCATGATG ATGTGGCGAT GCGCCTGATG AAGAACGATT ATGAGTTGGC GGCGCGCGAA GAGTTGAAGC GCATCGATGC CGAGATGACG GCGCTTGGCG CCATCGAGAC GCTCGAACGT GAAATCGGCG CGCTTGATCG CCAGGTTGCC GCCCTCGAGA ATCGCAGCCG CGAACAGGCG ACGCTTCAGG CGCAGGTCGA TGCGCTGCAC CGTGAAATCC GGCAGATCGA CGACGACGAC CCGGCGTTGC ACGAACAGGA GCAGATTGTC GCGGAATTGA GCAGACGCCT GGCGCAGAAC GATTTTGCGC ACGACGAGCG CGCAGCGCTT GCCACGCTCG ACGAGCAGAT TGCGGCGTTG GGGTATAGCC GCGAACGGTA TGATCAGGCG AAGGCTGAGG CACAGGCGTT GACTCGTTGG GAGGAAGACC TGACGCGCCT GCAACACGCT GAAGAGTGGA TTGCCGAGAA CGACGATGAC ATCGCGCGCG CGGCGGAGCG CCTCCGGCAA CTTGACGCGC AGATCGCCTC CGACGAAGCC GAGGTGCAAC GCCTCGATGA ACGCCTGCGC GACCTGGCGC CCGCCGCGCG CGCGCGCGAC GCCGCCAGAG CCAGGCTCGA CGACCTCCAC CGCGAATTGC TGGCGTTCCA GAAGGACCTT GGCGAACACG AGGCGAACCT GCGCCGCGCT GAGGAAGCTG CGCGCGGCCT CGCTGATGCC GAAGCGCTCC GCCTGGCGCT TCTCGAACGG AAAAGTCTGT TCGATGAATT GACCCTGGCG TTTGGCAAAA AGGGCGTTCA GGCGATGCTG ATTGAAACCG CTCTTCCCGA ACTGGAGCGT GAAGCCAATC GCCTCCTCGA CCGGATGACC GATAATCAGT TGCATCTGAC GTTCGAGACG CAGCGCGATA CGAAGAAGGG CGATGTCGTC GAAACGCTGG AGATCAAAAT CGCTGATGCG CTCGGTACGC GCGTCTACGA CGCCTATAGC GGTGGTGAAG CGTTCCGTCT CGACTTCGCC ATCCGGATCG CGCTCTCGAA ACTGCTGGCG CGCCGCGCTG GCGCGCGCCT CGAGACCCTG ATCATCGATG AAGGGTTCGG CTCGCAGGAC GCGCGCGGAC GCGAACGCCT GGTTGAGGCG ATCATCTCGG TTCAGCACGA CTTTCGCCGT GTGCTGGTGA TCACGCACAT TCAGGAATTG AAGGATATGT TTCCTGTGCA GATCGAAATT GTCAAAACGC CGCACGGCAG CGTCTGGAGT CTCGCGTGA
|
Protein sequence | MIPRTLTLQN FMCYREGLPP LVLDGISIAC LAGDNGAGKS ALLDAITWAL WGEARLKSDD DLVALGATEM MVDLEFTLDG QDYRVIRRRI RGKRGGQSQL DFQVRDENGW RSLTPGGIRE TQQLIIRTLR MDYEVFANSA YLRQGHADEF TRKEPAKRKQ VLADILGLSV YEDLESRAKE RARAIEGQIR GLEGQIGELR RQAERRDVLV GFVRDAEQRV ADARQRITEA EQAFQAAVAK VQELETVRTI RDNRQEQIHQ RRAERDAQKQ WLDRQMDIRE RAEGWIARRA EIEEGIRALR AAEAERDRLA ALRDEYDRLQ QRRATLVQAL AEAEHALRAD LRVAETQVQT MRERAARRPK LVAERERLAA QLTDQTPMIE ALSAARTRRT DLTDRLRRVN ELLRRRTELE GDIKLKHDSL VATREEQKRI LRTLADQLKH ETRWRAELAE ALAERTRINE EKDCLEILRN DERVLAEQVG GIRAECETVR RQGEQINEKL RLLGPDVTVC PLCKSELGHD GIVHIQAEYE RERQALRQWY AAAKRDADQL EAQLKRLRND IRAAENRIAA LPDLQGRIAR LESDLARCDT LRQQQIEAQR LHDDVAMRLM KNDYELAARE ELKRIDAEMT ALGAIETLER EIGALDRQVA ALENRSREQA TLQAQVDALH REIRQIDDDD PALHEQEQIV AELSRRLAQN DFAHDERAAL ATLDEQIAAL GYSRERYDQA KAEAQALTRW EEDLTRLQHA EEWIAENDDD IARAAERLRQ LDAQIASDEA EVQRLDERLR DLAPAARARD AARARLDDLH RELLAFQKDL GEHEANLRRA EEAARGLADA EALRLALLER KSLFDELTLA FGKKGVQAML IETALPELER EANRLLDRMT DNQLHLTFET QRDTKKGDVV ETLEIKIADA LGTRVYDAYS GGEAFRLDFA IRIALSKLLA RRAGARLETL IIDEGFGSQD ARGRERLVEA IISVQHDFRR VLVITHIQEL KDMFPVQIEI VKTPHGSVWS LA
|
| |