Gene Information |
Locus tag | RoseRS_3848 |
Symbol | |
ID | 5210830 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | - |
Start bp | 4806619 |
End bp | 4809546 |
Gene Length | 2928 bp |
Protein Length | 975 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640597444 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001278152 |
Protein GI | 148657947 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
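The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(4806619, 4809546)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values listed in this record, the computed lengths agree with the reported 2928 bp and 975 aa.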
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.00424286 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000162445 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTCTGCTG ATCGGATCGT GGTGCGCGGG GCGCGCGTCC ATAATCTTAA GAATATCACG GTCGCCATGC CGCGCAATGC GCTGGTGGTG ATCACCGGTC TCTCCGGCTC CGGCAAGTCG TCGCTGGCGT TCGATACGAT TTTTGCCGAA GGTCAGCGTC GCTATGTCGA ATCGCTTTCC GCCTATGCCC GCCAGTTCCT TGGTCAGATC GACAAGCCGG ACGTTGACGC CATCGAAGGA TTGTCGCCTG CCATTGCCAT CGATCAGAAA GGGCTGGCGC GCAATCCGCG CTCAACCGTC GGCACCATTA CCGAAATCTA CGATTATCTG CGACTGCTGT TTGCCCGGAT TGGACGACCG CACTGCATCC ACTGCGGTCG TCCGCTCACG CGCCAGTCGG CGCAACAGAT GATCGATACG ATCCTCGGTC TGCCGGGTGG CAGCCGCGTG CTGCTGCTTG CGCCCCTCGT TCGCGATCAG AAGGGCGACC ATCAGACCCT CTTCGATCAG GTGCGCAAGC AGGGGTTCGT CCGTGTGCGC GTCGATGGCG AGGTGCGCGA TCTTGGCGAC GACCTGCGCC TTGACCGTCA TCGTCCGCAT ACGATCGAGG TCGTTGTGGA CCGGCTGGTC ATTCCATCAG CCGATCCCTC GCAGCAACAG ACGCAGTTTC GCGTGCGCGT CGCCGATTCG GTCGAAACGG CGCTGCGGGT CGGCGGCGGC GTGGTGATCG TCCAGATCGT CGGCGGTGAT GAACTCACTC TGTCGCAGCG TTACGCCTGC CCGGTGCACG GACCGGCTTC CATCGGGGCG CTCGAACCGC GCGATTTTTC ATTCAACAAT CCATCGGGCG CCTGCCCTGC CTGTGATGGT CTGGGAAGCG TGCTGGAGTT CGACCCTGAA CTGGTGATCC CCGACCGATC CCGTTCACTT GCGGAAGGCG CCGTCGCGCC CTGGTCGAAC GTCAGTCGCG CGCAGCGCCG CTACTTCGAC GATCTGCTGA CATCACTCGC CGGGCACCTC GGTTTTTCAC TGCACACACC AGTGCGCGAT CTGCGCCCGG AGGTGATCGC TACGCTGCTC TACGGTTCTA ACGGCGATGT GATGCCACTC CGTTACCACA TACGCGGCGA AGAGCGCATG GTTGAGGCGC CGTTCGAGGG GGTTATCCCC GGTCTGCGCC GTCGTCTGGC GGAATGCACC GATGAAACCG AACGCGCACA GATTGAGCAG TTTATGACCC CGCGCACATG CCCGGCATGC AACGGCGCGC GACTGCGTCC CGAACTGCTC GCCGTCACCG TCGCCGGATA CACGATTGCG CAGGTGGCGG CGCTCCCTGT CGCTGAAGCG TGGTCGTGGG CGAAAACACT GGCTGCCGAC GTTGACGAGG TGGTCGCCCG CTGGCGTGAG ACGCGCGAAA GCGACCTGCG CTCGTCCATC TATGCTCTTA CGGTGCGCGA ATGTCAGATC GCGGCGCCAA TCCTGAATGA CATCTGCGCC CGACTCCGGT TTCTAAATGA AGTCGGGCTG GGGTATCTCA CGCTTGACCG CACTGCGACG ACCCTTGCTG GCGGCGAGGC GCAGCGCATT CGCCTGGCGA CGCAGATCGG CGCCGGGCTG AGCGGTGCGC TCTATGTGCT GGACGAGCCG AGCATCGGGT TACACCCACG TGATACGGCG CGCCTGCTCA ACACGTTGCG TCAACTGCGC GACCTGGGCA ACAGCGTGCT GGTTGTTGAA CACGACGAAG AGATCATTCG CGCAGCCGAC TGGATCGTTG ACATTGGTCC CGGTGCGGGA 
GAGCGCGGCG GCGAGGTGAT CGTCAGCGGT CCGCTCGATG CAGTGCTGGC AGAGCCGCGC TCACTTACCG GACAGTACCT CTCCGGCAAA CGCACGATTG CCGTGCCACG CCGCCGACGA CCCGGCAACG GCGCATTCCT GATGATCAGG GGAGCGCGCG AGCACAACCT GAAGAACATC GACGTCGCCA TCCCGCTGGG ATGTCTGGTG GCAGTCACCG GCGTCAGCGG GTCTGGCAAA TCGACCCTGA TCAACGACAC CCTCTACCCG CGGCTGGCGC AGGCGCTCCA TGGCGCGCGC GCGCGCCCCG GCGCCCACGA CGCAATCTAT GGCATCGAAC AGATCGATAA GGTGATCGAC ATCGACCAGT CGCCCATCGG GCGCACGCCA CGCTCCAACC CGGTCACCTA CACCAAGGCG TTCGACCCGA TCCGCAAATT GTTCGCGCAA ACGCCTGAAG CGCGCGCGCG CGGCTATGAT GCCAGTCGTT TCTCGTTCAA CATTCCCGGA GGACGGTGCG AGCACTGCAA CGGCGAAGGA TTGATGCAGA TCGAAATGCA GTTCCTTCCC GATCTGTACG TCACCTGCGA TGTCTGTCAC GGCGCGCGCT ACAACCGTGA AACGCTCGAC ATTCGTTACC GGGGCAAGAA CATCGCCGAA GTGCTCGACA TGACGGTTGA AGAAGCGGCG GCATTTTTCG AGCGTGTCCC ATCCATCGCC GAAAAGTTGC AAACCCTGAT CGACGTCGGG TTAGGGTATA TTCGCCTGGG TCAACCTGCG ACCACCCTCT CCGGCGGTGA GGCGCAACGC ATCAAACTGG CGACCGAACT GAGTCGGCGC GCCACCGGGC GCACCCTCTA CATCCTCGAT GAACCGACGA CCGGCCTGCA CGTCGCTGAT GTTGACCGGT TGCTGCGCGT CTTGCAGCGC CTGGTGGACG CGGGAAACAC CGTTCTGGTG ATCGAACACC ATCTCGATGT GATCAAATGC GCCGATTGGG TGATCGACCT GGGACCGGAG GGAGGCGAGG AGGGCGGGCG TGTTGTCGCC GTCGGCACGC CTGAACAGGT CGCGCGAACG CCAGGATCGT ACACCGGTCA GTGCCTGGCG CGGGTGGTTG AAGGTTGA
|
Protein sequence | MSADRIVVRG ARVHNLKNIT VAMPRNALVV ITGLSGSGKS SLAFDTIFAE GQRRYVESLS AYARQFLGQI DKPDVDAIEG LSPAIAIDQK GLARNPRSTV GTITEIYDYL RLLFARIGRP HCIHCGRPLT RQSAQQMIDT ILGLPGGSRV LLLAPLVRDQ KGDHQTLFDQ VRKQGFVRVR VDGEVRDLGD DLRLDRHRPH TIEVVVDRLV IPSADPSQQQ TQFRVRVADS VETALRVGGG VVIVQIVGGD ELTLSQRYAC PVHGPASIGA LEPRDFSFNN PSGACPACDG LGSVLEFDPE LVIPDRSRSL AEGAVAPWSN VSRAQRRYFD DLLTSLAGHL GFSLHTPVRD LRPEVIATLL YGSNGDVMPL RYHIRGEERM VEAPFEGVIP GLRRRLAECT DETERAQIEQ FMTPRTCPAC NGARLRPELL AVTVAGYTIA QVAALPVAEA WSWAKTLAAD VDEVVARWRE TRESDLRSSI YALTVRECQI AAPILNDICA RLRFLNEVGL GYLTLDRTAT TLAGGEAQRI RLATQIGAGL SGALYVLDEP SIGLHPRDTA RLLNTLRQLR DLGNSVLVVE HDEEIIRAAD WIVDIGPGAG ERGGEVIVSG PLDAVLAEPR SLTGQYLSGK RTIAVPRRRR PGNGAFLMIR GAREHNLKNI DVAIPLGCLV AVTGVSGSGK STLINDTLYP RLAQALHGAR ARPGAHDAIY GIEQIDKVID IDQSPIGRTP RSNPVTYTKA FDPIRKLFAQ TPEARARGYD ASRFSFNIPG GRCEHCNGEG LMQIEMQFLP DLYVTCDVCH GARYNRETLD IRYRGKNIAE VLDMTVEEAA AFFERVPSIA EKLQTLIDVG LGYIRLGQPA TTLSGGEAQR IKLATELSRR ATGRTLYILD EPTTGLHVAD VDRLLRVLQR LVDAGNTVLV IEHHLDVIKC ADWVIDLGPE GGEEGGRVVA VGTPEQVART PGSYTGQCLA RVVEG
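The relationship between the gene sequence and the protein sequence above can be sketched in a few lines. This is an illustrative example only, assuming the gene sequence is already given in coding orientation (it begins with ATG even though the gene lies on the minus strand) and that the space-separated 10-base blocks are purely cosmetic. Translation table 11 assigns the same amino acids to codons as the standard code, so a standard codon table is used; `clean`, `translate`, and `gc_content` are hypothetical helper names.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11 uses
# the same codon-to-amino-acid assignments and differs only in start codons).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove whitespace used for 10-base blocks and line breaks; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGTCTGCTG ATCGGATCGT GGTGCGCGGG"
print(translate(prefix))  # MSADRIVVRG, the first block of the protein sequence
```

Passing the full gene sequence to `translate` and `gc_content` would allow a comparison with the listed protein sequence and the reported GC content.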