Gene Information |
Locus tag | Rcas_0572 |
Symbol | |
ID | 5538035 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 763756 |
End bp | 766686 |
Gene Length | 2931 bp |
Protein Length | 976 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640892733 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001430719 |
Protein GI | 156740590 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
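
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final codon is the stop and encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record for Rcas_0572.
length_bp = gene_length(763756, 766686)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the computed lengths agree with the listed Gene Length (2931 bp) and Protein Length (976 aa).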
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.114911 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.199492 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCGGCTG ACTGGATCGT GGTGCGCGGA GCGCGCGTCC ACAATCTTAA GAATATCACG GTCGCCATGC CGCGCAATGC GCTGGTGGTG ATCACGGGCC TCTCCGGCTC CGGTAAGTCG TCGCTGGCAT TCGACACCAT TTTTGCCGAG GGGCAGCGTC GCTATGTCGA GTCGCTCTCC GTCTATGCGC GCCAGTTCCT CGGTCAGATC GATAAGCCGG ATGTCGATGC GATTGAAGGG TTGTCGCCTG CGATTGCCAT CGACCAGAAG GGTTTGGTGC GCAATCCGCG CTCGACGGTC GGCACGGTCA CCGAAATTTA CGATTACCTT CGCTTGCTCT TCGCCCGGAT TGGACGACCG CACTGCGTTC ACTGCGGTCG TCCGTTGATC CGCCAGTCGG CGCAGCAGAT GATCGATACG ATCCTCGATC TGCCTCCCGG CAGTCGCATT CTGCTGCTGG CGCCGCTCGT GCGCGATCAG AAGGGCGACC ATCAGCCGCT CCTCGATCAG GTGCGCAAAC AGGGGTTTGT GCGCGTGCGC GTCGATGGCG AGGTGCGCGA CCTGGCGGAC GATCTGCGCC TGGATCGCTA CCGCCCCCAT ACCATCGAGG TGGTCGTTGA TCGTCTGGTC ATACCGCAAT CCGATACAGC GCCGCATCAA TCCCAGTTGC GGGTGCGCGT CGCCGATTCG GTCGAAATGG CGCTGCGGGT TGGTGATGGG GTGGTGATCG TGCAGATCGT TGGCGGCGAT GAACTGCCCC TCTCGCAACG GTATGCCTGC CCGGTGCATG GTCCTGCAAC GATTGGGGCG CTCGAACCGC GCGATTTTTC GTTCAATAAT CCGTCCGGCG CCTGCGCGAC GTGCGACGGT CTCGGCAGTG TGCTGGAGTT CGATCCCGAT CTGGTCATTC CAGACCGCTC ACGTTCGCTG GCGGACGGCG CCATCGCGCC ATGGGCGAAT GTCAGTCGCG CACAGCGCCG CTACTTCGAC GATCTGCTGG CATCGCTCGC CGATCACCTG GGTTTTTCGC CGCACACGCC GCTGCGCGAT CTTCCTCCCG AAGTGATTGC GACAATCCTC TACGGCTCTA ACGGTGATGT GATGCCGTTG CGCTACCGGC TGCGCGGCGA GGAGCGTCTC GTCGAGGCGC CATTCGAGGG CGTAATTCCG GCGTTGCGCC GACGGCTGGG GGAGTGCTCC GATGAAACGG AACGCGCGCA GATCGAGCAG TTCATGACGC CGTGCGTGTG TCCGGCATGC AACGGCGCGC GCCTGCGCCC CGAATTGCTC GCCGTCACCG TCGCCGGATA CACGATTGCG CAGGTGTCGG CGCTGCCCGT CGCTGAAGCA TGGTCGTGGG CGAAAACGCT GGCTGCCGAC GTCGCAGCGG CCGTCTCCTG CTGGCGCGAG ACGCGCGAAA GCAATCTGCG CTCGTCAATC TATGCGCTGA ATGTGCGCGA ATGTCAGATT GCAGCGCCCA TCCTGAACGA CATCTGCGCG CGGCTCCGAT TCCTGAACGA GGTTGGGCTG GAGTATCTCG CGCTGGATCG CGCCGCCGCG ACCCTCTCCG GCGGAGAAGC GCAGCGTATC CGCCTTGCGA CACAGATCGG GTCCGGGTTG AGCGGCGCGC TCTACGTGCT GGACGAGCCG AGCATTGGGC TCCACCCGCG TGATACGGCG CGCCTGCTCA ATACGCTGCG ACGGCTGCGC GACCTGGGGA ATAGTGTGCT GATCGTCGAA CACGACGAGG AAATCATCCG CGCCGCCGAC TGGATCGTCG ATATTGGTCC TGGCGCAGGG 
GAGCGCGGCG GCGAGGTGAT CGTCAGCGGA CCGTTCGAGG CAGTGCTGGC AGAGCCGCGC TCGCTAACCG GGCAGTATCT CTCCGGCAAA CGCGCGATTC CTGTGCCGCG CCGACGGCGC TCCGGCAGCG GCAGGTTTTT GATGATCAAA GGGGCGCGTG AGCACAATCT GAAGCATATC GATGTCGCCA TTCCACTAGG ATGCCTGGTT GCCATCACCG GTGTCAGCGG CTCCGGTAAA TCCACCCTGG TCAACGACAC CCTCTACCCG CGACTGGCGC AGGCGCTCCA TGGCGCGCGC GCGCGCCCCG GCGCCCACGA CGCGATCTAC GGCATTGAAC ATATCGATAA GGTGATCGAC ATCGACCAGT CGCCGATCGG TCGCACGCCG CGTTCCAATC CGGTCACCTA CACCAAAGCC TTTGACCCGA TCCGCAAGTT GTTTGCGCAA ACCCCCGAAG CGCGCGCGCG CGGCTATGAC GCCGGTCGTT TTTCGTTCAA CATTCCCGGC GGGCGCTGCG AACATTGCAA CGGCGAAGGG TTGATGCAGA TCGAGATGCA GTTCCTGCCG GACCTCTACG TGACCTGCGA TGTGTGCCAT GGCGCGCGCT ACAACCGTGA GACGCTTGAC ATCCGCTATC GCGGCAAAAA TATTGCTCAG GTGCTCGATA TGACCGCTGA GGAAGCGGCG GCGTTCTTCG AGCGCGTGCC TGCCATTGCC GAAAAATTGC AGACGTTGAT CGACGTGGGG TTGGGCTACA TTCGCCTCGG TCAACCGGCA ACCACGCTGT CCGGCGGCGA AGCGCAGCGC ATCAAACTGG CGACTGAACT GAGCCGCCGC GCCACCGGAC GCACCCTCTA CATCCTGGAC GAGCCAACCA CCGGATTACA CGTCGCCGAC GTCGACCGGC TGCTGCGTGT GTTGCAGCGG TTGGTCGATG CGGGCAACAC TGTGCTGGTC ATTGAACATA ACCTCGACGT TATCAAGTGC GCCGACTGGG TCATCGACCT TGGTCCCGAA GGCGGCGATG CTGGCGGGCG CGTCGTCGCC GCCGGAACTC CCGAACAGGT GGCGCGAACG CCAGGATCGC ACACCGGTCA GTGTCTGGCG CGCATACTCG TTGAACGTTG A
Protein sequence | MSADWIVVRG ARVHNLKNIT VAMPRNALVV ITGLSGSGKS SLAFDTIFAE GQRRYVESLS VYARQFLGQI DKPDVDAIEG LSPAIAIDQK GLVRNPRSTV GTVTEIYDYL RLLFARIGRP HCVHCGRPLI RQSAQQMIDT ILDLPPGSRI LLLAPLVRDQ KGDHQPLLDQ VRKQGFVRVR VDGEVRDLAD DLRLDRYRPH TIEVVVDRLV IPQSDTAPHQ SQLRVRVADS VEMALRVGDG VVIVQIVGGD ELPLSQRYAC PVHGPATIGA LEPRDFSFNN PSGACATCDG LGSVLEFDPD LVIPDRSRSL ADGAIAPWAN VSRAQRRYFD DLLASLADHL GFSPHTPLRD LPPEVIATIL YGSNGDVMPL RYRLRGEERL VEAPFEGVIP ALRRRLGECS DETERAQIEQ FMTPCVCPAC NGARLRPELL AVTVAGYTIA QVSALPVAEA WSWAKTLAAD VAAAVSCWRE TRESNLRSSI YALNVRECQI AAPILNDICA RLRFLNEVGL EYLALDRAAA TLSGGEAQRI RLATQIGSGL SGALYVLDEP SIGLHPRDTA RLLNTLRRLR DLGNSVLIVE HDEEIIRAAD WIVDIGPGAG ERGGEVIVSG PFEAVLAEPR SLTGQYLSGK RAIPVPRRRR SGSGRFLMIK GAREHNLKHI DVAIPLGCLV AITGVSGSGK STLVNDTLYP RLAQALHGAR ARPGAHDAIY GIEHIDKVID IDQSPIGRTP RSNPVTYTKA FDPIRKLFAQ TPEARARGYD AGRFSFNIPG GRCEHCNGEG LMQIEMQFLP DLYVTCDVCH GARYNRETLD IRYRGKNIAQ VLDMTAEEAA AFFERVPAIA EKLQTLIDVG LGYIRLGQPA TTLSGGEAQR IKLATELSRR ATGRTLYILD EPTTGLHVAD VDRLLRVLQR LVDAGNTVLV IEHNLDVIKC ADWVIDLGPE GGDAGGRVVA AGTPEQVART PGSHTGQCLA RILVER
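
The gene and protein sequences are displayed in space-separated blocks of ten characters. As a rough illustration of how the two relate, the sketch below strips that display spacing and translates the first thirty bases of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for sense and stop codons (the tables differ in which codons may initiate translation); the helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Assumption: standard assignments, which table 11 also uses for elongation.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the display spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three ten-base blocks of the gene sequence in this record.
first_blocks = "ATGTCGGCTG ACTGGATCGT GGTGCGCGGA"
print(translate(first_blocks))
```

The result for these thirty bases matches the first ten residues of the listed protein sequence, MSADWIVVRG.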