Gene Information |
Locus tag | Rcas_4419 |
Symbol | |
ID | 5541932 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | - |
Start bp | 5676480 |
End bp | 5679611 |
Gene Length | 3132 bp |
Protein Length | 1043 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640896517 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001434453 |
Protein GI | 156744324 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
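The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record for Rcas_4419.
gene_len = feature_length(5676480, 5679611)
protein_len = expected_protein_length(gene_len)
print(gene_len, protein_len)
```

Under these assumptions the computed values agree with the listed gene length (3132 bp) and protein length (1043 aa).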
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.834068 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.3832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAAAAG ATGCTATTGT CATCAAAGGT GCGCGTGAGC ACAATCTGAA AGGGATCGAC CTGGAAATCC CGCGTGATAA ACTGGTTGTT CTGACCGGCG TCTCCGGGTC GGGCAAGTCG TCGCTGGCGT TCGACACGTT GTACGCCGAA GGGCAGCGCC GATACGTCGA GTCGCTCTCG GCATATGCAC GGCAGTTTCT CGGTCAGATG GAAAAACCGA AAGTCGATGC CATCGAGGGG TTGTCGCCCG CGATCGCCAT CGAGCAGAAA AGCGCCTCGA AGAATCCACG CTCGACGGTC GGCACTGTTA CCGAGATTTA TGACTACCTC CGTTTGCTGT ATGCCCGCGT CGGGACGCAG CACTGTCACG TGTGTGGTCG TCCGGTCAGT TCGCAGAGCG CCGAGCAGAT GGTCAATCGG GTGCTGACCC TGCCGACAGG GACGCGCTTT ATGGTGCTGG CGCCGCTTGT GTCGCAGCGC AAGGGCGAGT ATAAAGATGT CTTCGCGGAA GCGCGCGCCG AGGGGTTCGC GCGGGTGCGC GTCGATGGCG AGATTTTCGA TCTGGCGGGC GAGATCAAAC TCAATAAGAA GGTCAAGCAT ACGATTGAGA TTGTCATTGA CCGTCTGGCG ATGCCGGCGC GCGACGCCGC GCGTGATCAG GTGCGCTCTT CTGACGCTCC CATCGGTCGA GCACAGGAAG GTTCCCAGAG TGAGTGGGAC GCTTTTGTGA CCCGACTGAC CGATAGTGTC GAGCAGGCGC TGCGGGTCGG CGAGGGGCAG TTGATCATCA GCATTCAGAA CAAAACGGGC GCCGCCGAAG AATGGTTGAT GAGCGAAGCC AATACTTGCA CGCACTGCGG TATTTCGTTC CCTGAACTCT CGCCGCAAAT GTTCTCGTTC AACAGTCCGC AGGGCGCCTG CCCCGAATGC ACCGGTCTTG GCGTCCGGCT CGAAGTCGAT CCGCTCCTGC TCGTTCCCAA CCCATCGCTG ACCCTGCACG AAGGTGCGGT GACGTATTGG GGCGAACTGC GCAAGAAGCG TGACTCGTGG GGGTACCGCG CGCTGCTGGC GATTGCCCGG CACTATGGAT TCGATCTCGA TACGCCTTGG GAACAACTCA GCGAACAGGC GCGTCACGTC ATCATCTATG GCAGCGGGAA AGAGCGGATA CGTTTCCAGT GGGGTGATGA AGGCGGCGAT AGCCGAGGTG AGTTTACCCG CACCTGGGAA GGGCTGGCAA GCGAGATTCG CCGCCGTTTT CAGCAGACGG GCAGCGACTA CACACGCGAG TATTACCAGA GTTTTATGAG TGAGCAACCC TGCCCGGCGT GCAGCGGCGC GCGCCTGCGC CCCGAAAGCC TGGCGGTCCG GGTTGGCGGA TTGTCAATCC GCGACGTGAC ACGGATGACG ATTGCCGGGG CACTGGCATG GGTGAATGCC CTGAGCGGCA TTTCCGGCAA CATCGCGCAT CTGGCAGACC TGGAGGGTCA GGTGATGCCC GGGGTTGTGG CAGGAAATGG CGCTGCGCAT CACGGTGCAG TGACGCCATT GACCGATTAT CAGATGGCGA TTGTCAACGA TGTGCTGAAA GAAATTCGTG AACGGCTCGG CTTTCTGCTG AATGTCGGTC TTCATTACCT GACGCTGGAA CGTCCCGCGC CGACGCTCTC CGGCGGTGAG GCGCAACGCA TCCGTCTCGC ATCGCAGATC GGCTCTGGTC TTGTGGGTGT AACGTACATT CTCGACGAGC CGAGCATCGG GCTACATCAG CGCGACAATC GCAAACTCCT CGATACGCTG 
CTCAAACTGC GCGACCTGGG CAACACCGTC GTGGTGGTTG AGCACGATCT GGAAACCATG CAGGCGGCTG ACTGGATCAT CGATTTTGGT CCGGGCGCCG GGGTCAAGGG CGGTCAGGTC GTCGCCGCTG GTCCGCCCAA TGTCGTGGCG GCATCGCCCG AGTCGCTGAC CGGTGCGTAT CTCGCGGGGC GACTTGAGAT CCCTACGCCG CAGCAGCGCC GCACTGCGCG GGTGCGTCCG GTTGCCAATG GATTGCAGGA TGCGCCGCGT CGTCGCCGGG TAGATCATCA GTCCGATCTG GCGGAGGGAC CGTGGCTCGA ACTCGAAGGC GCCACGATGA ACAACCTGCG TGATGTGACT GTTCGCTTTC CGCTGGGGGT CTTTATGTGC GTCACCGGAG TGTCCGGTTC GGGTAAGTCG TCACTGATCA CCGAGACGCT CTACCCGGCG CTGGCGAATC GCCTGCATCG CGCTCAGTTG AAGCCGGGAC CATTCCGCGC GCTGCGCGGG TTGGAGCATC TCGATAAGGT GATCGATATC GATCAGCAAC CAATCGGGCG GACGCCGCGC TCGAACCCGG CGACGTATGT CAAACTGTTC GACCTGATTC GTGAACTATT CGCCTCGACT AATGAAGCGA AACTACGTGG CTATAACGCC GGTCGCTTCT CGTTCAACCT GAAGGGCGGG CGCTGCGAAG CCTGCGAGGG CAATGGCGAA AAACGCATCG ATATGCAGTT CCTGGCAGAT GTCTGGGTGC GCTGCGATGT CTGCAAGGGC AAACGGTACA ACCGCGAAAC GTTGCAGGTC AAGTACAAAG GCAAATCAAT CGCCGATGTC CTTGATATGG ACGTTCAGAC GGCGCTGGAG TTCTTCGACA ATGTGCCGCG TATCAAGCGC ATGCTCCAGA CGTTGCACGA TGTCGGGCTG GACTACATCA AACTCGGGCA ATCGGCGACG ACCCTTTCCG GCGGCGAGGC GCAACGGGTG AAACTTGCGA AAGAACTGGC GCGCGTTGCA ACCGGTCGTA CCATGTATAT TCTCGATGAA CCGACCACCG GGTTGCACTT TGCCGATGTG CAGCGCCTGC TCACCGTGCT GCACCGTCTT GTGGATGCGG GCAACACCGT GCTCGTCATT GAACACAACC TCGATGTGAT TAAGACCGCA GACTGGATCA TCGACATGGG ACCGGAGGGC GGCGACGGCG GCGGCACGGT CGTCGCCGTC GGCACGCCTG AAGAGGTCGC CATGATCGAG GCATCGCACA CGGGACGGTT CCTGCGCGAG ATTCTATATG CGACTGGGGT TAAGGGTGTG GCGCAAGATT AA
Protein sequence | MAKDAIVIKG AREHNLKGID LEIPRDKLVV LTGVSGSGKS SLAFDTLYAE GQRRYVESLS AYARQFLGQM EKPKVDAIEG LSPAIAIEQK SASKNPRSTV GTVTEIYDYL RLLYARVGTQ HCHVCGRPVS SQSAEQMVNR VLTLPTGTRF MVLAPLVSQR KGEYKDVFAE ARAEGFARVR VDGEIFDLAG EIKLNKKVKH TIEIVIDRLA MPARDAARDQ VRSSDAPIGR AQEGSQSEWD AFVTRLTDSV EQALRVGEGQ LIISIQNKTG AAEEWLMSEA NTCTHCGISF PELSPQMFSF NSPQGACPEC TGLGVRLEVD PLLLVPNPSL TLHEGAVTYW GELRKKRDSW GYRALLAIAR HYGFDLDTPW EQLSEQARHV IIYGSGKERI RFQWGDEGGD SRGEFTRTWE GLASEIRRRF QQTGSDYTRE YYQSFMSEQP CPACSGARLR PESLAVRVGG LSIRDVTRMT IAGALAWVNA LSGISGNIAH LADLEGQVMP GVVAGNGAAH HGAVTPLTDY QMAIVNDVLK EIRERLGFLL NVGLHYLTLE RPAPTLSGGE AQRIRLASQI GSGLVGVTYI LDEPSIGLHQ RDNRKLLDTL LKLRDLGNTV VVVEHDLETM QAADWIIDFG PGAGVKGGQV VAAGPPNVVA ASPESLTGAY LAGRLEIPTP QQRRTARVRP VANGLQDAPR RRRVDHQSDL AEGPWLELEG ATMNNLRDVT VRFPLGVFMC VTGVSGSGKS SLITETLYPA LANRLHRAQL KPGPFRALRG LEHLDKVIDI DQQPIGRTPR SNPATYVKLF DLIRELFAST NEAKLRGYNA GRFSFNLKGG RCEACEGNGE KRIDMQFLAD VWVRCDVCKG KRYNRETLQV KYKGKSIADV LDMDVQTALE FFDNVPRIKR MLQTLHDVGL DYIKLGQSAT TLSGGEAQRV KLAKELARVA TGRTMYILDE PTTGLHFADV QRLLTVLHRL VDAGNTVLVI EHNLDVIKTA DWIIDMGPEG GDGGGTVVAV GTPEEVAMIE ASHTGRFLRE ILYATGVKGV AQD
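The sequences above are printed in whitespace-separated blocks of ten characters. A minimal sketch of how such a listing might be joined back into one string and its GC fraction computed is shown below; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence, used here as a short illustration.
example = "ATGGCAAAAG ATGCTATTGT"
cleaned = clean_sequence(example)
print(len(cleaned), f"{gc_fraction(cleaned):.0%}")
```

Passing the complete gene sequence through the same two helpers gives a value that can be compared with the GC content field in the record.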