Gene Information |
Locus tag | RoseRS_3438 |
Symbol | |
ID | 5210415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus sp. RS-1 |
Kingdom | Bacteria |
Replicon accession | NC_009523 |
Strand | + |
Start bp | 4311519 |
End bp | 4314641 |
Gene Length | 3123 bp |
Protein Length | 1040 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640597033 |
Product | excinuclease ABC, A subunit |
Protein accession | YP_001277746 |
Protein GI | 148657541 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0178] Excinuclease ATPase subunit |
TIGRFAM ID | [TIGR00630] excinuclease ABC, A subunit |
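The coordinates and lengths listed above are mutually consistent, and a short check makes the arithmetic explicit. This is a minimal sketch that assumes the start and end positions are inclusive and that the 3123 bp gene length includes the terminal stop codon; the variable names are illustrative and are not fields of the record.

```python
# Values copied from the record above
start_bp, end_bp = 4311519, 4314641

# Assumption: coordinates are inclusive, so length = end - start + 1
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the reported gene length (3123 bp) and protein length (1040 aa).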
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.160486 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAAAAG ACGCCATCGT TATCAAGGGT GCGCGTGAGC ACAATCTGAA GGGCATCGAC CTCGAAATCC CACGCGACAA ACTGGTTGTG CTGACCGGCG TCTCAGGTTC GGGAAAGTCG TCGCTGGCGT TCGATACGCT GTATGCCGAA GGACAGCGCC GGTACGTCGA GTCGCTCTCG GCATACGCCC GTCAGTTTCT CGGGCAGATG GAGAAGCCGA AAGTCGATTA CATCGGCGGT CTCTCGCCGG CGATTGCCAT CGAGCAGAAG AGCGCGTCGA AGAATCCGCG CTCGACCGTC GGCACCGTCA CCGAGATCTA CGACTATCTG CGCCTGCTGT ACGCTCGCGT TGGAACGCAG CACTGCCACA TGTGCGGGCG ACCGGTCAGT TCGCAAAGCG CCGAGCAGAT CGTCAACCGC GTGCTGACGT TGCCGGCTGG CACACGCTTC ATGGTGCTGG CGCCGCTCGT GTCGCAACGC AAAGGCGAGT ACAAGGACGT GTTCGCCGAA GCGCGCGCCG AGGGATTCGC GCGGGTGCGC GTCGATGGCG AAATACGCGA CCTGGCAAGC GAAATCAAAC TCAACAAGAA GGTCAAGCAT ACTATCGAGA TTGTGGTTGA TCGTTTGACC ATACTGGCAC GCGAAGGTGC AACCGATCAG TCGCATGTTC CGAGCGCCCC GATCGGAAAA GCGCAGGCAG GGGCGCAGAG CGATTGGGAT GCATTCGTCT CTCGCCTGAC CGATAGTGTC GAACAGGCGC TGCGTGTTGG CGAGGGGCAA CTGGTTATCA GCATCCAGAA TCCATCCGGT GGCACAGAAG AATGGTTGAT GAGCGAAGCC AACACCTGCG TCCACTGTGG CATTTCGTTT CCTGAACTGT CGCCGCAGAT GTTTTCGTTC AACAGTCCGC AGGGCGCCTG CCCCGAATGC ACCGGTCTCG GCGTTCGGAT GGAGGTGGAC CCGCTGCTGC TCGTGCCCAA CCCATCATTG ACCCTGCACG AGGGTGCGGT GACCTACTGG GGCGAACTGC GCAAGAAACG CGATTCGTGG GGGTACCGGG CGTTGCTGGC AATTGCGCAT CACTACGGCT TCGATCTCGA TACGCCATGG GAACAACTCA GCGAGCAGGC ACGCCACGTC ATTATCTATG GCAGCGGAAA GGAACGAATT CGCTTTCGCT GGGGCGACGA AACCAGTGAT AGTCGTGGTG AGTTCATGCG TCCCTGGGAG GGACTGGCAA GTGAAATTCG TCGCCGCTAT CAACAGACCG GCAGTGATTA CACCCGCGAG TATTACCAGA GTTTTATGAG CGAACAACCC TGCCCGGCAT GCGACGGTGC GCGTCTGCGC CCCGAAAGCC TGGCGGTCAG GGTCGGCGGG TGGTCGCTGC GCGATGTGAC GCGCCTGACG ATTACCGGTG CGATGGCATG GGTGCACGCC CTGAGCGGCA TGCCGGTAGA CCCGTCGCAT CTGGCGGCAT TGAACGGGCA TGTCGCAGGC AATGGCGCCA TACCCCACCT CAGTGTGACA CCACTGAGCG ACTACCAGAT GGCGATTGTC AGCGATGTGC TCAAGGAGAT CCGCGAGCGC CTGGGGTTCC TGCTCAATGT CGGTCTGCAT TACCTCACCC TCGAACGCCC CGCGCCAACC CTCTCCGGCG GCGAGGCGCA GCGTATTCGC CTGGCATCAC AGATCGGCTC CGGTCTCGTC GGCGTCACCT ATATCCTCGA TGAACCGAGC ATCGGGCTGC ACCAGCGCGA CAATCGCAAA CTGCTCGATA CGCTGCTGAA ACTGCGCGAT 
CTGGGCAATA CCGTCGTCGT CGTCGAGCAC GACCTGGAAA CCATGCAGGC GGCTGACTGG ATCATCGACT TCGGTCCTGG GGCAGGCGTC AAGGGGGGCG AGGTGGTCGC AGCCGGTCCT CCTGACCTGA TCGCCGCAAA CCCTGGCTCC CTGACCGGCG CATACCTGTC CGGGCGATTG GACATTCCGA TCCCGCAGCA GCGCCGCACT GCGCGGGTGC GCCCGGTTGC CGATACGGCG CAGGACGCGC CGCGCCGTCG TCGCCGGACT GATCACGCAA CCGACCAGGC GGATGGTCCG TGGCTCGAAC TCGAAGGCGC AACCATGAAT AATCTGCGCG ACGTGACCGT TCGTTTTCCG CTCGGCGTCT TCATCTGCGT GACCGGCGTC TCCGGGTCGG GAAAATCATC GCTGATCACC GAAACGCTCT ACCCTGCGCT GGCAAACCGC CTGAACCGCG CGCAGTTGAA GCCGGGACCG TTCCGTACAT TGCGTGGGCT GGAACATCTC GATAAGGTGA TCGATATCGA CCAGCAACCG ATTGGGCGAA CCCCGCGCTC CAACCCGGCA ACATACGTCA AACTGTTCGA CCTGATCCGC GAACTGTTCG CTTCGACCAA TGAGGCGAAA CTGCGCGGCT ACAACGCCGG GCGCTTTTCG TTCAACCTGA AGGGCGGGCG TTGCGAAGCC TGCGAGGGGA ATGGCGAAAA GCGCATCGAC ATGCAGTTCC TGGCGGATGT CTGGGTGCGC TGCGATGTCT GTAAGGGGAA ACGGTACAAC CGTGAAACAT TGCAGGTCAG GTACAAGGGC AAGTCCATTG CTGACGTGCT CGACATGGAC GTGCAGACGG CGCTGGAGTT CTTCGACAAT GTGCCGCGCA TCAGGCGCAT CCTGCAAACG CTCCACGACG TCGGTCTGGA CTACATCAAA CTCGGTCAGT CGGCGACGAC CCTTTCCGGC GGCGAGGCGC AGCGGGTGAA ACTGGCGAAA GAACTGGCGC GCACTGCTAC CGGTCGCACC ATGTATATTC TGGATGAACC AACGACCGGG CTGCACTTCG CCGATGTACA ACGCCTGTTG ACAGTGCTGC ACCGCCTGGT CGATGCAGGC AACACCGTGC TCGTCATCGA GCACAACCTG GACGTTATCA AAACCGCAGA CTGGATCATC GACATGGGAC CGGAAGGCGG CGACGGGGGT GGCAGAGTCG TGGCGACCGG CACACCCGAA GAAGTGGCGC TGATCGAGGA GTCGCACACC GGTCGATTCC TGCGCGAGAT CCTGCACCAC CACAACATCG TTGCCAGGGG CGTGCTTGAG TGA
Protein sequence | MAKDAIVIKG AREHNLKGID LEIPRDKLVV LTGVSGSGKS SLAFDTLYAE GQRRYVESLS AYARQFLGQM EKPKVDYIGG LSPAIAIEQK SASKNPRSTV GTVTEIYDYL RLLYARVGTQ HCHMCGRPVS SQSAEQIVNR VLTLPAGTRF MVLAPLVSQR KGEYKDVFAE ARAEGFARVR VDGEIRDLAS EIKLNKKVKH TIEIVVDRLT ILAREGATDQ SHVPSAPIGK AQAGAQSDWD AFVSRLTDSV EQALRVGEGQ LVISIQNPSG GTEEWLMSEA NTCVHCGISF PELSPQMFSF NSPQGACPEC TGLGVRMEVD PLLLVPNPSL TLHEGAVTYW GELRKKRDSW GYRALLAIAH HYGFDLDTPW EQLSEQARHV IIYGSGKERI RFRWGDETSD SRGEFMRPWE GLASEIRRRY QQTGSDYTRE YYQSFMSEQP CPACDGARLR PESLAVRVGG WSLRDVTRLT ITGAMAWVHA LSGMPVDPSH LAALNGHVAG NGAIPHLSVT PLSDYQMAIV SDVLKEIRER LGFLLNVGLH YLTLERPAPT LSGGEAQRIR LASQIGSGLV GVTYILDEPS IGLHQRDNRK LLDTLLKLRD LGNTVVVVEH DLETMQAADW IIDFGPGAGV KGGEVVAAGP PDLIAANPGS LTGAYLSGRL DIPIPQQRRT ARVRPVADTA QDAPRRRRRT DHATDQADGP WLELEGATMN NLRDVTVRFP LGVFICVTGV SGSGKSSLIT ETLYPALANR LNRAQLKPGP FRTLRGLEHL DKVIDIDQQP IGRTPRSNPA TYVKLFDLIR ELFASTNEAK LRGYNAGRFS FNLKGGRCEA CEGNGEKRID MQFLADVWVR CDVCKGKRYN RETLQVRYKG KSIADVLDMD VQTALEFFDN VPRIRRILQT LHDVGLDYIK LGQSATTLSG GEAQRVKLAK ELARTATGRT MYILDEPTTG LHFADVQRLL TVLHRLVDAG NTVLVIEHNL DVIKTADWII DMGPEGGDGG GRVVATGTPE EVALIEESHT GRFLREILHH HNIVARGVLE
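The relationship between the gene sequence and the protein sequence can be illustrated by translating the first three 10-base blocks of the gene. This is a minimal sketch, not the pipeline that produced the record: the `translate` helper and the table-building code are hypothetical, and it relies on the fact that translation table 11 assigns amino acids to codons the same way as the standard code (the two tables differ only in permitted start codons, which does not matter here because the gene starts with ATG).

```python
# Codon table in NCBI order: first, second, and third base each cycle through TCAG
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the coding sequence
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above
print(translate("ATGGCAAAAG ACGCCATCGT TATCAAGGGT"))  # MAKDAIVIKG
```

The output matches the first block of the protein sequence. Applying the same function to the final bases of the gene (`GGCGTGCTTGAGTGA`) yields `GVLE`, the last four residues of the protein, with `TGA` acting as the stop codon.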