Gene Information |
Locus tag | Rpal_4824 |
Symbol | |
ID | 6412510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris TIE-1 |
Kingdom | Bacteria |
Replicon accession | NC_011004 |
Strand | + |
Start bp | 5190666 |
End bp | 5193701 |
Gene Length | 3036 bp |
Protein Length | 1011 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 642714701 |
Product | excinuclease ABC subunit B |
Protein accession | YP_001993788 |
Protein GI | 192293183 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0556] Helicase subunit of the DNA excision repair complex |
TIGRFAM ID | [TIGR00631] excinuclease ABC, B subunit |
|
|
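The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length includes the terminal stop codon. The record does not state either convention, but both are consistent with its values. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information table of this record.
START_BP = 5190666
END_BP = 5193701
GENE_LENGTH_BP = 3036
PROTEIN_LENGTH_AA = 1011


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 3036 bp span corresponds to 1012 codons, of which the last is the stop codon, leaving the 1011 residues reported as Protein Length.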
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.748709 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGAAGA CTCCCGACAA GCCTGGCAAG CCGGCAAAAA CCCCGAAATC CAAAGCACAT CGGCCCGACG TGAAGCCGAT CGGCCCTGCG CTGGCCGAAC TGCTGAACCC GGCGCTGAAT CGCGGCGACG CGGGCCTCGG CTCCGGCACC GGCCTGCAGC CGCCACCGGA CAATTCGCGC GACCGCCGCA CCGGCGGCGA AGCCGCGGTG CATCGTGGCC GCGCCTCGAC ACCCAATCCA GGCGACGGCG CCACACCGCG GCCGACGGCC CTGCAGCCTT ATCCGCAGCC GCCGGGCGCC AGCCGCGGCG GCCTCAATGA AGCGCCGCAG GCCAATTACG GCACCGCCGC CACCATCCCG ACGCTCGATC CGGAACTGGC GCGGCAGCTC GGCTTGCCGA CCGAGGAGGA CGATGCCGAA GCCTTGGCGC GGCCGCCGCG CAGCAAGATG GAGGCGCTCG GCGTCAAGGC CACCGCCGAG GCACTGGAGA GCCTGATCCG CGACGGCCGC CCCGAGTTCA AAGGCGAAGA CGGCGGCGTC AAGCTGTGGG TGCCGCACCG GCCGCCGCGC CCGGAGAAAT CCGAAGGCGG CGTCCGCTTC GTGCTGAAGT CCGACTACCA GCCGCGCGGT GACCAGCCGA CCGCGATCAA AGAACTGGTC GAAGGCCTCG ACAGGAGCGA CCGCACGCAG GTGCTGCTCG GCGTCACCGG CTCGGGCAAG ACCTACACCA TGGCCAAGGT GATCGAGGCG ACGCAGCGCC CGGCGATCAT CCTGGCGCCG AACAAGACGC TGGCGGCGCA GCTCTACGGC GAGTTCAAGA ACTTCTTCCC CGACAACGCC GTCGAGTACT TCGTCTCGTA TTACGACTAC TACCAGCCCG AAGCCTACGT TCCGCGCACC GACACCTATA TCGAGAAGGA TTCGTCGATC AACGAACAGA TCGACCGGAT GCGCCACGCC GCGACCCGCG CGCTGCTCGA GCGCGACGAC GTCATCATCG TCGCCTCGGT GTCGTGCATC TACGGTATCG GCTCGGTCGA GACCTATACG GCGATGACCT TCGCGCTGAA GCGCGGCGAG CGCATCGACC AGCGCCAGCT GATCGCCGAT CTGGTGGCGC TGCAATACAA GCGCACCCAG GCCGACTTCT CGCGCGGCAC CTTCCGGGTG CGCGGCGACG TCATCGACAT CTTCCCGGCG CACTATGAGG ATCGCGCCTG GCGGGTGAAG ATGTTCGGCG ACGAGATCGA GGGCATCGAG GAATTCGACC CGCTCACCGG CCACAAGCAG GACGAGCTGG AATTCGTCAA GATCTACGCC AACTCGCACT ATGTGACGCC GCGGCCGACG CTGATCCAGG CGATTCAGTC GATCAAGACC GAGTTGAAAT GGCGGCTCGA TCAGCTGCAT GCACAAGGCC GCCTGCTGGA AGCACAGCGG CTGGAGCAGC GCACCACCTT CGACATCGAG ATGATGGAAG CGACCGGCTC CTGCGCCGGC ATCGAGAACT ACTCACGGTA CCTGACCGGC CGCCGGCCGG GCGAGCCACC GCCGACGCTG TTCGAATATG TGCCCGACAA CGCGCTGGTG TTCGCCGACG AAAGCCACGT CTCGATCCCG CAGATCGGCG CGATGTTCAA GGGCGACTTC CGCCGCAAGG CGACGCTGGC CGAATACGGC TTCCGCCTGC CGTCCTGCAT GGACAACCGG CCGCTGCGCT TCGAAGAATG GGACATGATG CGGCCGCAGA CGGTCGCGGT GTCGGCGACG CCGGCAGCAT GGGAGCTGAA CGAAAGCGGC 
GGCGTGTTCG TCGAGCAGGT CATTCGCCCC ACCGGCCTGA TCGACCCGCC GGTCGATATC CGCCCGGCGC GCACCCAGGT CGACGACCTC GTCGGCGAGG TCCGCGCCAC TGCGGCGCGC GGCTATCGCA CGCTGATCAC CGTGCTGACC AAGCGGATGG CCGAGGACCT CACCGAGTTC CTGCATGAGC AGGGCATCCG CGTGCGCTAC ATGCATTCGG ACATCGACAC CATCGAGCGC ATCGAGATCA TCCGCGATCT GCGGCTCGGC GCGTTCGACG CGCTGGTCGG CATCAACCTC TTGCGCGAAG GCCTCGACAT TCCCGAATGC GCGCTGGTTG CGATCCTCGA CGCCGACAAG GAAGGCTTCC TGCGCAGCGA GACCTCGCTG ATCCAGACCA TCGGCCGCGC CGCCCGTAAC GTCGACGGCA AGGTCATCCT CTATGCCGAT CAAATGACCG GCTCGATGCA GCGCTCGATC GACGAGACCA ACCGCCGCCG CGAGAAGCAG ATCGAATACA ACACCGCGCA CGGCATCACG CCGGAGAGCG TGAAGAAGTC GATCGGCGAC ATTCTCAACA GCGTGTACGA GCGCGACCAC GTCCTGGTCG AGATCGGCGA CGGCAAGGGC GCCGGCTTCA CCGACGACGC CGCGGTGATC GGCCACAATT TCGAAGCGGT GCTGGCGGAT CTCGAAACCC GGATGCGCGA AGCCGCGGCC GATTTGAACT TCGAAGAAGC CGCCCGTCTG CGCGACGAAG TCAAACGCCT GCGCGCCACC GAGCTCGCAG TGATCGACGA TCCGACCGTG AAGCAGCGCA AGGTCGCCGA CAAAGCCGGC AGCTACGCCG GCAACAAGCG CTATGGCGAC GCCGCGAACC TGCCAGCCGA TGCGGGCAAA GGCGGACGCG GCAAGTCAGG ATCACGAGGC GGCGCCGCCG CGTCACCCTC CCCCTTGCAG GGACGGTCGG CCGAAGACCG GGGCGGGGGT GCCGCGAGCA CGGCGTCAAA GGTCCATAAA CCCGACCTCG ATGAGATGGG TATCGCCGGC TTTCACGAAT TCAAGAAAGT CCAGCGCCCC AAGCCGCGCA AGCCGACGCT CGACGAAATG GGGCCGGGCA CGGAGAGCAA GATCTATCAG CCGACCTCCA GCCGCGAAGC CGGCCCGGAA TTCGGCCCCT CCCCGCGCAG CACCGGCGGC GCGCCAGGCA AGCGGGGCGG ATGGAAGAAG AGGTAG
|
Protein sequence | MAKTPDKPGK PAKTPKSKAH RPDVKPIGPA LAELLNPALN RGDAGLGSGT GLQPPPDNSR DRRTGGEAAV HRGRASTPNP GDGATPRPTA LQPYPQPPGA SRGGLNEAPQ ANYGTAATIP TLDPELARQL GLPTEEDDAE ALARPPRSKM EALGVKATAE ALESLIRDGR PEFKGEDGGV KLWVPHRPPR PEKSEGGVRF VLKSDYQPRG DQPTAIKELV EGLDRSDRTQ VLLGVTGSGK TYTMAKVIEA TQRPAIILAP NKTLAAQLYG EFKNFFPDNA VEYFVSYYDY YQPEAYVPRT DTYIEKDSSI NEQIDRMRHA ATRALLERDD VIIVASVSCI YGIGSVETYT AMTFALKRGE RIDQRQLIAD LVALQYKRTQ ADFSRGTFRV RGDVIDIFPA HYEDRAWRVK MFGDEIEGIE EFDPLTGHKQ DELEFVKIYA NSHYVTPRPT LIQAIQSIKT ELKWRLDQLH AQGRLLEAQR LEQRTTFDIE MMEATGSCAG IENYSRYLTG RRPGEPPPTL FEYVPDNALV FADESHVSIP QIGAMFKGDF RRKATLAEYG FRLPSCMDNR PLRFEEWDMM RPQTVAVSAT PAAWELNESG GVFVEQVIRP TGLIDPPVDI RPARTQVDDL VGEVRATAAR GYRTLITVLT KRMAEDLTEF LHEQGIRVRY MHSDIDTIER IEIIRDLRLG AFDALVGINL LREGLDIPEC ALVAILDADK EGFLRSETSL IQTIGRAARN VDGKVILYAD QMTGSMQRSI DETNRRREKQ IEYNTAHGIT PESVKKSIGD ILNSVYERDH VLVEIGDGKG AGFTDDAAVI GHNFEAVLAD LETRMREAAA DLNFEEAARL RDEVKRLRAT ELAVIDDPTV KQRKVADKAG SYAGNKRYGD AANLPADAGK GGRGKSGSRG GAAASPSPLQ GRSAEDRGGG AASTASKVHK PDLDEMGIAG FHEFKKVQRP KPRKPTLDEM GPGTESKIYQ PTSSREAGPE FGPSPRSTGG APGKRGGWKK R
|
| |