Gene Rpal_4824 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4824 
Symbol 
ID6412510 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp5190666 
End bp5193701 
Gene Length3036 bp 
Protein Length1011 aa 
Translation table11 
GC content67% 
IMG OID642714701 
Productexcinuclease ABC subunit B 
Protein accessionYP_001993788 
Protein GI192293183 
COG category[L] Replication, recombination and repair 
COG ID[COG0556] Helicase subunit of the DNA excision repair complex 
TIGRFAM ID[TIGR00631] excinuclease ABC, B subunit 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.748709 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGAAGA CTCCCGACAA GCCTGGCAAG CCGGCAAAAA CCCCGAAATC CAAAGCACAT 
CGGCCCGACG TGAAGCCGAT CGGCCCTGCG CTGGCCGAAC TGCTGAACCC GGCGCTGAAT
CGCGGCGACG CGGGCCTCGG CTCCGGCACC GGCCTGCAGC CGCCACCGGA CAATTCGCGC
GACCGCCGCA CCGGCGGCGA AGCCGCGGTG CATCGTGGCC GCGCCTCGAC ACCCAATCCA
GGCGACGGCG CCACACCGCG GCCGACGGCC CTGCAGCCTT ATCCGCAGCC GCCGGGCGCC
AGCCGCGGCG GCCTCAATGA AGCGCCGCAG GCCAATTACG GCACCGCCGC CACCATCCCG
ACGCTCGATC CGGAACTGGC GCGGCAGCTC GGCTTGCCGA CCGAGGAGGA CGATGCCGAA
GCCTTGGCGC GGCCGCCGCG CAGCAAGATG GAGGCGCTCG GCGTCAAGGC CACCGCCGAG
GCACTGGAGA GCCTGATCCG CGACGGCCGC CCCGAGTTCA AAGGCGAAGA CGGCGGCGTC
AAGCTGTGGG TGCCGCACCG GCCGCCGCGC CCGGAGAAAT CCGAAGGCGG CGTCCGCTTC
GTGCTGAAGT CCGACTACCA GCCGCGCGGT GACCAGCCGA CCGCGATCAA AGAACTGGTC
GAAGGCCTCG ACAGGAGCGA CCGCACGCAG GTGCTGCTCG GCGTCACCGG CTCGGGCAAG
ACCTACACCA TGGCCAAGGT GATCGAGGCG ACGCAGCGCC CGGCGATCAT CCTGGCGCCG
AACAAGACGC TGGCGGCGCA GCTCTACGGC GAGTTCAAGA ACTTCTTCCC CGACAACGCC
GTCGAGTACT TCGTCTCGTA TTACGACTAC TACCAGCCCG AAGCCTACGT TCCGCGCACC
GACACCTATA TCGAGAAGGA TTCGTCGATC AACGAACAGA TCGACCGGAT GCGCCACGCC
GCGACCCGCG CGCTGCTCGA GCGCGACGAC GTCATCATCG TCGCCTCGGT GTCGTGCATC
TACGGTATCG GCTCGGTCGA GACCTATACG GCGATGACCT TCGCGCTGAA GCGCGGCGAG
CGCATCGACC AGCGCCAGCT GATCGCCGAT CTGGTGGCGC TGCAATACAA GCGCACCCAG
GCCGACTTCT CGCGCGGCAC CTTCCGGGTG CGCGGCGACG TCATCGACAT CTTCCCGGCG
CACTATGAGG ATCGCGCCTG GCGGGTGAAG ATGTTCGGCG ACGAGATCGA GGGCATCGAG
GAATTCGACC CGCTCACCGG CCACAAGCAG GACGAGCTGG AATTCGTCAA GATCTACGCC
AACTCGCACT ATGTGACGCC GCGGCCGACG CTGATCCAGG CGATTCAGTC GATCAAGACC
GAGTTGAAAT GGCGGCTCGA TCAGCTGCAT GCACAAGGCC GCCTGCTGGA AGCACAGCGG
CTGGAGCAGC GCACCACCTT CGACATCGAG ATGATGGAAG CGACCGGCTC CTGCGCCGGC
ATCGAGAACT ACTCACGGTA CCTGACCGGC CGCCGGCCGG GCGAGCCACC GCCGACGCTG
TTCGAATATG TGCCCGACAA CGCGCTGGTG TTCGCCGACG AAAGCCACGT CTCGATCCCG
CAGATCGGCG CGATGTTCAA GGGCGACTTC CGCCGCAAGG CGACGCTGGC CGAATACGGC
TTCCGCCTGC CGTCCTGCAT GGACAACCGG CCGCTGCGCT TCGAAGAATG GGACATGATG
CGGCCGCAGA CGGTCGCGGT GTCGGCGACG CCGGCAGCAT GGGAGCTGAA CGAAAGCGGC
GGCGTGTTCG TCGAGCAGGT CATTCGCCCC ACCGGCCTGA TCGACCCGCC GGTCGATATC
CGCCCGGCGC GCACCCAGGT CGACGACCTC GTCGGCGAGG TCCGCGCCAC TGCGGCGCGC
GGCTATCGCA CGCTGATCAC CGTGCTGACC AAGCGGATGG CCGAGGACCT CACCGAGTTC
CTGCATGAGC AGGGCATCCG CGTGCGCTAC ATGCATTCGG ACATCGACAC CATCGAGCGC
ATCGAGATCA TCCGCGATCT GCGGCTCGGC GCGTTCGACG CGCTGGTCGG CATCAACCTC
TTGCGCGAAG GCCTCGACAT TCCCGAATGC GCGCTGGTTG CGATCCTCGA CGCCGACAAG
GAAGGCTTCC TGCGCAGCGA GACCTCGCTG ATCCAGACCA TCGGCCGCGC CGCCCGTAAC
GTCGACGGCA AGGTCATCCT CTATGCCGAT CAAATGACCG GCTCGATGCA GCGCTCGATC
GACGAGACCA ACCGCCGCCG CGAGAAGCAG ATCGAATACA ACACCGCGCA CGGCATCACG
CCGGAGAGCG TGAAGAAGTC GATCGGCGAC ATTCTCAACA GCGTGTACGA GCGCGACCAC
GTCCTGGTCG AGATCGGCGA CGGCAAGGGC GCCGGCTTCA CCGACGACGC CGCGGTGATC
GGCCACAATT TCGAAGCGGT GCTGGCGGAT CTCGAAACCC GGATGCGCGA AGCCGCGGCC
GATTTGAACT TCGAAGAAGC CGCCCGTCTG CGCGACGAAG TCAAACGCCT GCGCGCCACC
GAGCTCGCAG TGATCGACGA TCCGACCGTG AAGCAGCGCA AGGTCGCCGA CAAAGCCGGC
AGCTACGCCG GCAACAAGCG CTATGGCGAC GCCGCGAACC TGCCAGCCGA TGCGGGCAAA
GGCGGACGCG GCAAGTCAGG ATCACGAGGC GGCGCCGCCG CGTCACCCTC CCCCTTGCAG
GGACGGTCGG CCGAAGACCG GGGCGGGGGT GCCGCGAGCA CGGCGTCAAA GGTCCATAAA
CCCGACCTCG ATGAGATGGG TATCGCCGGC TTTCACGAAT TCAAGAAAGT CCAGCGCCCC
AAGCCGCGCA AGCCGACGCT CGACGAAATG GGGCCGGGCA CGGAGAGCAA GATCTATCAG
CCGACCTCCA GCCGCGAAGC CGGCCCGGAA TTCGGCCCCT CCCCGCGCAG CACCGGCGGC
GCGCCAGGCA AGCGGGGCGG ATGGAAGAAG AGGTAG
 
Protein sequence
MAKTPDKPGK PAKTPKSKAH RPDVKPIGPA LAELLNPALN RGDAGLGSGT GLQPPPDNSR 
DRRTGGEAAV HRGRASTPNP GDGATPRPTA LQPYPQPPGA SRGGLNEAPQ ANYGTAATIP
TLDPELARQL GLPTEEDDAE ALARPPRSKM EALGVKATAE ALESLIRDGR PEFKGEDGGV
KLWVPHRPPR PEKSEGGVRF VLKSDYQPRG DQPTAIKELV EGLDRSDRTQ VLLGVTGSGK
TYTMAKVIEA TQRPAIILAP NKTLAAQLYG EFKNFFPDNA VEYFVSYYDY YQPEAYVPRT
DTYIEKDSSI NEQIDRMRHA ATRALLERDD VIIVASVSCI YGIGSVETYT AMTFALKRGE
RIDQRQLIAD LVALQYKRTQ ADFSRGTFRV RGDVIDIFPA HYEDRAWRVK MFGDEIEGIE
EFDPLTGHKQ DELEFVKIYA NSHYVTPRPT LIQAIQSIKT ELKWRLDQLH AQGRLLEAQR
LEQRTTFDIE MMEATGSCAG IENYSRYLTG RRPGEPPPTL FEYVPDNALV FADESHVSIP
QIGAMFKGDF RRKATLAEYG FRLPSCMDNR PLRFEEWDMM RPQTVAVSAT PAAWELNESG
GVFVEQVIRP TGLIDPPVDI RPARTQVDDL VGEVRATAAR GYRTLITVLT KRMAEDLTEF
LHEQGIRVRY MHSDIDTIER IEIIRDLRLG AFDALVGINL LREGLDIPEC ALVAILDADK
EGFLRSETSL IQTIGRAARN VDGKVILYAD QMTGSMQRSI DETNRRREKQ IEYNTAHGIT
PESVKKSIGD ILNSVYERDH VLVEIGDGKG AGFTDDAAVI GHNFEAVLAD LETRMREAAA
DLNFEEAARL RDEVKRLRAT ELAVIDDPTV KQRKVADKAG SYAGNKRYGD AANLPADAGK
GGRGKSGSRG GAAASPSPLQ GRSAEDRGGG AASTASKVHK PDLDEMGIAG FHEFKKVQRP
KPRKPTLDEM GPGTESKIYQ PTSSREAGPE FGPSPRSTGG APGKRGGWKK R