Gene RPB_4617 details

Gene Information

Locus tag: RPB_4617
Symbol:
ID: 3912434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5217870
End bp: 5219738
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 68%
IMG OID: 637886521
Product: ATP-dependent DNA ligase
Protein accession: YP_488211
Protein GI: 86751715
COG category: [L] Replication, recombination and repair
COG ID: [COG1793] ATP-dependent DNA ligase
TIGRFAM ID:
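
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 5217870
END_BP = 5219738
GENE_LENGTH_BP = 1869
PROTEIN_LENGTH_AA = 622


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1869 bp corresponds to 623 codons, one of which is the stop codon, which matches the listed protein length of 622 aa.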


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.656205
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCGGT TCGCCGAATT GCTCGACCGC CTCGCCTATG AGCCCGGCCG CAACAACAAG 
CTGCGGCTGA TCACCAATTA CTTCCGCGCC ACCGAAGACC CCGACCGCGG CTATGCGCTG
GCCGCGCTGA CCGGCGCATT GTCGTTTCGC CACGCCAAGC CCGGCCTGAT CCGGACTCTG
ATCGCCGAAC GCACCGATCC GGTGCTGTTC GGCCTGTCCT ACGACTATGT CGGCGACCTG
TCGGAGACGG TCGCGCTGAT GTGGCCTGCG CCGCGAGGCC GAGCCTCCTC CCCTCCCCCT
TGCGGGGAGG GAGCCTGCCC CGGACTTGAT CCGGGGTCGG GGGTGGGGGT CGACAACGGG
AGCTTTTCCT CGGCGCACCC TCACCCCACC CCTCTCCCGC AAGCGGGAGA GGGAGCGCGC
AGTCCGTCGG GTGAACACAC CTCTCCACTA TCCGACGCAG CAACGAAAGC TGGCCACAAC
AACCCGCCGC CGCCGACGCT CACCGACATC GTCACCACGC TGCACAGCCT CGGCAAGGCC
GAGCTGCCGG CGCAGCTTGC GCGCTGGCTC GACGAACTCG ACGAGACCGG ACGCTGGGCG
TTGTTGAAGC TCGTCACGGG CGGCCTGCGG ATCGGCATCT CGGCGCGCCT CGCAAAGACC
GCTGCCGCAG CGCTCGGCGA TAAGGATCCG CACGACGTCG AACTGGTCTG GCCCGGGCTC
GAGGCGCCCT ATCTCGACTT GTTCGCCTGG CTCGACGGCC GCGGCCCGCA GCCGGTCAAC
CGCGATCCGG CGCCGTTCCG CCCGGTGATG CTGGCGCATG CGGTCGAGGA CGCCGACTTC
GAGGCCATGA CCCCGGCCGA CTACATCGCC GAATGGAAAT GGGACGGCAT CCGCGTCCAG
GCGGTGAGCG GCATCGGCGA CGACGGCGAA ACCGTGGTGC GGCTGTATTC GCGCAGCGGC
GAGGACATCA CCATGAGCTT CCCGGACCTG TTGCCATCGC TGCGGCTGCC CGGCGCGATC
GACGGCGAAC TCCTGGTGCT GCGCGAAGGC CGGGTGCAGT CGTTCAACGT GCTGCAGCAA
CGGCTCAACC GCAAATCGGT GACGCCGAAG CTGATCAAGG AGTATCCGAT TCATCTGCGC
GCCTACGACC TGCTCGGCGA CGGCGACGCG GATCTGCGGA TACAGCCCTT CGTCGATCGA
CGTGCGCGGC TCGAGCGCTT CATCGCACGG CTCGACGACC CCAGAGTCGA TCTTTCGCCG
ACGGTCGCGT TCGACGATTG GGACGGCTTG CGGGCGGCGC GGCGCGATCC GGCCAGCGCC
GGCGCCGGCC TCGACGCCGA CGCGGTCGAG GGCGTGATGC TGAAGCGGCG CGACGCGCTG
TATCTGCCCG GCCGGCCGAA AGGCCAGTGG TGGAAGTGGA AGCACGACCC GCACATCATC
GACGCGGTGC TGATGTACGC CCAGCGTGGC CACGGCAAGC GCTCGTCTTA TTACTCCGAC
TACACTTTCG GGGTCTGGAC TGAAGGCGAC GACGGCGACC AGCTCGTTCC GGTCGGCAAA
GCCTATTTCG GCTTCACCGA CGAGGAGCTG CTGCAGATCG ACCGCTTCGT CCGCCGCAAC
ACCACCGAGA AATTCGGCCC GGTCCGCCAT GTCGTGCACG AGCCCGACCA GGGCCTGGTG
CTGGAAGTCG CGTTCGAGGG GCTGCAACGC TCGCCGCGGC ACAAGTCCGG CGTGGCGATG
CGGTTTCCGC GAATCAGCCG GCTGCGCTGG GACAAGCCGC CGCGCGACGC CGACCGGCTG
GCGACGCTGG AACGGATGCT GAAGGCCGAC ACCGCCGGCC CCGAAAAAGC CGCCCCGAGC
AGCCATTGA
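
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The sketch below shows one way to do that and to compute a GC percentage that could be compared with the GC content field; `clean_sequence` and `gc_fraction` are hypothetical helper names, and the demo string uses only the first and last lines of the sequence, so its result is not the value for the whole gene.

```python
def clean_sequence(block_text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Demo input: first and last lines of the gene sequence only.
# Paste the full sequence block here to check the whole gene.
demo = """ATGAACCGGT TCGCCGAATT GCTCGACCGC CTCGCCTATG AGCCCGGCCG CAACAACAAG
AGCCATTGA"""

seq = clean_sequence(demo)
print(len(seq), "bp,", round(gc_fraction(seq) * 100), "% GC")
```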
 
Protein sequence
MNRFAELLDR LAYEPGRNNK LRLITNYFRA TEDPDRGYAL AALTGALSFR HAKPGLIRTL 
IAERTDPVLF GLSYDYVGDL SETVALMWPA PRGRASSPPP CGEGACPGLD PGSGVGVDNG
SFSSAHPHPT PLPQAGEGAR SPSGEHTSPL SDAATKAGHN NPPPPTLTDI VTTLHSLGKA
ELPAQLARWL DELDETGRWA LLKLVTGGLR IGISARLAKT AAAALGDKDP HDVELVWPGL
EAPYLDLFAW LDGRGPQPVN RDPAPFRPVM LAHAVEDADF EAMTPADYIA EWKWDGIRVQ
AVSGIGDDGE TVVRLYSRSG EDITMSFPDL LPSLRLPGAI DGELLVLREG RVQSFNVLQQ
RLNRKSVTPK LIKEYPIHLR AYDLLGDGDA DLRIQPFVDR RARLERFIAR LDDPRVDLSP
TVAFDDWDGL RAARRDPASA GAGLDADAVE GVMLKRRDAL YLPGRPKGQW WKWKHDPHII
DAVLMYAQRG HGKRSSYYSD YTFGVWTEGD DGDQLVPVGK AYFGFTDEEL LQIDRFVRRN
TTEKFGPVRH VVHEPDQGLV LEVAFEGLQR SPRHKSGVAM RFPRISRLRW DKPPRDADRL
ATLERMLKAD TAGPEKAAPS SH
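
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a simplified, standard-library illustration: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which start codons are permitted), and it does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built from the compact NCBI-style listing, base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(block_text):
    """Translate a block-formatted DNA sequence up to the first stop codon."""
    seq = "".join(block_text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGAACCGGT TCGCCGAATT GCTCGACCGC"))  # MNRFAELLDR
```

The last nine bases of the gene, `AGCCATTGA`, translate to `SH` followed by a stop codon, which agrees with the final two residues of the protein sequence.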