Gene Rpal_4171 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRpal_4171 
SymbolligD 
ID6411855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris TIE-1 
KingdomBacteria 
Replicon accessionNC_011004 
Strand
Start bp4474379 
End bp4477123 
Gene Length2745 bp 
Protein Length914 aa 
Translation table11 
GC content65% 
IMG OID642714053 
ProductATP-dependent DNA ligase 
Protein accessionYP_001993142 
Protein GI192292537 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase
[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02777] DNA ligase D, 3'-phosphoesterase domain
[TIGR02778] DNA polymerase LigD, polymerase domain
[TIGR02779] DNA polymerase LigD, ligase domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.497319 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCAGCA ACAAGCTCAG CATCTACCGC AAGAAACGCG ATTTCGAGCA GACCGCGGAG 
CCGCGTGGCG ATGCCAAGGT GGCGCCGTCG AAGCAGCGGC GGTTCGTGAT TCAGAAGCAC
GACGCCACCC GGCTGCACTA CGACCTGCGC CTGGAATATG GCGGTGTCTT TAAGTCCTGG
GCGGTGACCA AAGGACCGTC GCTCGATCCC AACGACAAGC GCCTTGCTGT CGAGGTCGAG
GATCATCCGC TCGACTACGG TGACTTCGAA GGCACGATTC CCAAGGGTCA GTACGGCGGC
GGCACGGTGC AGCTCTGGGA CCGCGGCTAT TGGGAGTGCG ACGATCCCGA GCGCGGCTTC
AAGACCGGCG ACCTGAAGTT CACCCTGCAT GGCGAGAAGC TGCATGGCAG TTGGGTGCTG
GTGCGGATGC GGCACGACCG CAGCGGCGGC AAGCGCACCA ATTGGCTGCT GATCAAGCAC
CGCGACGACG ACGCCCGCGA GGGCGGCGCC AACACCGTGC TCGACGACGA CCGCTCGGTC
GCCTCGGGGC GCGCCATGGA AACGATCGCG GAAGGCAAGG GCAAGGCACC GAAGCCATTC
ATGCTGCGCA AGGCGGCCGC GAAGGTCACG GCGGACGCGG TGTGGGATTC CAACACCGGG
CTTGCGGCCG ATAAGCGGGC GGAGACTGGC GCTGCGAAGA AAGTCGCCAA GTCCAAGCCG
AAGACCTCCG CGAAAGCGAA AGCTGCCAAG TCCGCGAAGC CAGCGTCGCG GAGTAAGCTA
GCCGCGCAAG GCCAGCGCGC AGCCATGCCG GACTTCGTGC CGCCGCAGCT CTGCACCTCG
GTGGAGCGGC CGCCCGGCGG CGAGGGCTGG GGTCACGAAA TCAAGTTCGA TGGCTACCGG
ATGCAACTCC GCGTCGAGCA TGGCGAAGCC AGGCTGCTGA CACGCAAGGG TCTGGACTGG
ACCGCCAAGT TTAGTGCGAT CGCCGACGAA GCGGCATCGC TGCCGGACTC GATTATCGAC
ACCGAGGTGG TGGCGCTGGA TTCGCACGGG CAGCCGGATT TCGCGGCATT GCAGGCGGCG
CTGTCCGATG GTCACACCGA GAACTTGATC TGCTTCGCGT TCGATCTGCT CTACGTCGAT
GGTGAGGATC TGCGCGGCCT GCCGTTGGCG GAGCGCAAGG CGCGGCTTGC GGCCCTGCTG
AAAGCAGCGC GCGGCCGGCG CAAGGAGGGG CTGATCCGTT ACGTCGAGCA TTTCGACACG
GGCGGAGACG CGATCCTGCA ATCGGCCTGC AAGCTGTCGC TTGAGGGGAT CGTATCGAAA
AAGCTCGACG CCGCGTATCG CTCCGGTCGC AGCGATGGCT GGACCAAGGC GAAGTGCCGC
GCCGGCCATG AGGTGGTGAT CGGTGGCTGG AAGACCACGA ACGGCAAATT CCGCTCATTG
CTTGTCGGCG TCCATCGCGG CGATGTCCTT GCCTATGTGG GCATCGTCGG CACCGGCTTT
GGCCAGGACG TCGTCAAACG CATCCTGCCC GAGCTGAAAG CGCACAAAGC GGATACCAGT
CCGTTCAGCG GCGCCAATGC GCCGAAGAAG ACCCGTGAGT TGCATTGGGC TACGCCCGAT
CTCGTCGCTG AGATCGAATT TGCCGGCTTC ACCGGCGCCG GCATGGTGCG GCAGGCGTCG
TTCAAGGGCC TACGTGCCGA CAAGCCGGCG GAGGAGGTGG AGGCCGAGGA GCCGCAGCGG
GTCGAGGTTG CCGAGCCGGC ACTGAAGCGT CGCACTGCCA AGGCTTCGAA GTCGGGCAAG
CCGGCCGCAA AGCGCGCGGC GCCGCGCCGG GCCGATGCCG CCCGCTCCGA GGTGATGGGC
GTGGCGATCT CCAAACCCGA CAAGGAGCTG TGGCCGGCGA GCGAGATCGG CGCCGCGATC
ACCAAACGCG ATCTTGCCGA TTACTTCGCC GAGGTCGGCG ACTGGCTGAT CCCGCACATC
AAGGGACGGC CGTGTTCGAT CGTGCGGGCA CCGGACGGAA TCGACGGCGA GCACTTCTTC
CAGCGGCACG CGATGCCGGG AATGTCGAAC CTGATCGGGC TGGCAAAGGT CTCAGGCGAT
CGTAAGCCGT ATGTGCAGAT CGATCGTGTG GAAGGACTGA TCGCGGCGGC GCAGATCGGC
GGGGTTGAGC TGCATCCGTG GAACTGTGCG CCGAACCGCT ACGACGTGCC GGGACGTCTG
GTGTTCGATC TCGATCCGGC GCCTGAGGTC GGGTTCGACG AGGTGATCAC GGCCGCCAAG
GAGATGAAGC TGCGGCTGGA AACGCTCGGG CTTGCCACCT TCTGCAAGAC CACTGGCGGC
AAGGGGCTGC ACGTCGTGGT GCCGCTCAAA CCCGACGACG GCATCGACTG GAAGGCGGCG
AAGACCTTCG CCCAGACGGT GTGTGCGCAG ATGGCCGAGG ATAGCCCCGA GCAATATCTG
CTCAACATGT CGAAGCAGAA GAGGATCGGA AAGATCTTCC TCGATTACTT GCGCAACGAC
CGGATGTCGA CCGCGGTCGC CGTGCTGTCG CCACGGGCGC GCACCGGTGC CACGGTGTCG
ATGCCGCTGA CGTGGTCGCA GGTGAAGGCC GGGCTCGATC CCAAGCGCTA CACGATCAAG
TCGGTGCCCG CGCTGATGAA GCGCTCGAAG GCGTGGGCGG ATTACGACGA CGCCGCAGCT
CCGCTGAAGG CCGCGATCAA GACGCTGACC TCGGGCAAGA CATAG
 
Protein sequence
MASNKLSIYR KKRDFEQTAE PRGDAKVAPS KQRRFVIQKH DATRLHYDLR LEYGGVFKSW 
AVTKGPSLDP NDKRLAVEVE DHPLDYGDFE GTIPKGQYGG GTVQLWDRGY WECDDPERGF
KTGDLKFTLH GEKLHGSWVL VRMRHDRSGG KRTNWLLIKH RDDDAREGGA NTVLDDDRSV
ASGRAMETIA EGKGKAPKPF MLRKAAAKVT ADAVWDSNTG LAADKRAETG AAKKVAKSKP
KTSAKAKAAK SAKPASRSKL AAQGQRAAMP DFVPPQLCTS VERPPGGEGW GHEIKFDGYR
MQLRVEHGEA RLLTRKGLDW TAKFSAIADE AASLPDSIID TEVVALDSHG QPDFAALQAA
LSDGHTENLI CFAFDLLYVD GEDLRGLPLA ERKARLAALL KAARGRRKEG LIRYVEHFDT
GGDAILQSAC KLSLEGIVSK KLDAAYRSGR SDGWTKAKCR AGHEVVIGGW KTTNGKFRSL
LVGVHRGDVL AYVGIVGTGF GQDVVKRILP ELKAHKADTS PFSGANAPKK TRELHWATPD
LVAEIEFAGF TGAGMVRQAS FKGLRADKPA EEVEAEEPQR VEVAEPALKR RTAKASKSGK
PAAKRAAPRR ADAARSEVMG VAISKPDKEL WPASEIGAAI TKRDLADYFA EVGDWLIPHI
KGRPCSIVRA PDGIDGEHFF QRHAMPGMSN LIGLAKVSGD RKPYVQIDRV EGLIAAAQIG
GVELHPWNCA PNRYDVPGRL VFDLDPAPEV GFDEVITAAK EMKLRLETLG LATFCKTTGG
KGLHVVVPLK PDDGIDWKAA KTFAQTVCAQ MAEDSPEQYL LNMSKQKRIG KIFLDYLRND
RMSTAVAVLS PRARTGATVS MPLTWSQVKA GLDPKRYTIK SVPALMKRSK AWADYDDAAA
PLKAAIKTLT SGKT