Gene RPD_3490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3490 
SymbolligD 
ID4024004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp3880200 
End bp3882992 
Gene Length2793 bp 
Protein Length930 aa 
Translation table11 
GC content66% 
IMG OID637963694 
ProductATP-dependent DNA ligase 
Protein accessionYP_570614 
Protein GI91977955 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase
[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02777] DNA ligase D, 3'-phosphoesterase domain
[TIGR02778] DNA polymerase LigD, polymerase domain
[TIGR02779] DNA polymerase LigD, ligase domain 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.505494 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGCAG GCAAGACTCT CACGCTGTAT CGCAACAAGC GCGACTTCGA GCAGACCGCC 
GAGCCGCGCG GCGATGGCAA GGTCGCGCCG TCGAAGCGGC GGCGTTTCGT GATCCAGAAG
CACGACGCCA CGCGGCTGCA CTACGACCTG CGGCTGGAAT ATAACGGCGT GTTCAAGTCC
TGGGCGGTGA CGCGCGGGCC GTCGCTCGAT CCGCACGACA AGCGGCTCGC GGTCGAGGTC
GAGGACCATC CGCTCGACTA CGGCGACTTC GAAGGCACGA TTCCCAAGGG CCAATATGGC
GGCGGCACCG TGCAGCTCTG GGATCGCGGC TATTGGGAGT GCGACGATCC GGAGCGCGGT
TTCAAGAAGG GCGACCTCAA ATTCTCGCTG GACGGCGAGA AGTTGCAGGG AAGCTGGGTG
CTGGTGCGGA TGCGCCATGA TCGCAACGGC GGCAAGCGAA CCAACTGGTT GCTGATCAAG
CATCGCGACG ACGACGCCCG CGAGGGAAAG GCCAACGATA TTCTCGACCA GGACCGCTCG
GTGGCGTCCG GCCGCGCGAT GACAGAGATC GCACAGGGGA AAGGCCGCGC GCCAAAACCG
TTCATGACCG GCAAGGCGGC GCGCGTGAAG GCCGACGCGG TATGGAATTC GAAGGAGGGG
CTGGCCGCAG ACGAGCGTGC GGCCAATGGC GCGGGCGGGG CGGTCGCGTC GAAGCCGGCG
CCGCCGAACA AGGCGAAGGC GACCACCGCA AAGTCCAAGA CGGCAAAGTC CAAGACGGCC
AAGGTCGCGA AGCCGAAGCC GGGGCGCAAG GCTGCGGCGA AGCGCGCGGC GATGCCGGAT
TTCGTTCCGC CGCAGCTCTG CACCTCGGTC GAGCGGCCGC CCGGCGGCGA TGGCTGGGGT
CACGAGATCA AATTCGACGG CTACCGGATG CAGCTACGCG TCGACGCCGG CGAGGCGACG
CTGAAGACCC GCAAGGGATT GGACTGGACC GCCAGATTTC AGGCGATCGC CGACCAGGCG
GCGGGCCTGC CCGATGCGCT GATCGACACC GAGATCGTCG CGCTCGATCA GCACGGCCAC
CCGGATTTCG CGGCGCTGCA GGCGGCGTTG TCGGACGGCG ACACGGACAA TCTGATCTGC
TTTGCGTTCG ACCTGCTCCA TGCCGATGGC GAGGATCTGC GTGCGTTGCC GCTGTCCGAC
CGCAAGCGGC GGCTGCAGGA GTTGCTCACG GCCGCGCGCG GCCGGCGCAA GCAGGGCCTG
ATCCGCTATG TCGAGCATTT CGACACCGGC GGCGACGCGA TCCTGCAATC CGCCTGCAAG
CTGTCGCTCG AAGGCATCGT CTCGAAGAAG CTCGATGCGC CCTATCGATC CGGCCGCAGC
GACAACTGGA CCAAGGCGAA GTGCCGGGCC GGGCACGAGG TGGTGATCGG CGGCTGGAAG
ACCAATGCGG GCAAATTTCG TTCGCTGCTG GTCGGCGTGC ATCACGACGA TCATCTTGCT
TATGTCGGCA TCGTCGGCAC CGGCTTCGGG CAGGATGTGG TGAAGCGACT GCTGCCGGAA
TTGAAAGCCC ACGCCGCCAA GGACAATCCA TTTGCCGGCG AGAACGCGCC GCGCAAGACC
GCCGATGTGC ATTGGCTGGC GCCGGAGCTT GTGGCCGAGA TCGAGTTCGC CGGCTTCACC
GGCGCCGGCA TGGTGCGGCA GGCCGCATTC AAGGGGCTGC GCGCGGACAA GCCGGCCGGG
GAGGTCGAAG CGGAGGAGCC TGCGAAGGTC GATATCGCCG GGCCGAAGCC CGGACGCCGC
GCCAAGCCCT CATCCAACGC CAAGTCGGCC AAGCCAAAGA CCGCAACGCC GAACGCGGGA
CGCAGCAAGA CCCGCGCGAC CCATGCCAGT CGCGCCGAGG TGATGGGCGT GGTGATCTCC
AAGCCGGACA AGGAGCTGTG GCCCGCGAGC CGGATCGGCG ATGCGGTCAA CAAGCGCGAT
CTCGCGCACT ACTACGAAGC GGTCGGCGAT TGGATGATCC CGCATCTCGA GGGGCGGCCG
TGTTCGATCG TCCGCGCGCC CGACGGGATC GGCGGCGAAC ACTTCTTCCA GCGACACGCA
ATGCCGGGGA TGTCGAACCT GATCGCGCTT GCGAAAGTGG CGGGCGACCG CAAGCCTTAC
GTGCAGATCG ACCGGGTCGA AGGATTGATC GCTGCGGCGC AGATCGGCGG CCTTGAACTG
CATCCGTGGA ATTGTGCGCC GGGTCGGTAC GAGGTCCCCG GGCGGCTGGT GTTCGATCTC
GATCCGGCGC CCGAAGTCGG GTTCGACGCG GTGATCGTGG CGGCGCGTGA AATGAAGGAC
CGGCTCGAGG CGATCGGCCT GCCGAGCTTC TGCAAGACCA CCGGCGGCAA GGGGCTGCAC
GTGGTGACGC CGCTGGTCCC CAGCGACGAT ATCGACTGGA AGCAGGCCAA AATCTTCGCG
CAGACGGTGT GCGCCGGGAT GGCCGACGAC AGCCCCGATC GCTATCTGCT CAACATGTCG
AAGCAGCAGA GGAAGGGAAA GATCTTCCTG GACTATCTGC GCAACGACCG GATGTCGACC
GCGGTCGCTG TGCTGTCGCC TCGCGCACGC GAGGGCGCGA CGGTGTCGAT GCCGTTGACG
TGGACGCAGG TGAAAGCCGG GCTCGATCCC AAGCGATACA CGGTCGAGAC AGTTCCAGCG
CTGCTGGCAA AGACCAAAGC CTGGGCGGAC TACGACAAAG CCGCGGCCCC GCTGAAAGCG
GCGATCAAGA AGCTCGCGTT CAGCAGGGCT TGA
 
Protein sequence
MVAGKTLTLY RNKRDFEQTA EPRGDGKVAP SKRRRFVIQK HDATRLHYDL RLEYNGVFKS 
WAVTRGPSLD PHDKRLAVEV EDHPLDYGDF EGTIPKGQYG GGTVQLWDRG YWECDDPERG
FKKGDLKFSL DGEKLQGSWV LVRMRHDRNG GKRTNWLLIK HRDDDAREGK ANDILDQDRS
VASGRAMTEI AQGKGRAPKP FMTGKAARVK ADAVWNSKEG LAADERAANG AGGAVASKPA
PPNKAKATTA KSKTAKSKTA KVAKPKPGRK AAAKRAAMPD FVPPQLCTSV ERPPGGDGWG
HEIKFDGYRM QLRVDAGEAT LKTRKGLDWT ARFQAIADQA AGLPDALIDT EIVALDQHGH
PDFAALQAAL SDGDTDNLIC FAFDLLHADG EDLRALPLSD RKRRLQELLT AARGRRKQGL
IRYVEHFDTG GDAILQSACK LSLEGIVSKK LDAPYRSGRS DNWTKAKCRA GHEVVIGGWK
TNAGKFRSLL VGVHHDDHLA YVGIVGTGFG QDVVKRLLPE LKAHAAKDNP FAGENAPRKT
ADVHWLAPEL VAEIEFAGFT GAGMVRQAAF KGLRADKPAG EVEAEEPAKV DIAGPKPGRR
AKPSSNAKSA KPKTATPNAG RSKTRATHAS RAEVMGVVIS KPDKELWPAS RIGDAVNKRD
LAHYYEAVGD WMIPHLEGRP CSIVRAPDGI GGEHFFQRHA MPGMSNLIAL AKVAGDRKPY
VQIDRVEGLI AAAQIGGLEL HPWNCAPGRY EVPGRLVFDL DPAPEVGFDA VIVAAREMKD
RLEAIGLPSF CKTTGGKGLH VVTPLVPSDD IDWKQAKIFA QTVCAGMADD SPDRYLLNMS
KQQRKGKIFL DYLRNDRMST AVAVLSPRAR EGATVSMPLT WTQVKAGLDP KRYTVETVPA
LLAKTKAWAD YDKAAAPLKA AIKKLAFSRA