Gene Caul_1769 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1769 
SymbolligD 
ID5899224 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1868191 
End bp1870947 
Gene Length2757 bp 
Protein Length918 aa 
Translation table11 
GC content68% 
IMG OID641562259 
ProductATP-dependent DNA ligase 
Protein accessionYP_001683396 
Protein GI167645733 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase
[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02777] DNA ligase D, 3'-phosphoesterase domain
[TIGR02778] DNA polymerase LigD, polymerase domain
[TIGR02779] DNA polymerase LigD, ligase domain 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.242945 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCAGG GACAACTCGC CGAATATCGC CGCAAGCGCG ATTTCCAGAA GACCGCCGAG 
CCGAGCGGCG ACGCCGCGGT CGCATCCGCG CCGCACGCCC GCTTCGTGAT CCAGAAGCAC
GACGCCACGC GGCTGCACTA CGACTTTCGC CTGGAGGTCG ACGGGGTGCT GAAGTCGTGG
GCGGTGACCA AGGGGCCGTC CCTGGATCCC GCCGACAAGC GGCTGTCGGT GGAGGTCGAG
GACCATCCGC TGGACTATGG CGACTTCGAG GGCACGATCC CCAAGGGCCA GTACGGCGGC
GGGACGGTGC AGCTGTGGGA CCGCGGCTAT TGGGCGCCCG AACCCGGCTT CGAGGATGTG
GCCAAGGCGC TGAAGAAGGG CGAGCTGAAG TTCGTGCTGG AGGGCGAGCG GCTGCATGGC
TCGTGGGTGC TGGTGCGGAT GAACTGGGAC CGCAACGCCA AGGATGGCCC CAAGCGACAA
GTAGGGGGCG GGCGCTCCAA CTGGCTGCTG ATCAAGCACA ATGACGAGGC CGCGCGGCCG
GGCGAAGGGG CGGCCGTGCT GGAGGAGGAC GCCTCGATCG CCTCGGGCCG GACCATGTCC
GACATCGCCG CCGGCAAGGG CAAGGGACCG TCGGCGTTCA TCCTGAAGAC CAAGGGCAAG
GGCGACGCCG TCTGGACCTC GCGGACCAAG GCCGAGCGCG AGGCGCTGCA GAAGGAGGCG
CAGGACACGC GGGCGGTCGA GCCCACCCCA TCCCGTCTTC CCGGCGAAGG CCAGGACCCA
GATTCATCCG GAGAGTCGGC AGGGCGTCAC CTGGGTCCCG GCCTTCGCCG GGGAGACGGA
GACAAGGCGA CCAAGAAAGC CAAGGCCAAG ACCGCTGTCC TGCCCGACTT CATCGAACCT
CAACTCTGCA AGTCTCTGGA TCGCCCGCCG TCCGGACCCG GCTGGGCGCA TGAGATTAAG
TTCGACGGCT ATCGCGTGCA GTTGCGCGTC GAGGGCGGTC GGGCGACCCT GCGCACCCGC
AAGGGTCTGG ACTGGACAGA GAAGTTCGCC GCCATCGCGC GGGCCGCCGC GGACCTGCCC
GACAGCATTA TAGACGGCGA GGTCGTGGCC CTGGACGCCG CCGGCCAGCC CGACTTCGCC
GGCCTGCAGG CGGCGCTGTC GGACGGCAAG ACCGGCGACC TGATCTTCTT CGCTTTCGAC
CTGCTGGCGG AGGGCGGCGA GGACCTGCGC GCCCTGCCGC TGCACGAACG CAAGGCGCGG
CTGAAGGCGA TGATGGGCGA GGACGAGCCG CGCCTGCGTT TCGTCGACCA CTTCGAGACG
GCCGGGGACG CGGTGCTGCT GTCGGCCTGC AAGCTGGAAC TGGAAGGCAT CATCTCCAAA
CGGCTGGACG CCCCATACCG ATCGGGGCGC AGCGAGACCT GGACCAAGGC CAAGTGCCGG
GCCGGCCATG ACGTGGTGAT CGGTGGCTAT ACGACGACCG GCTCGGCGTT CCGCTCGCTG
ATCGCCGGCG TCATGCGCGA CGGAAAGTTG ACCCATGTGG GCCGAATCGG CACCGGCTTT
GGCCGCGACA AGGTGGGCAA GCTGCTGCCG AGGCTGAAGG CCTTGGAGAC CGACACCTCG
CCGTTCGAAG GCAAGGGCGC GCCGCGCAAG GCCGACGACA TCCACTGGGT GAAGCCGGAA
CTGGTGGCCG AGATCGAATA TGCCGGTTTC ACGGGGGACG GCTCGATCCG CCAGGCCTCG
TTCAAGGGCC TGCGCGAGGA CATTCCAGCC AGCGAGGTCC AGGCCGAGGT TCCGGCCAAG
GCGGAAACCG CCGAGCTGGC GACGCCCACC CCCAAGGCCG CGCCCAAGTC CCAGACGATC
GCCACCGCCA AGACCGATAG CATCGTGCTG GGCGTGACGA TCTCCAAGCC CGACAAGCCG
CTGTGGCCGG ACGTCGACGG CGCGCCGGGG ACCAAGATCG ACCTGGCCCG CTACATGGCG
GCGATCGGCG ACTGGATGCT GCCGCACGTG AAGGGCCGGC CGGCCTCGAT CATCCGGGTT
CCCGACGGCA TCGGCGGCGA GACCTTCTTC CAACGCCACG CCATGCGCGG GATGTCGTCG
CTGATCGACC TGGTTTCGGT CAAGGGCGAC AAGCAGCCCT ACATCCAGTT CAACCGGGTC
GAGGCGCTGA TCGCCGCCGC CCAGATCGCC GCCGTCGAGA TCCATCCCTG GAATTGCCAG
CCGGGCGATC CGGAGGTGGC CGGCCGCCTG GTGTTCGACC TGGATCCCGC GCCGGGCGTC
ACCTTCGAGG ACGTGATCGC CGGCGCGCGC GAGATCCGCG ACCGGCTGGA GGAACTGGGC
CTGGTCAGTT TCTGCAAGAC CACCGGCGGC AAGGGCCTGC ACGTGGTCAC GCCCTTGTCC
GATAAAGTAG AGTGGGCTGT AGCAAAGACC TTCGCCCGCG AGGTCTGCGC CCGCATGGCC
GCCGACGCGC CGGACAAGTA CCTGATCACC ATGAGCAAGA AGGCGCGCGA GGGCCGCATC
TTCCTCGACT ACCTGCGCAA CGACCGCACC TCGACCGCCG TGGCGCCGCT TTCGGCCCGC
GCCAGGCCGG GGGCGACGGT GTCGATGCCG CTGAACTGGA CCCAGGTAAA GGCGGGGCTG
GACCCGACGA AGTACACCAT CCGCACCGTC CCGGACCTGA TCGCCAAGAG CAGGGCCTGG
GAGGACTATT TCGACGCCGC CAAGCCGCTG AAGGCGGCGA TCAAACGGCT GGGGTGA
 
Protein sequence
MAQGQLAEYR RKRDFQKTAE PSGDAAVASA PHARFVIQKH DATRLHYDFR LEVDGVLKSW 
AVTKGPSLDP ADKRLSVEVE DHPLDYGDFE GTIPKGQYGG GTVQLWDRGY WAPEPGFEDV
AKALKKGELK FVLEGERLHG SWVLVRMNWD RNAKDGPKRQ VGGGRSNWLL IKHNDEAARP
GEGAAVLEED ASIASGRTMS DIAAGKGKGP SAFILKTKGK GDAVWTSRTK AEREALQKEA
QDTRAVEPTP SRLPGEGQDP DSSGESAGRH LGPGLRRGDG DKATKKAKAK TAVLPDFIEP
QLCKSLDRPP SGPGWAHEIK FDGYRVQLRV EGGRATLRTR KGLDWTEKFA AIARAAADLP
DSIIDGEVVA LDAAGQPDFA GLQAALSDGK TGDLIFFAFD LLAEGGEDLR ALPLHERKAR
LKAMMGEDEP RLRFVDHFET AGDAVLLSAC KLELEGIISK RLDAPYRSGR SETWTKAKCR
AGHDVVIGGY TTTGSAFRSL IAGVMRDGKL THVGRIGTGF GRDKVGKLLP RLKALETDTS
PFEGKGAPRK ADDIHWVKPE LVAEIEYAGF TGDGSIRQAS FKGLREDIPA SEVQAEVPAK
AETAELATPT PKAAPKSQTI ATAKTDSIVL GVTISKPDKP LWPDVDGAPG TKIDLARYMA
AIGDWMLPHV KGRPASIIRV PDGIGGETFF QRHAMRGMSS LIDLVSVKGD KQPYIQFNRV
EALIAAAQIA AVEIHPWNCQ PGDPEVAGRL VFDLDPAPGV TFEDVIAGAR EIRDRLEELG
LVSFCKTTGG KGLHVVTPLS DKVEWAVAKT FAREVCARMA ADAPDKYLIT MSKKAREGRI
FLDYLRNDRT STAVAPLSAR ARPGATVSMP LNWTQVKAGL DPTKYTIRTV PDLIAKSRAW
EDYFDAAKPL KAAIKRLG