Gene Information |
Locus tag | Hoch_3330 |
Symbol | |
ID | 8545718 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 4605709 |
End bp | 4608399 |
Gene Length | 2691 bp |
Protein Length | 896 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 646387997 |
Product | DNA ligase D |
Protein accession | YP_003267725 |
Protein GI | 262196516 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1793] ATP-dependent DNA ligase |
TIGRFAM ID | [TIGR02776] DNA ligase D [TIGR02777] DNA ligase D, 3'-phosphoesterase domain [TIGR02778] DNA polymerase LigD, polymerase domain [TIGR02779] DNA polymerase LigD, ligase domain |
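The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both are common conventions, but the record itself does not state them. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, not counting the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(4605709, 4608399)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2691 896
```

Under those assumptions the result matches the Gene Length (2691 bp) and Protein Length (896 aa) fields.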
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.580871 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.138663 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCGGGA AGCGACACGG CGGGCGGGGC CTCGAGACCT ATCGCGCCAA GCGCGACCCC GCGGCCACGC CGGAGCCGTT CGGCGGGCGG GTCGGTGGTC CCGAGTGGCG CGCGGCGGCT GGCCGAGCGG CGCCCTTCGT GGTCCACAAG CACGCGGCGC GGCGTCTGCA CTACGATCTG CGCCTGGCCA TGGACGGCGT GCTGCTGTCC TGGGCGGTGC CCAGAGGGCC CTCGCTGTCG TCGTCGGACA GGCGTCTGGC CGTCCAGGTC GAGGACCATC CGCTCGACTA CGCCTGGTTC GAGGGCGTCA TCCCAGCCGG GGAATACGGC GCCGGCGCCA GCATCGTGTG GGACGCGGGC TCGTGGCTGC CGAGCGCGGA TCCGCGCCGC GGGCTCGAGG ACGGCGCGCT CGAGTTCGAG CTCTTTGGCT ACAAGCTGCG CGGCGGCTTC CGCCTGGTGC GCACCGGCAA ACGCGGCCAG GGCGCGGGCA AGGAGTGGCT GCTGCTCAAG CGCCGCGACG GCTTCGCCGA TGACGACGCG GTCCTGGTCG AGAGCTCGGT GCTGTCGGGG CTCACGGTCG AGGAGCTGGG CGCCGGGCCG ACCCGGGCCC AGCAGGCCGC GGCCGCGCTC GACGCGCTCG CGGCGCCGGC GCGGCGGCTG CGCGTGGACC AGGTGGCGCC GATGCTGTGC GAGCTGGCCG ACGGGCCGTT CTCGGATCCC GACTGGGTGT ACGAACTCAA ATACGACGGC TACCGCATGC TGGCCGGCAT CGCCGATGGC GAGGTCAGCC TGCGCTTGCG CAGCGGCCGC GACGCCACCG CGCTGTTCCC CGAAATCGTC CGCGCGCTGC GCCGCTGGCC GCTGGCCGAC GCGGTGCTCG ACGGCGAGGT GGTGGCCCTC GACGAGGCCG GGCGGCCGGT GTTTCAGCGC CTGCAGCCGC GCAATCGCCC CGCGGGCGCG GACGAGATCG AGCGCGCGGC CGCGCTGGCG CCGGTGAGCT ACGCGGTCTT CGACCTGCTC GCCTGCGAGG GCCGCGATCT GCGCGCGCTG CCGCTGCTGG CGCGCAAGCA GGTGCTGGCG CCGCTGGTGC CCGCGCGCGG TCCGGTGCGC TACGCCGACC ACGTCGAGGC CCAGGGCGAG GCGTTTTTCG ACCAGGTGGT CGCCCACGGC CTCGAGGGCG TGGTGGCCAA GCGCGCGTCC TCGAGCTACC AGGGCCGGCG CAGCGACGAG TGGCGCAAGA TCCGGCAGAT TCGCCACGGC CGCTTCGTCA TCGTCGGCAC CACCTCGCCC AGGGGCGGGC GCGCGGGGTT TGGCTCGCTG CACCTGGCCG CCTGGGACCA CGGCTGGGTG TACGTGGGCC GCGTGGGCAC GGGCTTCGAC GCGCGCGCGC TGCGCGAGAT CGCGGCCCGA CTCGAGGCCC TGCCGCGGTG GCGGCCGAGC TTCCCGGCGC CGTCGTCGCC CGGCCGGCGC GACCGCTGGC TCGCGCCGAC GCTGGTGTGC GAGGTCGGCT ATCAGGACGT GTCCGACGAT GGCCGCCTGC GGCTGCCGCG CTTCGTCGGG CTGTGCCCCG AGGTCGCGCC CGAGGACTGT CCGCCGCCGC GCTCGGCGCG CGCGCTGGCC GACGCTCGCG CCTCCGGGTT CGCTACCGCG ACCGCCGACG CCGACGCCGA CGCCGACGCC GAGGATACAG ACGCGGTCCG CCAGGCTGCC GAGAACCCGG CCGGGGCGCG CGAGCCGGCC GCCAGCGGCG CGGACCAAGA CGGCATCGAC GACCGGGCGC CGCGGGGGCA GGTGAGCAAC 
CCGGACAAGC TCTACTTTCC CGCCGGCGCG GGCCGGCGCG CGCACAGCAA GGGCGCGTTG GTCGCCTACT ACCGGGCCGT CGCGCCGTGG ATGCTGCCGT ACTTGCGCGA TCGCCCGCTA GCCCTGGTGC GCTTCCCCGA GGGCATCGAG GGGCCGTCGT TCTTCCAAAA GGACGCGCCC GCCTGGGTGC CGGCGTGGCT GCGCACCGAG ACGCTGTGGA GCCCGCAGGC CCAGCGCACG CTGCGCTACA TCATCTGCGA CAGCGAGGAC GCGCTGGCCT TTGTGGCCAA TCTCGGCGCC ATCGCCGTGC ACACCTGGTC GGCGCGCATC GGCGCGCTGG GCCGCCCCGA CTGGGCCATC CTGGATCTCG ACCCCAAGGG CGCGCCCTTC GTCCACGTGG TCGAAATCGC CCGGGCGTTG CGCGCGCTGT GCCGCGCGCT CGACCTGGCG TGCTTCGTCA AGACCAGCGG CGCCACCGGG CTGCACGTGC TGATCCCGCT CGGCGGCAGT TGCACGCACG AGCAAGCGCG CACCCTGGCC CAGCTCCTGG CCCAGCTCGT GTGCGCCGAG CATCCCGCGA TCGCGACCAC GGCCCGGCCG CTGGCCGCGC GCGGCGGCCG CGTGTACATC GACTGTGTGC AGAACGGCCA GGGCCGGCTG CTGGTGGCGC CGCTGAGCGT GCGCGCGCGG CCCGGGGCGC CGGTGTCGAT GCCGCTGCGC TGGCGCGAGC TCTCGCCCAA GCTCGACCCC GCGCGCTTCA CCATCGACAA GGCCGTCGCC CGGCTGCGGC GCATGGACCC GGAGCCCTTC GCCGGTCTGC TCGACACCCG ACCCGACCTG GCGCGCGCGC TGGCGCTGCT CGAGGAGCGG CTCGGGCTCA GCCGCGGTTG A
|
Protein sequence | MSGKRHGGRG LETYRAKRDP AATPEPFGGR VGGPEWRAAA GRAAPFVVHK HAARRLHYDL RLAMDGVLLS WAVPRGPSLS SSDRRLAVQV EDHPLDYAWF EGVIPAGEYG AGASIVWDAG SWLPSADPRR GLEDGALEFE LFGYKLRGGF RLVRTGKRGQ GAGKEWLLLK RRDGFADDDA VLVESSVLSG LTVEELGAGP TRAQQAAAAL DALAAPARRL RVDQVAPMLC ELADGPFSDP DWVYELKYDG YRMLAGIADG EVSLRLRSGR DATALFPEIV RALRRWPLAD AVLDGEVVAL DEAGRPVFQR LQPRNRPAGA DEIERAAALA PVSYAVFDLL ACEGRDLRAL PLLARKQVLA PLVPARGPVR YADHVEAQGE AFFDQVVAHG LEGVVAKRAS SSYQGRRSDE WRKIRQIRHG RFVIVGTTSP RGGRAGFGSL HLAAWDHGWV YVGRVGTGFD ARALREIAAR LEALPRWRPS FPAPSSPGRR DRWLAPTLVC EVGYQDVSDD GRLRLPRFVG LCPEVAPEDC PPPRSARALA DARASGFATA TADADADADA EDTDAVRQAA ENPAGAREPA ASGADQDGID DRAPRGQVSN PDKLYFPAGA GRRAHSKGAL VAYYRAVAPW MLPYLRDRPL ALVRFPEGIE GPSFFQKDAP AWVPAWLRTE TLWSPQAQRT LRYIICDSED ALAFVANLGA IAVHTWSARI GALGRPDWAI LDLDPKGAPF VHVVEIARAL RALCRALDLA CFVKTSGATG LHVLIPLGGS CTHEQARTLA QLLAQLVCAE HPAIATTARP LAARGGRVYI DCVQNGQGRL LVAPLSVRAR PGAPVSMPLR WRELSPKLDP ARFTIDKAVA RLRRMDPEPF AGLLDTRPDL ARALALLEER LGLSRG
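The sequences are displayed in space-separated blocks of 10 characters, so the spacing has to be stripped before any computation. The sketch below shows one way to do that and to compute GC content; the helper names are hypothetical, and only the first three blocks of the gene sequence are used to keep the example short.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence, copied as displayed.
blocks = "ATGTCCGGGA AGCGACACGG CGGGCGGGGC"
seq = clean_sequence(blocks)
print(len(seq), round(gc_content(seq), 3))
```

Running the same two functions on the full gene sequence should give a value comparable to the GC content field of the record.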