Gene Msil_1736 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1736 
SymbolligD 
ID7090848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp1886469 
End bp1889135 
Gene Length2667 bp 
Protein Length888 aa 
Translation table11 
GC content66% 
IMG OID643465059 
ProductATP-dependent DNA ligase 
Protein accessionYP_002362044 
Protein GI217977897 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase
[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02777] DNA ligase D, 3'-phosphoesterase domain
[TIGR02778] DNA polymerase LigD, polymerase domain
[TIGR02779] DNA polymerase LigD, ligase domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones40 
Fosmid unclonability p-value0.946456 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGACA CCAAGCTCAA AACCTATCGC GCCAAGCGCG ACTTTGCTCA AACGGCGGAA 
CCGAGCGGAG AGGCTCCGAT CGCGGCCGGA CCGCGTCGGC GCTTCGTCAT CCAGAAACAT
GCCGCAACCC GGCTCCATTA CGATCTTCGA CTGGAGCTCG ACGGAGTCTT CAAATCATGG
GCTGTCACCA AGGGCCCCTC GCTCGATCCG CATGACAAGC GCCTTGCAGT TGAGGTCGAG
GATCATCCGC TCGACTATGG CGATTTCGAA GGCGTGATCC CCAAAGGCCA ATATGGGGGC
GGCACGGTCC AGCTGTGGGA TCGCGGGTTC TGGGCGCCGG AGGGCGACAA GACGCCTGAA
CAGGCGCTGG CCGACGGCGA TCTCAAATTC ACGCTCGACG GCCAAAGGCT GTATGGCAGT
TGGGTGCTCG TGCGCATGAA GGCCGACCGC ACGGGCGGCA AACGAACCAA TTGGCTGCTC
ATCAAACATC GCGACGGCTA CGCGCGGGAT GGCGACGCCG ACGCTCTGCT GGCGGAGGAC
CGCTCCGTCG CTTCGGGCCG CGCGATGGCG GCGATCGCGG CCGGCAAGGG CAAGGGGCCA
AAGCCTTTCA TGCTCGCGGG CGAGCAAGCC GCCGATCCGA AAGCGGTGTG GGACTCGAAC
AAGGGGCTCG CCGCCGAGGC GCGCGCGGCG CCCAAGGCCA CGCGGAAAAA ATCTGGCGCG
GCTTTGGCGC AAATGCCGGA TTTCCTGCCG CCGCAGCTTT GCCAGCCGGT CGAGCGGCCG
CCTTCGGGCG ACGGTTGGGT TCATGAAATC AAATTCGACG GCTATCGCAT GCAGCTTCGC
GTCGCCGGCG GCAAAGCGAC GCTGAAAACG CGCAAGGGGC TGGACTGGAC CGATAAATTC
GCGGCGATCG CTGCGGAGGC GGCGGATTTC CCCGACGCTA TCATCGATGG CGAGATCGTC
GCGCTGGATA GCTCGGGCTC GCCCGATTTC GTGGCGCTGC AGGCGGCGCT TTCAGAACAG
AATACCGATG ATCTGATTTT CTACGCCTTC GACCTGATGT TCGAGGGCGG GAGAGATTTG
CGGCTCGAGC CGCTCGCCGC GCGCAAGCAG GCGTTGGCGC GTCTCATAGC CGGCGCGCGG
CTTGGCGCGG GCGCGCTGAT CCGTTTCGTC GAGCACTTCG AGACCGGCGG CGACGCGATT
TTGCAATCCG CCTGCCGCCT CAATCTGGAA GGCATCGTTT CAAAGAAACG CGACGCGCCC
TACCAGCCCG GCCGCTCCGA CAGCTGGACC AGGGCGAAGT GCCGCGCCGG CCACGAGGTG
GTGATCGGCG GATGGACGAC CACGGAGGGG AAATTCCGCT CCCTGTTGGC GGGCGTCCAT
CACGGCGAGA ATTTCACCTA TATCGGCCGC ATCGGAACGG GGTTCGGCGA AGCCAAGGTC
AAAACCCTGC TGCCGAAGCT GAAGCAGTTC GCGGCGGAGA CATCGCCCTT CACCGGGCCG
AGCGCGCCAC GCAAAACCGC TTCGATCCAT TGGCTGAAGC CGGAGCTCGT CGCCGAGATT
GAGTTTGCGG GGTTTACAGG CGCCGGCATG GTGCGGCAGG CGGCCTTCAA AGGGCTGCGC
GAAGACAAGC CGGCCGAAGA GGTCGAAGCC GAGACGCCGG CCCCGCCGGA GCAAGCTGCT
GTTCCCGATC CCGCAGAGAT TCAGGCGAGC GCGCGTTCAT CCTCGGATAA GCCCATGGCG
ACGGCCAACG GCAAACCCAT CGTCATGGGC GTCGCCATCT CCAACCCGGC GAAGGAGTTG
TGGCCGGCTG ACGGCGCCGA AGCTCCGGTC TCGAAACTCG ATCTGGCGCG CTATTACGAA
GCGGCCGGTC CCTGGCTCAT AGAGCATGTG CGCGGCAGGC CTTGCTCGCT CATCCGCGCG
CCCGACGGGA TCACTGGCCA GCAATTCTTT CAGCGCCACG CCATGGCCGG AGCCTCGAAC
CTGCTCGATC TTGTGACCGT GTCGGGCGAT CGCGCGCCCT ATCTGCAGAT CGACCGTGTT
GAGGGATTGG CGGCGGTGGC GCAAATCGCT GGGCTTGAGC TGCATCCGTG GAACTGCGCG
CCGGGGCGCC CGGAGACGCC CGGGCGGCTG ATATTCGATC TCGACCCCGG CCCCGACGTC
GCCTTCGAGC AGGTGGCAGC CGCCGCTCTT GAGATGCGCG ACAGGCTCGA CGCGCTGGGG
CTCGTCAGTT TCTGCAAGAC GACGGGCGGC AAGGGGCTGC ATGTCGTCAC GCCGCTGGCG
GTCGCCAAGG GCTCCAAACT CACCTGGCCC GAGGCGAAGG GGTTCGCTCA GGAGGTCTGC
CGGCGCATGG CCGCGGACAA TTCGAACGCC TATCTCCTCA ACATGTCCAA AAAGCTTCGC
GCGGGGCGTA TTTTCCTCGA CTACCTGCGC AATGACCGCA TGTCGACGGC CGTCGCGCCG
CTGTCGCCAC GCGCGCGTCC GGGCGCCACG GTTTCGATGC CGCTGAACTG GAGCGAGGCG
ACCAACAGCC TCGATCCGAA AGCCTTTACG ATCCGCACGT CCGTTGGGCT TCTCGACAAG
AGCAAAGCCT GGGCCGACTA TGATTCGGGC GCGCGTCCGC TGGAGGCGGC GATCAAACGC
CTCGGCCGCG CCAAGGCGGC CGCGTGA
 
Protein sequence
MADTKLKTYR AKRDFAQTAE PSGEAPIAAG PRRRFVIQKH AATRLHYDLR LELDGVFKSW 
AVTKGPSLDP HDKRLAVEVE DHPLDYGDFE GVIPKGQYGG GTVQLWDRGF WAPEGDKTPE
QALADGDLKF TLDGQRLYGS WVLVRMKADR TGGKRTNWLL IKHRDGYARD GDADALLAED
RSVASGRAMA AIAAGKGKGP KPFMLAGEQA ADPKAVWDSN KGLAAEARAA PKATRKKSGA
ALAQMPDFLP PQLCQPVERP PSGDGWVHEI KFDGYRMQLR VAGGKATLKT RKGLDWTDKF
AAIAAEAADF PDAIIDGEIV ALDSSGSPDF VALQAALSEQ NTDDLIFYAF DLMFEGGRDL
RLEPLAARKQ ALARLIAGAR LGAGALIRFV EHFETGGDAI LQSACRLNLE GIVSKKRDAP
YQPGRSDSWT RAKCRAGHEV VIGGWTTTEG KFRSLLAGVH HGENFTYIGR IGTGFGEAKV
KTLLPKLKQF AAETSPFTGP SAPRKTASIH WLKPELVAEI EFAGFTGAGM VRQAAFKGLR
EDKPAEEVEA ETPAPPEQAA VPDPAEIQAS ARSSSDKPMA TANGKPIVMG VAISNPAKEL
WPADGAEAPV SKLDLARYYE AAGPWLIEHV RGRPCSLIRA PDGITGQQFF QRHAMAGASN
LLDLVTVSGD RAPYLQIDRV EGLAAVAQIA GLELHPWNCA PGRPETPGRL IFDLDPGPDV
AFEQVAAAAL EMRDRLDALG LVSFCKTTGG KGLHVVTPLA VAKGSKLTWP EAKGFAQEVC
RRMAADNSNA YLLNMSKKLR AGRIFLDYLR NDRMSTAVAP LSPRARPGAT VSMPLNWSEA
TNSLDPKAFT IRTSVGLLDK SKAWADYDSG ARPLEAAIKR LGRAKAAA