Gene Msil_3103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3103 
Symbol 
ID7092781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3407813 
End bp3408823 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content64% 
IMG OID643466413 
ProductDNA polymerase LigD, ligase domain protein 
Protein accessionYP_002363374 
Protein GI217979227 
COG category[L] Replication, recombination and repair 
COG ID[COG1793] ATP-dependent DNA ligase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02779] DNA polymerase LigD, ligase domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGT CCCCGCCAGC AAAGCCGCGC TCGATCAGCT GCAAGACAGA TGTCGTCGCG 
GATGGGTTGA GGAGTCGGCC GCTTGTCTCG GCTGATCCGA CAGTCCCCCA GCTCTTCGAT
GCGCCGCTGC CGGGCTGGAT CGCGCCCTGT TTGCCGACGC TCGTCCCCAA GCCGCCGGCC
GGCGAAGAAT GGGTTCACGA AATCAAATGG GACGGCTATC GGGTTTCAGC TTACGTCGAG
CCAGGCGCCG TCACGATCCG CACGCGCAAC GGCTATGATT GGACGGCGAG ATTTCCGACG
ATCGCCGCTG CGCTCGGCAA GCTGAAGGTG CGGTCGCTGG TTATCGACGG CGAGGCGAGC
GTGCTCGACG AGAAAGGCCG TTCGAGCTTC GCCGAGCTGC AAGCCGACCT TGCAACGGGC
GGCGCGCAAC GAGCCGTGCT TTACGCCTTC GATCTGCTTT TCCTCGATGG GGAAGACTGG
CGCCAGCGGC CGCTAGACGA GCGGCGCGGG GCCTTGGCCG GCCTGATCAA GAAAAAGCCG
CCGCTGCTTC TCAGCCAGGA ATATGCCGGA ACTGGCGTCG ATTTTTTCAA GGTCGCTTGC
GAGCATGAGC TCGAAGGGAT CGTCTCGAAG CGCCTCGACA AGCCTTACCG ATCCGGCCGC
AGCAAGGATT GGCTGAAGAC CAAATGCGTG CAGAGCGGGG AATTTGTCGT GATCGGCTAT
CAGCCCTCGT CCGGCGCGGT CCGGGCGCCC CTGGCCAATA TCAAGGTCGC GCGATGGGAA
GAAGGCGCGC TGCGCTACGC GGGAGCAGTG GGAACAGGCT TCAGCGAGCG CGTCGCCAGG
ATGCTGCGCG ACAGGCTTGA CGGCCTCAGG ACGCCGCGCT GTGCGATCCC AAGGCTCAAG
GTTGGGGGCG CAGTTTGGAC GAAGCCCGAT CTCATCGTTG AGATTGATTA TCGCGGCCTC
ACTGCGGACG GCGAGCTTCG CCATGCGAGC TTTCGCGGGA TCGCAGAATG A
 
Protein sequence
MAKSPPAKPR SISCKTDVVA DGLRSRPLVS ADPTVPQLFD APLPGWIAPC LPTLVPKPPA 
GEEWVHEIKW DGYRVSAYVE PGAVTIRTRN GYDWTARFPT IAAALGKLKV RSLVIDGEAS
VLDEKGRSSF AELQADLATG GAQRAVLYAF DLLFLDGEDW RQRPLDERRG ALAGLIKKKP
PLLLSQEYAG TGVDFFKVAC EHELEGIVSK RLDKPYRSGR SKDWLKTKCV QSGEFVVIGY
QPSSGAVRAP LANIKVARWE EGALRYAGAV GTGFSERVAR MLRDRLDGLR TPRCAIPRLK
VGGAVWTKPD LIVEIDYRGL TADGELRHAS FRGIAE