Gene Msil_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3043 
Symbol 
ID7092720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3356107 
End bp3357414 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content60% 
IMG OID643466353 
Productprotein of unknown function DUF264 
Protein accessionYP_002363315 
Protein GI217979168 
COG category[S] Function unknown 
COG ID[COG5323] Uncharacterized conserved protein 
TIGRFAM ID[TIGR01547] phage terminase, large subunit, PBSX family 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones67 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACAT TCAGCCCGGG CCAGGATGCG GCGCGCCGCT TGCTGGAGGG ACCGCAGCGT 
TACACTTGTC TTGCCGGAGG CACGCGCTCT GGCAAGACTT TTCTGATCAT CCGCGCAATC
ATTATACGCG CTCTCCAGGC TGAAGAGACC CGGCACGCGG TTCTGCGCTT CCATGCCAAT
GCGGCTCGCG CATCCATAGC GCTGGACACG CTGCCGCGCG TCATGCGCCT CTGCTTTCCG
GACGCGACGT TGCGCGAGCG GCGGCAGGAC GGATATTTCG AACTTGGGAA TGGATCGCGA
ATCTGGATCG GCGGCCTCGA CGACAAAGAC CGCGTGGAGA AAATACTCGG ACTGGAATAT
GCGACAATTT TTTTGAATGA GGCGTCACAG ATCCCCTATT CGTCGGCTTT GATCGCTTTC
ACGCGGCTCG CGCAGGTCGC GCCGCGGATT GATCAACGGG CTTTCGTCGA TCTAAACCCT
GTCGGCAAGA CACATTGGAC CAATCAGCTG TTTGGAGAAA AGCGCGACCC GGTGTCGAGA
CGACCGCTGC CAGACCCGGA GAGCTACCGC CGCGCCTTCC TCAACCCGCC CGACAACAAA
GCGAATCTAT CGCGCGAATT TCTGGCAAGT CTCTCTCATT TGCCGGAAAA GCAACGCAAG
CGCTTTCTTG ACGGCGTGTA TGTGGATGAA GTCGACGGCG CGCTTTGGAC CTATGCCGGA
ATCGATGCAG GACGATGCGC GGCTGAGCGC ATATCCGTGG ATAAAAGAGC TGCGGTCGTT
GTCGCTGTGG ACCCATCGGG AGCGGCGGGC CGGGACGATC TTGGAGCCGA TGAGATCGGC
ATAATCGTCG CCGCCAGGGG CGTCGATGGC GACGCCTATA TTCTGGAGGA TCTATCGTGC
AGGGATGCGC CAGCCGTTTG GGGCAGGCGG GCAGTGGTGG CCTTCCATAG ATATCAAGCC
GACAGCATCG TCGCGGAAAG CAATTTTGGC GGTGAAATGG TCCGGGCGAC GATACAGGCG
GCGGATCGGA ATGTTCCGGT AAAGCTCGTC ACTGCGAGTC GCGGCAAGGC CGTGCGCGCC
GAACCGATCT CGGTGCGCTA CGCTCAAGGA CAGGTCCATC ATGTCGGTAG ATTTCCCAAG
CTGGAAGACC AGCTCTGCGC CTTTTCAAGC GCCGGCTATA ACGGCGGCGG CAGCCCCGAT
CATGCCGATG CGGCGATCTG GGCGCTGACG CATCTGTTTG GCGCAGACGA CGGGACCGGA
ATCATCGAGT TTTATCGCCG CGAAGCTGAA ATCAAGCGTC GCTCCTGA
 
Protein sequence
MVTFSPGQDA ARRLLEGPQR YTCLAGGTRS GKTFLIIRAI IIRALQAEET RHAVLRFHAN 
AARASIALDT LPRVMRLCFP DATLRERRQD GYFELGNGSR IWIGGLDDKD RVEKILGLEY
ATIFLNEASQ IPYSSALIAF TRLAQVAPRI DQRAFVDLNP VGKTHWTNQL FGEKRDPVSR
RPLPDPESYR RAFLNPPDNK ANLSREFLAS LSHLPEKQRK RFLDGVYVDE VDGALWTYAG
IDAGRCAAER ISVDKRAAVV VAVDPSGAAG RDDLGADEIG IIVAARGVDG DAYILEDLSC
RDAPAVWGRR AVVAFHRYQA DSIVAESNFG GEMVRATIQA ADRNVPVKLV TASRGKAVRA
EPISVRYAQG QVHHVGRFPK LEDQLCAFSS AGYNGGGSPD HADAAIWALT HLFGADDGTG
IIEFYRREAE IKRRS