Gene Msil_2339 details


Gene Information

Locus tag: Msil_2339
Symbol:
ID: 7090323
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 2541314
End bp: 2542465
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 61%
IMG OID: 643465661
Product: domain of unknown function DUF1745
Protein accession: YP_002362631
Protein GI: 217978484
COG category: [S] Function unknown
COG ID: [COG3287] Uncharacterized conserved protein
TIGRFAM ID:
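
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
start_bp, end_bp = 2541314, 2542465
length_bp = gene_length(start_bp, end_bp)
print(length_bp, protein_length(length_bp))  # 1152 383
```

Under these assumptions the result agrees with the listed Gene Length (1152 bp) and Protein Length (383 aa).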


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value: 0.25177
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGGC GCGGGGCGAT ACGACGCGGA TTTTCGACGG CGCCCGATCC TGTCACGGCG 
GTGCGCGAAT TTCACGACGC CATCGCCCAG CCCGATATTG GCCTGGTCGT GTTCTTCTGC
TCCTACAGCT TCGACCTCAA TGTGCTGGAG CCGGAGCTGC GGCGCATGTT CGCCGGCGTT
CAGGTAATCG GCTGCACCAC GGCCGGAGAA ATCACCCCGA TCGGCTATCT TGACGGGTCG
ATCACCGGCT TCAGCATCGC GGCCTCCCAT TGCGTCGCGG CCACCGCCCT GATGCAGAAT
CTGTCCAATT TCCAGATGTC CGACGGGCAC GCCGCGACGC AAAAAGTCGT CTCGGCCATG
GGCGAGAAAG GTTATACGCT CGACCCGCGG GACTCTTTCG CGCTCCTTCT CATCGACGGC
ATGTCCCGCA ATGAGGAGGT CGTGCTCGCC TCGATGCATC TTCTAATGGA TATGACGCCG
CTCGTCGGCG GCTCCGCGGC GGATAATCTC TGCCTCAACG GCGCCTTCGT CTATTGCGAC
GGCGCCTTCC ACAGCGACGC GGCCTTGCTC GCGGCGATCC GCATCAAAGC GCCGTTCCGA
ATCCTGAAAT GCCAGCATCT CGTCGGCTCC GACGAACGAA TGGTGGTGAC GCGCGCCGAT
CCGCACAGCC GCAAAGTGTT CGAGCTCAAC GGCGAACCGG CGGCGCGAGA ATATGCGCGA
CTGCTCAATC TGCCCGAGCG CGCCCTGACC CCTTCGACCT TCTCCACTTA TCCGCTGATG
GTCAAGATCG GCGCCGATTT TCATGTCCGC TCGATTCAGG CCGCCCATTT CGACGACAGC
CTCACCTTCT TCTGCGCGAT CGACGAAGGC GTCGTATTGC GGCTCGCCAA AAGCGAAGCG
GTGCTGCCCA ATCTCACCGC CTTTTTCGAA GGCGTGAACG AAAGCTTCGG ACAACCCGAG
CTCGTGATCG GCTTCGACTG CATCTATCGC AGCCTCGCGC TGGAAAAGGC CCAGACGAAA
CGCCTTGCCG GCGCATTGCT CGCCGCCAAT CATGTGATCG GCTTCAGCAC TTATGGCGAG
CAGTTCGCCG GCATGCATTT GAACCAGACC TTTACCGCAA TCGCCATCGG CAAGCCTTAT
GACGATCTCT GA
 
Protein sequence
MSGRGAIRRG FSTAPDPVTA VREFHDAIAQ PDIGLVVFFC SYSFDLNVLE PELRRMFAGV 
QVIGCTTAGE ITPIGYLDGS ITGFSIAASH CVAATALMQN LSNFQMSDGH AATQKVVSAM
GEKGYTLDPR DSFALLLIDG MSRNEEVVLA SMHLLMDMTP LVGGSAADNL CLNGAFVYCD
GAFHSDAALL AAIRIKAPFR ILKCQHLVGS DERMVVTRAD PHSRKVFELN GEPAAREYAR
LLNLPERALT PSTFSTYPLM VKIGADFHVR SIQAAHFDDS LTFFCAIDEG VVLRLAKSEA
VLPNLTAFFE GVNESFGQPE LVIGFDCIYR SLALEKAQTK RLAGALLAAN HVIGFSTYGE
QFAGMHLNQT FTAIAIGKPY DDL
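
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the sequences are pasted in the 10-character block layout shown above, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons). The helper names clean, gc_content and translate are hypothetical, chosen for this example only.

```python
from itertools import product

# Codon table in the conventional TCAG order; amino-acid assignments are
# those shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGAGCGGGC GCGGGGCGAT ACGACGCGGA"
print(translate(first_blocks))  # MSGRGAIRRG
```

Passing the full gene sequence to translate should reproduce the protein sequence listed here, and gc_content on the same input can be compared with the GC content field; neither result is asserted in this sketch.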