Gene Msil_3038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3038 
Symbol 
ID7092715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3351462 
End bp3352751 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content61% 
IMG OID643466348 
Productprotein of unknown function DUF323 
Protein accessionYP_002363310 
Protein GI217979163 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03440] conserved hypothetical protein TIGR03440 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones66 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCGT CCGCCCTATC GTCTCCTGAG CCGCACGCAC GGCCGCCCCT TGCCAGCCTG 
GTTGACGATT TATTCGCCGT TCGGCGTCAA ACGCTTGCGC TTGCGTCGCA TCTCGGGCCG
GAAGACCAAG CGGTCCAAGC AAACGAAGAC GCAAGCCCGA CCAAATGGCA TCTTGCGCAT
ACGACCTGGT TCTTCGAGAC CTTTGTGCTG CAGCCCTTTG TTCCGGCCTA TCGCATCTTC
GACGAGCGCT TCAACTTCTG TTTCAATTCC TATTATGACC ATCAGGGGCC GCGTCAGCCG
CGGGCGCTGC GCGGTCTTCT GACCCGGCCG ACGGCGCCGG AGGTTTTGGC CTATCGCGCC
TATGTCGACG AACAGCTCCA CACGCTCTTT GCCTCGGCGC GGTCGGAGGA TTCGGACCTT
TTGAGGACAG TCGAAATCGG CGTCAATCAT GAGGAACAGC ATCAGGAGCT GCTGCTCACG
GACATTCTTG CGCTGTTTGC GCAAAATCCG CTGCGTCCCG CCTATCGGGA AAGCGCCAAA
CCTTCCGTCC GCCAGGATCT CGACGACATG GTCTTTCTGT CGTTTGAGGG CGGGCTGCGG
TTCGCCGGAC ATGAGGGACA CGGCTTCGCC TTCGACAATG AATCCCCGCA GCATCAGACC
TTCCTGCGGC CCTTCAAACT TTCAAACCGC CTTGTCGCCA ATGGCGAATG GATGGAATTC
ATGGCGGATG GCGGCTACCT TACGCCCACG CTCTGGCTCG CCGATGGATG GGCGAAGGTG
ACGAAGGAAG GCTGGCGCGC GCCGCTCTAT TGGGAGCAGA ACCAGAATCA TTGGGAGGAG
ATGACGCTTT GGGGCTTGCA GACGATCGAT CCGGCAGCTC CGGTCGCCCA CGTCAGCTAT
TATGAGGCGG ACGCTTTTGC GCGCTGGGCC GGCAAGAGGC TTCCGACTGA ATTCGAGTGG
GAGGCCGCCG CCGAGGGGGT CGCCGTCGCT GGCAATATGC TGGAGCACGA TGCGTTGCGT
CCGCTCCCCG CCGGGCCGCG CGGCGCCTTG CGTCAAATGT TCGGAGACGC ATGGCAATGG
ACCCAGAGCG CTTATTCGCC GTACCCGGGC TATCGGCCGC AGCCGGGCGC GATCGGCGAA
TATAATGGCA AGTTCATGTG CAGCCAGCAG GTGCTGCGCG GATCGTCCTG CGTAACGCCG
AAGAGGCATG CGCGTAAAAC CTATCGGAAC TTCTTTTACC CGCATCAGCG CTGGCAATTC
AGCGGGTTGC GACTGGCAGA TGAGGCGTGA
 
Protein sequence
MSASALSSPE PHARPPLASL VDDLFAVRRQ TLALASHLGP EDQAVQANED ASPTKWHLAH 
TTWFFETFVL QPFVPAYRIF DERFNFCFNS YYDHQGPRQP RALRGLLTRP TAPEVLAYRA
YVDEQLHTLF ASARSEDSDL LRTVEIGVNH EEQHQELLLT DILALFAQNP LRPAYRESAK
PSVRQDLDDM VFLSFEGGLR FAGHEGHGFA FDNESPQHQT FLRPFKLSNR LVANGEWMEF
MADGGYLTPT LWLADGWAKV TKEGWRAPLY WEQNQNHWEE MTLWGLQTID PAAPVAHVSY
YEADAFARWA GKRLPTEFEW EAAAEGVAVA GNMLEHDALR PLPAGPRGAL RQMFGDAWQW
TQSAYSPYPG YRPQPGAIGE YNGKFMCSQQ VLRGSSCVTP KRHARKTYRN FFYPHQRWQF
SGLRLADEA