Gene Msil_2540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2540 
Symbol 
ID7091091 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2775158 
End bp2776456 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content64% 
IMG OID643465856 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_002362826 
Protein GI217978679 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGACGC ATTTTTCACT GGAACAGCTC GCCGACCCTC GCCTTGCCGT CTCGGAAGCG 
ATCCTGCGCA AATGCGTGCA TTGCGGATTT TGCACGGCGA CCTGTCCGAC CTTCACCTTG
CTCGGCGACG AGCTCGATAG TCCGCGCGGC CGCATCTATC TCATCAAGGA AATGCTTGAG
AATGGCCGCC CTGCCGACAG CGAGACGGTC AAGCATATCG ACCGCTGCCT GTCGTGCCTC
TCCTGCATGA CGACCTGCCC GTCGGGCGTG AATTACATGC ATCTCGTCGA CGAGGCGCGC
GTGCGCATCG AAGAAACCTA TAAAAGGCCG CTCGCCGACC GCGCGTTGCG CGCGCTGCTC
GGCTTTGTCC TGCCCTATCC CGAGCGTTTT CGCCTGGCGC TGCGCGGAGC CGCGCTCGCC
GCTCCGTTAG CGCCGCTGCT GGCGAAATTC GGGCAGACGG GGGATCGGCT CAACGCCATG
CTCCGGCTCG CGCCGCGATT CCTGCCAAAG CGTACGGATT TCCCGCAAGC CAGCGTCAGC
CCCGCGCAAA AACCCATCGC CCGCGTCGCC ATCCTGGCAG GCTGCGCGCA AAAAGTGCTG
CGGCCAGCGA TCAATGACGC CGCGATCCGG CTCCTGACGC GGCTCGGCGT CGAAATCGTG
CAGCCGCCCG AGGAAGGCTG CTGCGGCGCG CTGGTGCATC ATCTCGGCCA GGAGCATGAG
GCGCTGGCGC AGGCCAGGCA CAATATCGAC GTCTGGACTG CCGAGATCGA CAAAGGCGGC
CTCGACGCCA TTTTGATCAC GGCGTCAGGC TGCGGCACGG TCATCAAGGA TTACGGCTTC
ATGCTGCGCC TTGATGAAGC CTACGCCGCC AAGGCCGCCA AAATCTCCGC CCTCGCCAAG
GACGTCACCG AATTTCTCGG CGGCCTCAAA TTGCCGGCGC CCCTCGCCGC GCAAAAACCC
GCGCTCGCCT ATCATTCGGC CTGCTCGATG CAGCATGGGC AAAAAATCAC GAGCCTGCCC
AAAGAGCTTT TGAAGGCCGC CGGCTTCGAG GTGCGCGACG TTCCCGAAGG GCATTTGTGC
TGCGGCTCGG CCGGCACCTA CAACATCCTG CAGCCCAAAA TCGCCGAGCA ATTGCGCGCG
CGAAAAATCG AAAACATCCT GAAAACGGCG CCGGACGGCA TAGCCACCGG AAATATCGGC
TGCATGACGC AGATCGGCGC CGGCGCGGGC GTGCCCGTGC TGCACACGGT CGAATGGCTC
GACTACGCCT ATGGCGGCCC CAAGCCGGCG GAGATTTAG
 
Protein sequence
MQTHFSLEQL ADPRLAVSEA ILRKCVHCGF CTATCPTFTL LGDELDSPRG RIYLIKEMLE 
NGRPADSETV KHIDRCLSCL SCMTTCPSGV NYMHLVDEAR VRIEETYKRP LADRALRALL
GFVLPYPERF RLALRGAALA APLAPLLAKF GQTGDRLNAM LRLAPRFLPK RTDFPQASVS
PAQKPIARVA ILAGCAQKVL RPAINDAAIR LLTRLGVEIV QPPEEGCCGA LVHHLGQEHE
ALAQARHNID VWTAEIDKGG LDAILITASG CGTVIKDYGF MLRLDEAYAA KAAKISALAK
DVTEFLGGLK LPAPLAAQKP ALAYHSACSM QHGQKITSLP KELLKAAGFE VRDVPEGHLC
CGSAGTYNIL QPKIAEQLRA RKIENILKTA PDGIATGNIG CMTQIGAGAG VPVLHTVEWL
DYAYGGPKPA EI