Gene Msil_2471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2471 
Symbol 
ID7091023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2698925 
End bp2699884 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content68% 
IMG OID643465792 
Productprotein of unknown function DUF58 
Protein accessionYP_002362762 
Protein GI217978615 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCTGT TCGCGCGCTT GCAATCGCAA TTCTCACGCA AAAATGATGC GCCAGCCTTT 
GCTGGCCGCG AGTCTGGCGA GAGCATTCCC GGCGTCGATC TCGACGTCGA TGACCTCTTG
CGCATCCGTC ATCTCGCCGA GCGCATGGAT CTGCCGAAAT CTGCGCCGCG CTCGACCTTG
CCCGGCAATG TGGCGCATCG CCGGCGCGGC CGCGGCCTTG AGGTGCATGA CATCCGGAGC
TGGTCGGATG GCGACGACGT CCGCCATCTC GACCGCAATG TGATGGCGCG CACGGGAATT
CCGCATGTGC GAACCTTTCG CGAAGAACGC GAACGCGCCG TTCTTCTCGT CGCGGACTTC
CGGCCCTCCA TGCTGTTTGG CACGCGGCGC GCGCTGCGCT CCGTCGCGGC GGCGGAGGCG
CTGACCCTCC TCGGCTGGCG CGCCGCCCGC GACGGGCGCG TCGGCCTGAT GGTCATTCAG
CATGACGGCG GTCATCTGAT CCGCTACGGC CGCGGCGCGC GGGCGATGAT CGCCATGGTC
TCCGAGCTCG CGCGGGCGCA TCGCAACGCG CTGGCGAGCC GCTCAAGGCT CGATCCGCCG
CTGACCGAGA GCCTTGAGGA AGCCGACCGG CTCGCCGGCA AGAACGCCGC AATCGTCGTC
GCCACTGCGC TGGACGAGCC GGGACCGCAG TTCGACGAGA TCGTGGCGCG GATCGCGCTA
CGGCGCGATC TTTCCTTCGC GCTCATCGCC GACCGGTTCG AGACCGCGCC GCCGCAAGGC
TCCTATCCTT ATGCAACAAT GGCCGGCGCT GCGGGTTGGC TGAGCATCGG CGCGAATGAG
CCGCAAAAGC CGGACGAGCG CGTCGCCCGG CTGCAGCGGC TTGGCGCGCG CGCCTTGAGC
CTCGATTCCC GCCTCGATGT CGAGGCGATG GCGCCGCTGC TGGAGCGTCT CGATGGCTGA
 
Protein sequence
MSLFARLQSQ FSRKNDAPAF AGRESGESIP GVDLDVDDLL RIRHLAERMD LPKSAPRSTL 
PGNVAHRRRG RGLEVHDIRS WSDGDDVRHL DRNVMARTGI PHVRTFREER ERAVLLVADF
RPSMLFGTRR ALRSVAAAEA LTLLGWRAAR DGRVGLMVIQ HDGGHLIRYG RGARAMIAMV
SELARAHRNA LASRSRLDPP LTESLEEADR LAGKNAAIVV ATALDEPGPQ FDEIVARIAL
RRDLSFALIA DRFETAPPQG SYPYATMAGA AGWLSIGANE PQKPDERVAR LQRLGARALS
LDSRLDVEAM APLLERLDG