Gene Msil_3219 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3219 
Symbol 
ID7090634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3530987 
End bp3531982 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content65% 
IMG OID643466527 
Productprotein of unknown function DUF58 
Protein accessionYP_002363488 
Protein GI217979341 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.000742031 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAGTTT CCTACGCCGC TATAGGCCGC TCGTCTGGGC CGCGCCGCGT TCGTGAAACC 
TCTTCCTCTT CAAGGTTTGG TCGTTACATT TCCGTCGCCG ATCTCGTTGC GTTGCACGCG
GCGGCGCGGG ACATTGGCTT CTTGTCGCGC CAGCGCACGC AGAGCGTGCT GTCGAGCCGG
CGCCATCGCG GCTCCAATCT GCCGCCGACG TCCGGCGTTT TCGGCCCGCT GATGTCCGGC
GATTTGTCGG GGGCCGACCA CCGGCGGCGC TTTGGGCGCT CGGCCGCGGA CGTCACCGCG
TACGACGAGA CGCCGCCGAG GCCGATTTTT ATCGTCGTCG ACCAACGCCA ATGCATGTTT
TACGGATCGC GCCGTTCGCT GAAATCCGTC GCAGCGGCGG AAGCCGCCGC ACTTTGCATC
TGGCGCGCCC TCGACGACGG CGCGCCGATT GGAGGCGTGG TTTTCAACGA CGCCATTATT
GAAGCCGTCG AGCCGTCGAC CGGCAGTTCC GCGGCGATGG CCATCATCAA GGCCATAGCG
GGGCAAAACG CCGAGCTTCG CGCCAGGCCG GCGCAGCCGC GCGCGCCTTC GCAGCTCGAA
AAAGCGCTTC GATCCGAACG GCTGGAGCAG GCGAGCGGCA GCCTCATCGT CGTCATCAGC
GATTTTCAGG GCCATGGCGC ACACACGCGC GCCGCGCTGC AAAAGCTTGC CGAGGCCAAT
GAGGTCGTCG CCGTCTGCGC CTATGATCCT TATCTGTTGG ACCTGCCGAA AACGGGCGAG
ATCATCGTCA CCGGCGGCGA GGTGCAGATC GACCTCGAAT TCGGCCAAGG CCGCATCCGC
AGGCGGCTGT TCGACTATGC CGACGCGCAG GCGCAGGGGC TGTTGACGAT CGAAAGGGAG
ATTGGCGTGC CGGTGCTGTC CTTATCGGCG GCCGAGGACA CCTCGCTGCA AATGCGCCGC
CTGCTGGACG AGAACGTCTG GCGCGTGCGC CAATAG
 
Protein sequence
MAVSYAAIGR SSGPRRVRET SSSSRFGRYI SVADLVALHA AARDIGFLSR QRTQSVLSSR 
RHRGSNLPPT SGVFGPLMSG DLSGADHRRR FGRSAADVTA YDETPPRPIF IVVDQRQCMF
YGSRRSLKSV AAAEAAALCI WRALDDGAPI GGVVFNDAII EAVEPSTGSS AAMAIIKAIA
GQNAELRARP AQPRAPSQLE KALRSERLEQ ASGSLIVVIS DFQGHGAHTR AALQKLAEAN
EVVAVCAYDP YLLDLPKTGE IIVTGGEVQI DLEFGQGRIR RRLFDYADAQ AQGLLTIERE
IGVPVLSLSA AEDTSLQMRR LLDENVWRVR Q