Gene Msil_1075 details

Gene Information

Locus tag: Msil_1075
Symbol:
ID: 7091904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1164268
End bp: 1165515
Gene Length: 1248 bp
Protein Length: 415 aa
Translation table: 11
GC content: 68%
IMG OID: 643464415
Product: protein of unknown function DUF900 hydrolase family protein
Protein accession: YP_002361406
Protein GI: 217977259
COG category: [S] Function unknown
COG ID: [COG4782] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
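
The coordinate and length fields in the gene information are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive (a common convention, but not stated in this record) and that the unspliced CDS ends in a single stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1164268, 1165515)
print(length)                     # 1248
print(protein_length_aa(length))  # 415
```

Under these assumptions the computed values agree with the Gene Length (1248 bp) and Protein Length (415 aa) fields.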


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.290615
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGCTTA CGCGCGCGGC GCGAACCATT TCCGTCCTTG CCCGGCCGGC GCTGCTGATC 
GCCGGCCTGC TGGCGCTTGC GGGCTGCGGC GGCGGAATGA CGGATCTCGA CGGCGCGGGC
GCCGGGCCGC GCCCGACCTC GGTGTTTGTC GTCTCGACGC GCAAGGGCGA AAGCGGTCCC
GCCAGCGAAC TCAGCAATGA TGGAAACGAG CGCTATTCGC TGCAGATGAT CGGCGCGCCG
CTCAATCACC AGATCGGACA ATTGGAGCGC CCTTCCATCG GCAGTCCCGA TCCCGCGCGT
CATTTTGCGC TTCAGACGCG CCGCGCGCTC GACGAGGATG GTTTTACGGC CGCGCTCGCC
ACGCATCTGT CGGGGCGCAT CGGCTCCAAC CGCGACGTTC TCCTTTATGT GCACGGCTTC
AACACGAGCT ATGACGAGTC GCGGTTCCGG CTCGCGCAGA TCGTCGCGGA CGGCCGCTTT
GGCGGCGTCG CCGTCCTGTT CACCTGGCCG TCGACGAATA ATCTGCTCGA CTATGGCGCG
GCGAAAGAGA ATGCGACGAT CTCGCGGGAC GCGCTGGCGA AGCTGATCCG GCAGCTGACG
GATGCGCCCG ACGTCGGGCG CGTGCACATC CTCGCCCATT CGATGGGGGC CTGGCTGACC
ATGGAGGCGC TGCGCCAGGA TTTTATCGCG GGCGGCGCGC GGCTGAACGA CAAGCTGGGC
GATATCATGC TGGCCGCTCC CGACATCGAT CTGAATGTTT TCCGCCAGCA GATCAGCCGC
CTCGACGCGT CGCACATCTT CGTGCTCGTT GCAGCCAATG ATCGCGCCTT GTCGCTCTCG
CGCACGCTGA CCAGTGATCG GCCGCGCCTC GGCGCGCTCG ATCCGAAGAA CCCGGCCGAC
CGATCGGCGC TCGAGACGCT CGGCGTCAGG GTTTATGATC TGAGCCGGGA GGCTGATATA
TTCATCGGCC ACGGCGCCTA TGCGGACGCG CCCGACGCGC TGCGCACCAT CGGCGCGCAG
ATCGCCGCGC CGCGGCCGCA AGACTCCAAT GTTCAGGCGG TTCTCGGCGA AAACCCCATC
GACGACCGCA TTCACGCCAC GCCCTTGCCG CCGCCGGCCG CTGCAGCGCC TGGCGCGCCC
GCCGCCGCTC CGGCCCGCCC CGAGGCGCCG ATCAGTGCGG TGGTCCCGCT TGCGACGGCG
ACGCCGGGTT CCGCGACGCC GGCGTCTTCC GCCGCGCCGA CGCCCTGA
 
Protein sequence
MRLTRAARTI SVLARPALLI AGLLALAGCG GGMTDLDGAG AGPRPTSVFV VSTRKGESGP 
ASELSNDGNE RYSLQMIGAP LNHQIGQLER PSIGSPDPAR HFALQTRRAL DEDGFTAALA
THLSGRIGSN RDVLLYVHGF NTSYDESRFR LAQIVADGRF GGVAVLFTWP STNNLLDYGA
AKENATISRD ALAKLIRQLT DAPDVGRVHI LAHSMGAWLT MEALRQDFIA GGARLNDKLG
DIMLAAPDID LNVFRQQISR LDASHIFVLV AANDRALSLS RTLTSDRPRL GALDPKNPAD
RSALETLGVR VYDLSREADI FIGHGAYADA PDALRTIGAQ IAAPRPQDSN VQAVLGENPI
DDRIHATPLP PPAAAAPGAP AAAPARPEAP ISAVVPLATA TPGSATPASS AAPTP
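
The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted as plain text in the space-separated 10-base blocks used in this record. The helper names are hypothetical, and only the first display line of the gene sequence is used, for brevity.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence only, not the full gene.
first_line = "ATGCGGCTTA CGCGCGCGGC GCGAACCATT TCCGTCCTTG CCCGGCCGGC GCTGCTGATC"
seq = clean_sequence(first_line)
print(len(seq))                   # 60
print(f"{gc_fraction(seq):.0%}")  # GC fraction of this line only
```

Applied to the full 1248 bp gene sequence, the result can be compared against the GC content reported in the gene information.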