Gene Msil_1162 details


Gene Information

Locus tag: Msil_1162
Symbol:
ID: 7093925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1249125
End bp: 1250159
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 60%
IMG OID: 643464503
Product: NMT1/THI5 like domain protein
Protein accession: YP_002361493
Protein GI: 217977346
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
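
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon; neither assumption is stated in the record, but both agree with the numbers shown. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1249125, 1250159)
print(length)                  # 1035
print(protein_length(length))  # 344
```

Both values match the Gene Length and Protein Length fields of this record.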


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.139454
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGCAC CGCTCGCAAC ACGGCGCCGA TCGCGGGCCT CAGGCGCCCT TGTCGCCGCC 
CTGTTCATCT TCGCCTTTTT CGGCGCGCCC GCCCTCGCCC TCGACAAAGT GACTTTCGCC
ACCAACTGGC TTGCCGAAGG CGAGCATGGC GGATTCTATC AGGCCAAGGC CGACGGCACC
TATCAGCGCT ATGGCCTCGA CGTTTCGATT TTGCACGGGG GGCCGCAGGC CAACAACAGG
CTGTTGCTGG CGGCCGGCAA AATCGAGTTT AATTTGGCCG CCAATCTGAT CCAGTCCTTC
GACGCCGCCT CGCAAAACAT TCCGCTCGTC GCGGTCGCCG CGCTGTTCCA GAAAGACCCC
TTCATTCTGA TGTCCCACCC CGACGCGGGG TTCGACAAGA TCGAGGATTT GCCGCGGGCG
ACCGCCTTCA TCGGCAAGGA CGCCTTCGTC TCGGTCTATC AATGGCTGAA GAGCGCCTAT
GGATTTCGCG AGGACAAGGT CCAGCCCTAT AATTTCAACG CCGCCCCCTT CATCCGCGAT
AAAAATTCGA TCCAGCAGGG CTATGCGACG TCGGAGCCTT TCGCCATCGA GCGCGAGGGC
GGCTTTCGGC CCAATGCGTT CCTCATCGCC GACTATGGCT ATGATTCCTA CTCAACCCTG
ATCGAGACGC GGGCCGACCT CATCGCCAAA AACCCCGACC TCGTGCAGCG CTTCGTCGAC
GCCTCGATCA TCGGCTGGCT GCATTATCTG TATGGCGACA GCGGCAAGGC GGATGCGCTG
ATCCTTGCCG ACAATCCCGA CATGACGAAA GAGCTGCTCG CCTATTCGCG CGACAAGATG
AAGGAGCTCG GCATCGTCGT TTCCGGCGAG GCAAGGACGC TTGGCGTCGG CGCCATGACA
GAGCCTCGCG TCAAAAGCTT TTTCGGCAAG ATGGCGGCGG CCGGATTGTT CAAGCCCAAT
CTCGATTATC GCAGCGCCTA CACGCTGCAA TTCATCAACA AGGGCGTCGG CCTTGATCTC
ATTCCGCGCC CGTAA
 
Protein sequence
MTAPLATRRR SRASGALVAA LFIFAFFGAP ALALDKVTFA TNWLAEGEHG GFYQAKADGT 
YQRYGLDVSI LHGGPQANNR LLLAAGKIEF NLAANLIQSF DAASQNIPLV AVAALFQKDP
FILMSHPDAG FDKIEDLPRA TAFIGKDAFV SVYQWLKSAY GFREDKVQPY NFNAAPFIRD
KNSIQQGYAT SEPFAIEREG GFRPNAFLIA DYGYDSYSTL IETRADLIAK NPDLVQRFVD
ASIIGWLHYL YGDSGKADAL ILADNPDMTK ELLAYSRDKM KELGIVVSGE ARTLGVGAMT
EPRVKSFFGK MAAAGLFKPN LDYRSAYTLQ FINKGVGLDL IPRP
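
The relationship between the two sequences can be checked by translating the gene sequence with translation table 11 and comparing the result with the protein sequence. What follows is a minimal sketch, not the database's own method: the helper names `translate` and `gc_content` are hypothetical, and the codon table is built from the NCBI amino-acid string for table 11. Alternative start codons are not handled, which does not matter here because the gene starts with ATG.

```python
# Codon table for NCBI translation table 11, in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 15 nucleotides of the gene sequence above.
print(translate("ATGACTGCAC CGCTCGCAAC ACGGCGCCGA"))  # MTAPLATRRR
print(translate("ATTCCGCGCC CGTAA"))                  # IPRP
```

The two outputs match the first ten and last four residues of the protein sequence. Passing the full gene sequence to `translate` should reproduce all 344 residues, and `gc_content` applied to the same sequence can be compared with the reported GC content of 60%.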