Gene Msil_2671 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2671 
Symbol 
ID7091140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2921140 
End bp2922243 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content66% 
IMG OID643465985 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002362955 
Protein GI217978808 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACC GTCAAGACAG CGACAAACCG GCGGGCGTCG CATCGTCCCG CCGCATTTTT 
ATCGCCGGGG CCGCCGCGGC GGCCGTCGGA GCGCCCGCAG CCCTCGCGGC CGGCCGCGTC
TTCGCCTTTC CGCGCGCGGC GATCGACGCG CAGGGGCTGC CGATCTGCAG CGTCGCGGCC
GACGGTCCGG CGCCCGCGGC CGGGCCGCTG AAGAAAATCA CCTTCGCCTG GAACGCCGGC
GCGCCCTGCC TCGTCGCCGT CACTGTCGCC AAGGATAAAG GCTTCTTCGC AAGACATGGG
CTCGACGTCG ACCTCATCAA CTACTCCGGC TCGACCGACC AACTGCTCGA GACGCTCGCG
ACCGGCAAAG CCGACGCCGC AATCGGCATG GCCCTGCGCT GGCTGAAGCC GCTGGAGCAG
GGCTTTGACG TCAAGATCAT CGCCAGCACT CATGGCGGCT GCCTGCGCCT TCTCGTTCCG
GCGGACTCCG GGCTCGGCGA TCTCAAGGAC CTCAAGGGAA AAACGATCGC CGTCAGCGAC
ATGAATGCGC CGGGAAAAAA CTTCTTCGCG ATCGCTCTGA AAAGGGCGGG GCTCGATCCC
GTCGCGGACG TCGATTTCAA GCCGTTTCCG GGACCGCTTC TGCGCGCCGC CGTGGAGAAA
GGCGAGGCGC ACGCCATCGC CGATACGGAT CCCAACACCT TCCTCTGGCT GAAGGACGGC
AAGTTCAAGG AGATCTCGTC GAATCTTTCG GGGGACTATG CGCAGCGAGC CTGTTGCATC
GTCGGCGTGC GCGGCGGGCT GGTCCGCGAC GATCGGCCGA CCGCCGCGGC CATCGCCCGG
GCGCTGCTCG AGGCGGCGGA CTTCGCCCAT GCTCATCCCA GTGAGGCCGC CGCCACCTAT
CTGCCTTTCG CGCCCGGCAG CGTCTCCCTC GACGATCTGA CGACGCTCGC GAAATATCAT
ACACATCAGC ACCATCCCGT CGGTCAGGCG CTGAAGGATC AGCTCGCAAG CTATGCGGAA
GAGTTGAAGC TCGTCTCCGT CTTCAAGCCG ACGACGGATA CGGCGAAATA CGCCGCGCGC
ATCTATGCCG ATGTCCTCAG CTGA
 
Protein sequence
MTNRQDSDKP AGVASSRRIF IAGAAAAAVG APAALAAGRV FAFPRAAIDA QGLPICSVAA 
DGPAPAAGPL KKITFAWNAG APCLVAVTVA KDKGFFARHG LDVDLINYSG STDQLLETLA
TGKADAAIGM ALRWLKPLEQ GFDVKIIAST HGGCLRLLVP ADSGLGDLKD LKGKTIAVSD
MNAPGKNFFA IALKRAGLDP VADVDFKPFP GPLLRAAVEK GEAHAIADTD PNTFLWLKDG
KFKEISSNLS GDYAQRACCI VGVRGGLVRD DRPTAAAIAR ALLEAADFAH AHPSEAAATY
LPFAPGSVSL DDLTTLAKYH THQHHPVGQA LKDQLASYAE ELKLVSVFKP TTDTAKYAAR
IYADVLS