Gene Msil_1950 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_1950 
Symbol 
ID7094068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2125662 
End bp2127077 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content60% 
IMG OID643465277 
ProductUBA/THIF-type NAD/FAD binding protein 
Protein accessionYP_002362255 
Protein GI217978108 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones88 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCTC GCATAACCCT TGCGCTCTCG GGTGATCAGC ACGAGCACCT GATGAGCTTC 
CTGTTTCCGG GGGATGGCAA AGAGGCAGTT GCCATTCTAC TGTGCGGACG CCGCGATGGT
GATCGCTGCC ATCGCCTTGT CGTCCGAGAG ATACACGGTA TTCCTTACGA CGACTGCTCC
GAGCGGACGC CGTCGCGCGT CACATGGCCG CCAGATTACA TCGCGCCGAT GCTGGATCGG
GCTGCCGCCG AGCGTCTTTC GGTCGTCAAA GTTCACAGTC ATCCGACCGG CTATGGCGCA
TTCTCCACAA CCGACGACGC GGGCGACGCA CGCCTCCTGC CGATGGTCCG TGGATGGGTC
GAGGCCAATG TCTTTCACGG CAGCGCAGTC ATGTTGCCTT ACGGCCAGAT GTTCGGGCGC
GTCATGTTGG ACGACGGCAG CTTTGCGCCA ATCGACTGTA TCTCTGTGGC AGGCGACGAC
CTTCTTTTTT GGTATGCGGA CGCGGGAAGC GTTGCCTTGC CGAACTTCGT CGCATCTCAT
GCGCAAGCCT TTGACGAAGG AACAATCCAG CGGCTCCGCC GCCTTTCGTT TGCCGTGGTC
GGGGCCTCCG GCACCGGAAG CCCGACTGTT GAACAGCTCG TCAGGCTGGG CGCTGCTGAA
ATCGTGATTG TTGACGATGA TTACATGGAG GATCGCAACG TCAACCGTAT CCTGAACTCC
ACAATGCAAG ATGCGAGTGA CAGTCGGACG AAAGTCGACG TGCTCGCGGA TGCTGCCGAG
CGGATCGGCC TTGGAACCCG CGTCGTTCGC GTGCGCAAGA ATCTCTGGCA TCCCGATGTC
ATTCGAGAAG TCGCACAGTG CGACGTGATA TTCGGCTGCA TGGACACGGT CGATGGTCGC
TACCTCCTCA ACGCGCTCGC CTCATATTAC TCGATCCCAT ATTTCGATAT TGGCGTGCGC
CTCGATGCGG TCCGGGACGG CGCTGGGAAA GGTCGCATCC GCGAAGTCTG CGGCACCGTC
AACTACCTTC GCCCTGGTCG CTCCAGTCTT ATGAGCCGGG GTCTGTTCAC GATGGGCGAG
GTCGCCGCGT CGGGCCTAAG GCGTAATGAT CCGCGCGCCC ATGAGCGCCA GGTCGATGAC
GGATACATTA AGGGAGTTGC GGCGCATCGC CCTGCGGTAA TCAGCGTGAA CATGTTTGCA
TCCGCGCTTG CCGTGGATGA GTTCCTCGCC CGTCTCCATC CCTTCCGCGA AGAGCCGAAC
GCAAGCTATG CGAGCGTAAC GTTCAGTCTC GCCAGCATGG AGCTGATCTG CGATCCTGAA
GAGGGCATCT GCGAAATTCT GGGCGGCGCT GTTGGCATCG GCGACACATC CCCGCTTCTA
GGGATAATGG AACTCGCTGA AAGGCGGGTG TCGTGA
 
Protein sequence
MSARITLALS GDQHEHLMSF LFPGDGKEAV AILLCGRRDG DRCHRLVVRE IHGIPYDDCS 
ERTPSRVTWP PDYIAPMLDR AAAERLSVVK VHSHPTGYGA FSTTDDAGDA RLLPMVRGWV
EANVFHGSAV MLPYGQMFGR VMLDDGSFAP IDCISVAGDD LLFWYADAGS VALPNFVASH
AQAFDEGTIQ RLRRLSFAVV GASGTGSPTV EQLVRLGAAE IVIVDDDYME DRNVNRILNS
TMQDASDSRT KVDVLADAAE RIGLGTRVVR VRKNLWHPDV IREVAQCDVI FGCMDTVDGR
YLLNALASYY SIPYFDIGVR LDAVRDGAGK GRIREVCGTV NYLRPGRSSL MSRGLFTMGE
VAASGLRRND PRAHERQVDD GYIKGVAAHR PAVISVNMFA SALAVDEFLA RLHPFREEPN
ASYASVTFSL ASMELICDPE EGICEILGGA VGIGDTSPLL GIMELAERRV S