Gene Msil_0490 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_0490 
Symbol 
ID7091223 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp543248 
End bp544315 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content63% 
IMG OID643463820 
ProductNMT1/THI5 like domain protein 
Protein accessionYP_002360824 
Protein GI217976677 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTCA CAGGCGCCGT CGGCGATCAC GCGCCGACGC GGAGACAGGC GGTCGCGCAA 
ATCTCCGCTT TTTGCGCATT AGGGTTTTCG CCGTCGAGGG GGGCGGCCGC CGTCCCGAAG
ATTACAGTCG CTCTCGTCCG AAGCGCAGGG TCCGGCCCGC TCTTCATTGC GGCCGCAAAG
GGATATTTCG CGGACGAAGG GCTCGATGCC GAGCTGCGCT TCGTTGCGTC CGATGATGAC
GCGAGGGCCG CCGTCGCCGC AGGCGAGGCC GCCTTCGGCG TTTCGCAGCT GACAGCCTCA
TTTTTCAGCT ATGCGGTAGA TCAGCGGCTG ACGCTGATCG CCTCGCAATT CAGCGATCAG
GCGGGGTTTC CCGCCAATGC TCTCGTGATC GTCAAACCGG CCTATGACGC AGGGTTCAAA
AGCGTCCCCG ATTTGCGGCG CAAACAGATC GGCCTCGAGG ATGTGGGATC TGGCCGTCGC
TACGCCCTGG CGCATATCGC CGCGCGCTAC GGGCTAGATC CAGACGAGCT CACGATCGCC
GCGCTTGAAA GGCCTCAAAG AGAATTTGAG GCGTTGCGCA AAGGCGAAAT CGACGCCGCC
GTCGTTTCGT TTCACACGGC GCTCGAAACC GCCTCCTCCG CCAGCGATCT GGTTCTCGTC
AGGATGGGCG ATCTGGCGCA GTCGCAAATG GGGGCGGTCT TCACGGCCCA GCAGACGATT
GATTCAAACC GCCCGATCGT CGAGAAATTC ATCCGCGCCT ATCAGCGGGG CGTCGCCTCC
TACGATCTTA CATTTCTTCA GCGGTCCGAC GGCGACGACG AAGCCAAGCC CGACGACTAC
GACTCGACGT TGCAGCTGGT GTCTGAACAA GCGAATGTCG CGCCGCGCCT TATCGATCAG
GCGCCTCTTT ATTGCGATCG CCTCGGGCGA TTGGACGAGG CCGACGTCTC GGCGCAGCTC
GCATTCTGGC AAGACCACGG AATGGTCGCC CGAAGCGCGT CGGCAGCGAA TCTGATCGAT
GGGTCCTTTA CCGCCGAGCG CCTGCCGGGC AATCCGGATC CGAACTGA
 
Protein sequence
MRLTGAVGDH APTRRQAVAQ ISAFCALGFS PSRGAAAVPK ITVALVRSAG SGPLFIAAAK 
GYFADEGLDA ELRFVASDDD ARAAVAAGEA AFGVSQLTAS FFSYAVDQRL TLIASQFSDQ
AGFPANALVI VKPAYDAGFK SVPDLRRKQI GLEDVGSGRR YALAHIAARY GLDPDELTIA
ALERPQREFE ALRKGEIDAA VVSFHTALET ASSASDLVLV RMGDLAQSQM GAVFTAQQTI
DSNRPIVEKF IRAYQRGVAS YDLTFLQRSD GDDEAKPDDY DSTLQLVSEQ ANVAPRLIDQ
APLYCDRLGR LDEADVSAQL AFWQDHGMVA RSASAANLID GSFTAERLPG NPDPN