Gene Mnod_5845 details


Gene Information

Locus tag: Mnod_5845
Symbol:
ID: 7301107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium nodulans ORS 2060
Kingdom: Bacteria
Replicon accession: NC_011894
Strand:
Start bp: 5945256
End bp: 5946497
Gene Length: 1242 bp
Protein Length: 413 aa
Translation table: 11
GC content: 67%
IMG OID: 643603463
Product: cysteine desulfurase, SufS subfamily
Protein accession: YP_002500976
Protein GI: 220925674
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
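
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and a CDS that ends in a single stop codon not counted as a residue. The variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 5945256
end_bp = 5946497
gene_length_bp = 1242
protein_length_aa = 413

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted as a residue.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates match the gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # codons minus stop match the protein length
```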


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.187018
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGCAC CCGTCCTCCC GCCCTACGAC GTCGCGGCGA TCCGGTCGCA ATTCCCGATC 
CTGTCGCAGA CGGTCTACGG CAAGCCGCTC GTCTATCTCG ACAACGCCGC CTCGGCCCAG
AAGCCGAAGG CGGTGATCGA CGCCATGGCG GAAGCCATGG AGACGGCCTA CGCCAACGTC
CATCGCGGCC TGCACTTCAT GGCGAATGCC GCCACGGAAG GCTTCGAGGG CGCCCGCGAG
ACCGCGCGGC AATTCCTCAA CGCCCGCTCG ACGGACGAGA TCATCTTCAC CCGCAACGCG
ACCGAGGGCT ACAACCTCGT CGCCTCGTCG ATGGGCTGGG CCGGCCTGAT CGGGGAGGGG
GACGAGATCA TCCTCTCGAT CATGGAGCAC CATTCCAACA TCGTGCCCTG GCACTTCCTG
CGGGAGCGCC GCGGCGCGGT GATCAAGTGG GCGCCCGTCG ACGACGAGGG CAACTTCCTC
GTCGAGGAAT ACGAGAAGCT GTTCACGCCG CGCACCAGGA TGGTGGCGAT CACCCACATG
TCGAACGTGC TCGGCACGGT GACGCCGGCC CGTGAGATCG TCCGCATCGC CCATGCGCAC
GGGGTGCCGG TGCTCCTCGA CGGCGCCCAG AGCGCGGTGC ACCAGACGAT CGACGTGCAG
GATCTCGACT GCGATTTCTT CGTCTTCACC GGCCACAAGG TCTATGGGCC GACCGGCATC
GGCGTGCTCT ATGGCAAGAA GGAATGGCTG GAGCGCCTGC CCCCCTATCA GGGCGGCGGC
GAGATGATCC AGACCGTCAC GCAGGACGCG ATCACCTACA ACGAACCCCC GCACCGCTTC
GAGGCGGGCA CCCCGGCGAT CGTCGAGGCG GTGGGCCTGG GCGCCGCCCT CGAATTCATG
ATGAAGCTCG GCCGCGACCG GATCGCCGCG CACGAGGCCG CTCTCTCGGC CTATGCGCAT
GAGCGCCTGT CCGAGATGAA CAGCCTGCGC ATCATCGGCC GGGCGAAGGG GAAGGGCGCC
GTGATCTCCT TCGAGATGAA GGGCGCGCAT GCCCACGACA TCGCCACGGT GATCGACCGC
CAGGGCGTGG CCGTGCGGGC CGGCACGCAT TGCGCGATGC CGCTGCTCAG CCGCTTCGGC
ACGACCGCGA CCTGCCGCGC CTCGTTCGGG CTCTATAATA CGCCGGATGA GGTCGATGCG
CTGGTCGCGG CCCTCGCCAA GGCCGAGATG ATGTTCGCCT AG
 
Protein sequence
MNAPVLPPYD VAAIRSQFPI LSQTVYGKPL VYLDNAASAQ KPKAVIDAMA EAMETAYANV 
HRGLHFMANA ATEGFEGARE TARQFLNARS TDEIIFTRNA TEGYNLVASS MGWAGLIGEG
DEIILSIMEH HSNIVPWHFL RERRGAVIKW APVDDEGNFL VEEYEKLFTP RTRMVAITHM
SNVLGTVTPA REIVRIAHAH GVPVLLDGAQ SAVHQTIDVQ DLDCDFFVFT GHKVYGPTGI
GVLYGKKEWL ERLPPYQGGG EMIQTVTQDA ITYNEPPHRF EAGTPAIVEA VGLGAALEFM
MKLGRDRIAA HEAALSAYAH ERLSEMNSLR IIGRAKGKGA VISFEMKGAH AHDIATVIDR
QGVAVRAGTH CAMPLLSRFG TTATCRASFG LYNTPDEVDA LVAALAKAEM MFA
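
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how to work with them, the helpers below strip that layout, compute a GC fraction, and translate DNA using the amino-acid assignments of translation table 11 (these match the standard code; table 11 differs in its permitted start codons). The function names are hypothetical, and only the first three and the last two blocks of the gene sequence are used as a demonstration.

```python
# Codon table built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(dna):
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGAACGCAC CCGTCCTCCC GCCCTACGAC"))  # MNAPVLPPYD
# Last two blocks of the gene sequence, ending in the TAG stop codon.
print(translate("ATGTTCGCCT AG"))  # MFA
```

Passing the full gene sequence to `translate` and `gc_fraction` is one way to compare against the listed protein sequence and the reported GC content.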