Gene Mext_3980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_3980 
Symbol 
ID5835593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp4423765 
End bp4425021 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content66% 
IMG OID641369771 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001641422 
Protein GI163853379 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.970806 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAC CCGTCCGTAC CACTGCCCAG GACTCGTCCT ACGACGTCGA AGCGATCCGG 
AAGGAATTCC CGATCCTGTC GGAAAAGGTC TACGGCAAAC CGCTGGTCTA TCTCGACAAC
GCCGCCTCGA CGCAGAAGCC GCGGGCCGTG ATCGACGCGA TGGTCTCCTG CATGGAGACC
GGCTACGCCA ACGTGCACCG TGGCCTGCAC TACATGGCCA ATGCCGCGAC CGAAGGGTTC
GAGGGCGCGC GCGAGACCAC GCGCCAGTTC CTCAACGCGG CCTCGACCGA CGAGATCATC
TTCACCCGCA ACGCGACCGA GGCCTACAAC CTCGTGGCCT CCTCCATGGG CTGGGCCGGG
CTGATCGGGG AGGGGGACGA GATCATCCTC TCGATCATGG AGCACCACTC CAACATCGTG
CCCTGGCATT TCCTGCGCGA GCGGCGCGGC GCCGTCATCA AGTGGGCGCC GGTCGATGAC
GACGGCAACT TCCTGGTCGA GGAATACGAA AAGCTCTTCA CGCCGCGCAC CAAGATGGTG
GCGATCACCC ACATGTCGAA CGTGCTCGGC ACGGTGACGC CGGCCGAGGA GATCGTGCGC
ATCGCCCATG CCCACGGCGT GCCGGTGCTG CTCGACGGGG CGCAGAGCGC GGTGCACCGC
CCGGTCGATG TGCGGGCGCT CGATTGCGAC TTCTTCGTCT TCACCGGCCA CAAGGTCTAC
GGGCCGACCG GCATCGGCGT GCTCTACGGC AAGAAGGAGT GGCTCGACCG TCTGCCGCCC
TACCAGGGCG GCGGCGAGAT GATCCGCACG GTGAGCCAGG ACGCGATCAC CTACAACGAT
CCGCCCCACC GCTTCGAGGC GGGCACGCCG GCGATCATCG AGGCGGTCGG CCTCGGCGCG
GCGCTGGAAT TCATGATGAA GCTCGGCCGC GACAAGATCG CCGCGCACGA GGCGATGCTG
ACCGCCTACG CCCAGGAGCG GCTCGGCGCG ATGAATTCGA TCCGCCAGAT CGGCAATTCC
CGCGACAAGG GCGGCGTCAT CGCCTTCGAG GTGAAGGGCG CGCACGCCCA CGACATCGCC
ACCGTGATCG ACCACCAGGG CGTGGCGGTA CGGGCCGGCA CCCACTGCGC GATGCCGTTG
CTGACGCGCT TCGGTGTCAC CTCGACCTGT CGCGCCTCGT TCGGTCTGTA TAATACGACG
CAGGAAATCG ATGTCCTGGC CGCGGCTCTG GCCAAGGCCG AGATGCTGTT CGCCTGA
 
Protein sequence
MNAPVRTTAQ DSSYDVEAIR KEFPILSEKV YGKPLVYLDN AASTQKPRAV IDAMVSCMET 
GYANVHRGLH YMANAATEGF EGARETTRQF LNAASTDEII FTRNATEAYN LVASSMGWAG
LIGEGDEIIL SIMEHHSNIV PWHFLRERRG AVIKWAPVDD DGNFLVEEYE KLFTPRTKMV
AITHMSNVLG TVTPAEEIVR IAHAHGVPVL LDGAQSAVHR PVDVRALDCD FFVFTGHKVY
GPTGIGVLYG KKEWLDRLPP YQGGGEMIRT VSQDAITYND PPHRFEAGTP AIIEAVGLGA
ALEFMMKLGR DKIAAHEAML TAYAQERLGA MNSIRQIGNS RDKGGVIAFE VKGAHAHDIA
TVIDHQGVAV RAGTHCAMPL LTRFGVTSTC RASFGLYNTT QEIDVLAAAL AKAEMLFA