Gene Mext_1178 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1178 
Symbol 
ID5832122 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1296926 
End bp1298944 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content70% 
IMG OID641366971 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001638651 
Protein GI163850608 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01976] cysteine desulfurase family protein, VC1184 subfamily
[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.0228689 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTACGC CTAACGCCCC CGCTCTCGGT CTTCCGACCG GCAACGCGGG CGATCTGGCG 
CATGCCGATC TCGTCGGGCG CCTCGCCCGC GAGATCACCG GCCAGGGACA GGCGGGTAGC
GCACCGTTCC AGCCCCAACT GCCGCAGACG CCTCAAGTCC CCCAGGGGCT TGAGACCCTT
CCCACCGGTT CCGGGCAGCC CGCGCTGCCC GGCGCCAATC TGGGTACGCC CTCCGTGCCC
TCGGGCGCGC TGCCGTCGGG ATTCCAGCCC GACCTGTCGG CGCAGGCCGC GCGCTCCTTC
GGCGCGCCGC CGGCCGGCCT GCCGGGCCTG ACCGGTCCGT CCCATGCCAA CCCGGTCGGC
GCGACGCCCG CGCCGGCGAG CGCAAACCTG TTCGACGTGG CGGCGATCCC CTCGGTGCAT
CCGTTCACGA ACCCGAACCG GCTGCCCGAA TTCTTCGTGC CGAGCCTGCC CGACGTCGCC
GCAGCGATCC CCGGTGGCCC CGACCTGCAC CGGCAGATCG CGGCCGACCA TCCCCGCGCC
AACGGCTTCG CCCACGCGCT GGCACCCCAC CTCGTGCCGC GCGATCCCGG CAAGCCCGGC
GGCGACGCCG CGCAGGAGCA TTTTTCGGAG TTCGGCCACC TGCCGCATCC GCGCACGCCG
GACAACACGG GCGATTACTA CTTCTTGAGC CCGGCCCCAC CGCCTGCGAG GGGCAAGAAG
GCTGCGGCCC AGCCGCGGGT CGAGCCCTCG CGCCACGCGG GGCCCGCCCC GCGGGTGACC
TCGCACGGGT CGGTTCCGGC CGGCGCGCCG TTCGATGTCG AGCACGTTCG CAAGGATTTT
CCGGCGCTGC ACCAGAGCGT GAACGGGCAC CGCCTCGTCT GGCTCGACAA CGCGGCCACC
ACCCACAAGC CGCAGAGCGT GATCGACGCC ACGAGCGAAT TCTACGGTCG CCACAACTCG
AACATCCACC GGGCCGCGCA CACGCTCGCC GCGCGCTCCA CCGACCTGTT CGAGGGCGGT
CGCGAGAAGA CGCGCCGCTT CCTCAACGCG CCGAGCAAGG ACGACATCGT CTTCCTGCGC
GGCACCACGG AAGCGATCAA CCTCGTCGCC AACTCCTACG GCCGGGCCAA TATCGGCCCC
GGCGACGAGA TCATCGTCTC GACCATCGAG CACCACGCCA ACATCGTGCC CTGGCAACTT
CTGGCACAGG CGACCGGCGC GACGATCCGG GTGATCCCGG TCAACGATCG CGGCGAGATC
ATCTTCGAGC AATACGCGGC TTTGCTCTCG GGCCGCACCA AGATCGTCTC GGTGACGCAT
GTCGCCAACG CGCTGGGCAC CGTGAACCCG ATCCGGGAGA TCATCGCTCT GGCGCATGCC
TACGGCGTGC CGGTGCTGGT CGATGCCGCG CAATCGACGC CGCACATCCC CATCGACGTG
CAGGCGCTCG ACGCCGATTT CCTCGTCTTC TCCGGCCACA AAGTGTTCGG GCCGACCGGC
ATCGGCGCCC TGTATGGGAA GCGGCACCTC TTGGAGGCGA TGCCGCCGTG GCAGGGCGGC
GGCCACATGA TCGAGGACGT CACCTTCGCT AAGACCGTCT ACAAGGGCGC ACCCGAAAAG
TTCGAGGCGG GCACGCCCGA CATCGCCGGC GCGGTGGGTC TCGGTGCGGC GCTCGATTAC
CTGGAGAGCG TCGGCCTGCC GGCAATCGCG GCCTACGAGC ACGACCTGCT CGAATATGCG
CAGGAAGGCT TGGCTGACGT GAAGGGCCTG CGCCTGATCG GCACCGCGCT CAACAAGGCG
AGCGTCATGT CCTTCACGGT CGATGGCCTC ACCAACGAGG CCGTCGCCCA CCACCTCGAT
TCCCTCGGGA TCGCGGTGCG CTCGGGCCAC CACTGCGCCC TGCCGGCGCT GCGCCGCTTC
GGCGTCGATC AGTCGGTGCG CGCCTCGCTG GCCTTCTACA ACACGCGGGA GGACGTCGAC
CTCTTCCTGC GCGGGCTGCA CACCTTACCG CGGCACTGA
 
Protein sequence
MTTPNAPALG LPTGNAGDLA HADLVGRLAR EITGQGQAGS APFQPQLPQT PQVPQGLETL 
PTGSGQPALP GANLGTPSVP SGALPSGFQP DLSAQAARSF GAPPAGLPGL TGPSHANPVG
ATPAPASANL FDVAAIPSVH PFTNPNRLPE FFVPSLPDVA AAIPGGPDLH RQIAADHPRA
NGFAHALAPH LVPRDPGKPG GDAAQEHFSE FGHLPHPRTP DNTGDYYFLS PAPPPARGKK
AAAQPRVEPS RHAGPAPRVT SHGSVPAGAP FDVEHVRKDF PALHQSVNGH RLVWLDNAAT
THKPQSVIDA TSEFYGRHNS NIHRAAHTLA ARSTDLFEGG REKTRRFLNA PSKDDIVFLR
GTTEAINLVA NSYGRANIGP GDEIIVSTIE HHANIVPWQL LAQATGATIR VIPVNDRGEI
IFEQYAALLS GRTKIVSVTH VANALGTVNP IREIIALAHA YGVPVLVDAA QSTPHIPIDV
QALDADFLVF SGHKVFGPTG IGALYGKRHL LEAMPPWQGG GHMIEDVTFA KTVYKGAPEK
FEAGTPDIAG AVGLGAALDY LESVGLPAIA AYEHDLLEYA QEGLADVKGL RLIGTALNKA
SVMSFTVDGL TNEAVAHHLD SLGIAVRSGH HCALPALRRF GVDQSVRASL AFYNTREDVD
LFLRGLHTLP RH