Gene M446_5106 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_5106 
Symbol 
ID6132376 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5606690 
End bp5607931 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content68% 
IMG OID641645241 
ProductSufS subfamily cysteine desulfurase 
Protein accessionYP_001771866 
Protein GI170743211 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0201229 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAC CCGTCCTCCC GCCCTACGAC GTCGCGGCGA TCCGGGCCGA ATTCCCGATC 
CTGTCGCGGA CGGTCTACGG CAAGCCCCTC GTCTACCTCG ACAACGCCGC CTCGGCCCAG
AAGCCGCGGG TGGTGATCGA GGCCATGACC CGCACGATGG AGACGGCCTA CGCCAACGTC
CATCGCGGCC TGCACTTCAT GGCGAACGCC GCCACCGAGG GCTACGAGGG CGCCCGCGAG
ACCGCGCGGC TGTTCCTCAA CGCCCGCTCG ACGGACGAGA TCATCTTCAC CCGCAACGCC
ACCGAGGCCT ACAACCTCGT CGCCTCGTCG ATGGGCTGGG CCGGCCTGAT CGGGGAGGGG
GACGAGATCA TCCTGTCGAT CATGGAGCAC CATTCCAACA TCGTGCCCTG GCACTTCCTG
CGCGAGCGCC GGGGCGCGGT GATCAAGTGG GCGCCCGTCG ACGACGAGGG CAACTTCCTG
GTCGAGGCCT TCGAGAAGCT GTTCACGCCG CGCACCCGCA TGGTGGCCAT CACCCACATG
TCGAACGTGC TCGGCACGAT CACCCCCGCC AAGGAGATCG TGCGCATCGC GCACGCGCAC
GGCGTGCCGG TGCTCCTCGA CGGCGCCCAG AGCGCCGTGC ACCAGAGCAT CGACGTGCAG
GATCTCGGCT GCGACTTCTT CGTCTTCACC GGCCACAAGG TCTACGGGCC GACCGGCATC
GGCGTGCTCT ACGGCAAGAA GGAGTGGCTG GAGCGCCTGC CCCCCTACCA GGGCGGCGGC
GAGATGATCC AGACCGTCAC CGAGGACGCC ATCACCTACA ACGAGCCGCC GCACCGCTTC
GAGGCCGGCA CCCCGGCGAT CGTCGAGGCG GTCGGCCTCG GGGCCGCCCT CGAATTCATG
ATGAATCTCG GCCGCGAGAG GATCGCCGCG CACGAGGCCG CCCTCTCGGC CTACGCGCAT
CAGCGCCTCT CGGAGATGAA CGCGCTGCGC ATCATCGGGC GGGCGCGGTC CAAGGGCGCG
GTGATCTCCT TCGAGATGAA GGGCGCGCAC GCCCACGACA TCGCCACGGT GATCGACCGC
CAGGGCGTGG CGGTGCGGGC CGGCACCCAT TGCGCGATGC CGCTGCTCAG TCGTTTCGGC
ACCACATCGA CCTGTCGCGC CTCGTTCGGA CTGTATAATA CGCGGGACGA GGTCGACGCG
CTGGTCGCGG CGCTCGCCAA GGCCGAGATG ATGTTCGCGT AG
 
Protein sequence
MNAPVLPPYD VAAIRAEFPI LSRTVYGKPL VYLDNAASAQ KPRVVIEAMT RTMETAYANV 
HRGLHFMANA ATEGYEGARE TARLFLNARS TDEIIFTRNA TEAYNLVASS MGWAGLIGEG
DEIILSIMEH HSNIVPWHFL RERRGAVIKW APVDDEGNFL VEAFEKLFTP RTRMVAITHM
SNVLGTITPA KEIVRIAHAH GVPVLLDGAQ SAVHQSIDVQ DLGCDFFVFT GHKVYGPTGI
GVLYGKKEWL ERLPPYQGGG EMIQTVTEDA ITYNEPPHRF EAGTPAIVEA VGLGAALEFM
MNLGRERIAA HEAALSAYAH QRLSEMNALR IIGRARSKGA VISFEMKGAH AHDIATVIDR
QGVAVRAGTH CAMPLLSRFG TTSTCRASFG LYNTRDEVDA LVAALAKAEM MFA