Gene Msed_1700 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1700 
Symbol 
ID5105346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1639743 
End bp1640789 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content50% 
IMG OID640507594 
Productaspartate-semialdehyde dehydrogenase 
Protein accessionYP_001191779 
Protein GI146304463 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0136] Aspartate-semialdehyde dehydrogenase 
TIGRFAM ID[TIGR00978] aspartate-semialdehyde dehydrogenase (non-peptidoglycan organisms) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.109656 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.328927 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAC TGAAGGTATC TCTTCTGGGA GCCACGGGGA TGGTAGGGCA AAAGATGGTT 
AGGTTGCTCT CCTCTCATCC CTACATAGAA CTCACCAAGG TTAGCGCTTC CCCGGGCAAG
ATAGGGAAAA GATACATTGA GGCAGTGAAG TGGGTTGAGG GTGGAGAGGT ACCAGAGCAG
GCTAGGGACC TCAAGATAGT GTCTACCGAG CCAGAGGATC ACAAAGACGT AGACGTTGTC
CTTTCGGCCT TACCCAATGA GCTAGCTGAG GGGATAGAGC TCAAACTTGT TAGGGAAGGG
ATAACCGTGG TGTCGAATGC AAGTCCATTC AGAATGGACC CAGAGGTTCC ACTAATAAAT
CCCGAGGTAA ACTGGGATCA TCTGAAGCTT CTCGAAACCC AAAGACAGAA GAGGGGTTGG
AAGGGCCTTC TTGTAAAGAA CCCCAATTGT ACTGCTGCCA TAATGAGCAT GCCGATCAAA
CCCCTTCTTA AGTACCGCTT AAATCACATG ATCATAACCA CCCTCCAGGC GGTAAGCGGT
GCAGGATATA ACGGTCTTTC CTTCATGTCA ATCACAAACA ACGTTATACC CTTCATAAAG
GGGGAGGAGG AAAAGATCCC TAAGGAATCC GGGAAGATGC TTGGGACACT AGTAAACGAC
TCGATCCGCC ACGTGGAGCT CAAAGCATTG GTAACCTCCA CGAGGGTTCC GGTCAAGGTG
GGACATATGG GGGTAATGTA CCTGTTCTTT GACTCCCCGG TTAATGCTGA GGAGGTTAAG
AGGGATCTTT CCTCTTTCAA ATCCTTACCC CAAGAGAGGA ACTTACCCAC TGCTCCCAAG
ACCCCAATCA GGGTACTAGA GGGGGAGGAT AGGCCTCAAC CTGAGATAGA CGTTAGTGCT
GAGAGGGGAA TGGCCATTAG CGTGGGGAGA GTAAAGAACG AAAATGGGGC TCTCAGGATG
GTTGTACTAG GGGATAACTT GGTTAGGGGC GCAGCAGGGA TAACCATTCT CACTTTGGAG
GTTATGAAAG AGCTGGGCTA CGTATGA
 
Protein sequence
MDKLKVSLLG ATGMVGQKMV RLLSSHPYIE LTKVSASPGK IGKRYIEAVK WVEGGEVPEQ 
ARDLKIVSTE PEDHKDVDVV LSALPNELAE GIELKLVREG ITVVSNASPF RMDPEVPLIN
PEVNWDHLKL LETQRQKRGW KGLLVKNPNC TAAIMSMPIK PLLKYRLNHM IITTLQAVSG
AGYNGLSFMS ITNNVIPFIK GEEEKIPKES GKMLGTLVND SIRHVELKAL VTSTRVPVKV
GHMGVMYLFF DSPVNAEEVK RDLSSFKSLP QERNLPTAPK TPIRVLEGED RPQPEIDVSA
ERGMAISVGR VKNENGALRM VVLGDNLVRG AAGITILTLE VMKELGYV