Gene Msed_0216 details

Gene Information

Locus tag: Msed_0216
Symbol:
ID: 5104082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 177695
End bp: 178948
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 50%
IMG OID: 640506121
Product: glutamate-1-semialdehyde aminotransferase
Protein accession: YP_001190317
Protein GI: 146303001
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0001] Glutamate-1-semialdehyde aminotransferase
TIGRFAM ID: [TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase
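
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following minimal sketch checks this, assuming 1-based inclusive coordinates and a gene length that includes the stop codon (both are assumptions inferred from the listed values, not stated by the record). The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for a CDS whose length includes one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1  # subtract the stop codon


length = gene_length_bp(177695, 178948)
residues = protein_length_aa(length)
print(length, residues)  # 1254 417
```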


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000686637
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0344059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCAGTG ATTTGTGGAA TAAGGCTCAA GCTGTCTTTG CAGGTGGAGT AAATAGCCCA 
GTAAGGGCTG CGGTAAAGCC ATTTCCCTTC TATACTAGAT CTGCTAAGGG CGCATACCTA
GTTACAGAGG ATTCAAGGAG ACTCATTGAT TTCGTTTTGG GATATGGTCC CCTGATTCTT
GGTCACGCCC ACCCCAAGGT AATCGAGGCA GTGAAGGACC AGTTAGAGAG GGGATGGTTA
TACGGAACCC CCAGCAGGGC GGAGGTTGAG CTTGCCTCCA TGATTACCAA GCATGTACCA
TCTGCCCAGA AGGTCAGATT CGTGAATAGT GGCACAGAAG CTACCATGAC CGCACTCAGG
TTAAGCAGGG GATTCACTGG TAGGAGTAAG ATACTCAAGT TCGACGGTAA TTATCACGGG
GCCCACGATT ACGTTCTCAT CGACGCGGGA AGCGCAGCTT CGGAGTTCGG AGTTCCCTTC
TCTCAGGGAA TACCCACGGA GGTCTCATCC ACCGTGTTGG TTTGCCCCTA CAATGACTTG
ACCTGCGTGG AGACGGTATT GAAGAGGGAG GAGATAGCTG GGGTAATAGT GGAGCCTGTG
ATGGGAAACA TGGGAGTTAT TCCACCTGAG AAGGATTTCT TACCAGGTTT GAGGAAGCTC
ACCAGAGAAT ACGGGTCAGT GCTAATTTTC GACGAAGTGA TTACGGGCTT CAGGTTAGGA
TTAGGTGGCG CGCAATCCTA TTTTGGGGTG ACGCCCGACC TGACAACCTT GGGTAAGATA
ATAGGCGGAG GTTTCCCCAT AGGTGCAGTT TGCGGTAGGA AAGAGATAAT GGACCAGCTC
ACCCCCTCTG GAAGAGTGTT CAACGCTGGA ACCTTTAATG CAAACCCAAT TTCCATGACA
GCCGGGATAG CCACAATACA AGAACTTGAG AAAAAGGAGG TGTACACGGT GAGCGAGAGG
GCTGCAAGGG TCTTGGCTGA GGAGATAGAT CGGGCCATAA AGATGGACCA CGTCGTGAAC
AGGGTCTACA ACTTTTTCCA GTTCTTCCTT GGGGTGAAGG ATGTCAGAAA CGCTAATGAT
GCTCGTAAGG CAAAGAGGGA TCTTTACGTT AAAGTTCACG AGAGTTTACT GAAGGAGGGG
GTATTCATCC CTCCAAGTCA GTTTGAGGCC CTCTTTACCT CAGGAGCTCA TGATGACGAC
GTGGTGAATG AAAGTGCTAG GGTCTTCAAG AAGGTACTGG AGGATCTAAC GTGA
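
The GC content reported for this gene can be recomputed from the gene sequence. The sketch below is one simple way to do so; `gc_percent` is an illustrative helper, not a function of any particular toolkit, and it assumes the sequence contains only A, C, G, and T apart from the spaces and line breaks used for layout. It is shown on the first row only; pasting all rows into `sequence` would give the gene-wide figure.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring the whitespace used for layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence; replace with the full sequence as needed.
sequence = "TTGAGCAGTG ATTTGTGGAA TAAGGCTCAA GCTGTCTTTG CAGGTGGAGT AAATAGCCCA"
print(round(gc_percent(sequence), 1))
```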
 
Protein sequence
MSSDLWNKAQ AVFAGGVNSP VRAAVKPFPF YTRSAKGAYL VTEDSRRLID FVLGYGPLIL 
GHAHPKVIEA VKDQLERGWL YGTPSRAEVE LASMITKHVP SAQKVRFVNS GTEATMTALR
LSRGFTGRSK ILKFDGNYHG AHDYVLIDAG SAASEFGVPF SQGIPTEVSS TVLVCPYNDL
TCVETVLKRE EIAGVIVEPV MGNMGVIPPE KDFLPGLRKL TREYGSVLIF DEVITGFRLG
LGGAQSYFGV TPDLTTLGKI IGGGFPIGAV CGRKEIMDQL TPSGRVFNAG TFNANPISMT
AGIATIQELE KKEVYTVSER AARVLAEEID RAIKMDHVVN RVYNFFQFFL GVKDVRNAND
ARKAKRDLYV KVHESLLKEG VFIPPSQFEA LFTSGAHDDD VVNESARVFK KVLEDLT
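
The protein sequence follows from the gene sequence under translation table 11. The gene begins with TTG, which table 11 accepts as an alternative start codon, so it is read as methionine rather than leucine. The sketch below illustrates this on the first row of each sequence. It is a simplified translator written for this example: the codon table is built from the standard NCBI amino acid string (table 11 uses the same codon assignments as the standard code and differs in its start codons), and `translate_cds` is an illustrative name.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid start codon as M
    and stopping at the first stop codon."""
    clean = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(clean) - 2, 3):
        codon = clean[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


gene_row = "TTGAGCAGTG ATTTGTGGAA TAAGGCTCAA GCTGTCTTTG CAGGTGGAGT AAATAGCCCA"
protein_row = "MSSDLWNKAQ AVFAGGVNSP"

peptide = translate_cds(gene_row)
print(peptide == protein_row.replace(" ", ""))  # True
```

Applying the same function to the last row of the gene sequence stops at the terminal TGA, which is why the 1254 bp gene yields 417 residues.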