Gene Msed_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_1603 
Symbol 
ID5103967 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp1550102 
End bp1551889 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content46% 
IMG OID640507493 
Productglucosamine--fructose-6-phosphate aminotransferase 
Protein accessionYP_001191682 
Protein GI146304366 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains 
TIGRFAM ID[TIGR01135] glucosamine--fructose-6-phosphate aminotransferase (isomerizing) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0350517 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.152965 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCGGAA TTATTGGAGT AGCATCTATC GGAGCCAAGG ATGCTGCCTT AGCCAATATA 
ACTTTGGACG CATTAAAGAG CTTAGAGTAT AGGGGATACG ACAGTGTAGG AATGGCCTCG
ATGGATAGCT CTCACCTAGA GGTGAGAAAG GCTGCAGGTA ACGTGGAGAA GTTCGAGAGG
TTAAAGAATC CTTTGAACAT GAAGGGAAAC ATTTTCCTGG GCCACACCAG GTGGGCTACT
CACGGTGAGC CAAATGATAT TAATGCTCAT CCCCATCTAG ACTGCACAGG GGAAATAGCG
GTTATACATA ACGGAACTGT GCTCAACTTT CTAGAACTTA AACAAGACTT GATTGCCAAG
GGGCATAGGT TCGTAAGTGA TACAGATACC GAGGTGATTG CCCATCTCAT TGAGCATTAC
AGGAAGCTTG GAATGGACAA CTTCACTGCG TTTAAACAGG CCATATTCAG CATACAGGGA
GACCACGCAG TTCTAGCCAT AATAAAGGGA GATAATAGGA TATTTTTCTC GAAGAAGAAC
AACCCTCTGG TAATAGGATT AGGAGACGAC ATGAACCTTA TATCCAGCGA TGTTTGGAGC
CTGGTTAAGG TTACTAACAG GACCATAACC ATAGGTGACG ACGAACTTGG ATACATAACC
GCTACCTCGG TTTACGCAGA AAAGATTACT GGGGAAAAGG TCAACCTGAC CTCTAGGCTC
ATTATTCAGC AAATAGACAG TTCCGCAACT TCCCTTCAGG GATACGAATC CTTCATGATG
AAAGAAATCA GGGAAAGCTG GGGAGCGGTT AGGGATACCA TAGTTGGACT CATGAATGAC
ATGGAGAAGC TTGGTAAGGC AGTGAAGGCT ATGGACAAGG CACGCAGGAT ACTTGTGGTG
GCAGCTGGGA CAAGTTATCA TGCAGGGTTA ATTTTCGCCT CTAGGTTGAT GAGGACTGGG
AAGACCGTTA TCCCAGTTAT AGCCTCTGAG TATGAAAACG TTAAGGCTGG CGGGGAAGAC
GTTGTTTTAG TCATTAGTCA GAGCGGGGAA ACCATGGACT CACTTCTAGC GATGAAAAGC
TTCAAGAACA GTGGGTCTTT CGTTGTATCC CTAACCAACA CCCTTGGAAA TAGTATCTCA
TACTACAGTG ATATCGCCCT CCATACCAGG GCAGGACCTG AGATAGGTGT TGCTGCAACT
AAGACCTTTA CCTCTCAGGT TGGGGCTTTA CTCCTTATTT CCTCCTTAAT GATTGGAGAA
AACCTGGATT ACCTTAAGGA AGCTGAGAGT ACAGTATCAA GTAGCTTCTC TAAGTCCATT
GGTTATGCAG AAAAAATAGG GGTGGATGTT TCAAAGAAAC AAAGCTTATA CTACCTTGGA
AAGGGACTTG GAGTGCCCAT GGCCATGGAG GGAGCGCTGA AAATTAAGGA GATAGCGTAC
ATTCACGCAG AGGCTTATCC TGCAGGAGAG AGCAAACATG GACCCATTGC GTTGGTAGAG
AAGGACTTTC CCGTGATCTT TGTGAATACT GGCGAGCACG TAGATGAACT AAGAAACAAT
TTGAAAGAGA TGCAGAGCAG AAAGGCCAGG GTATACGTGG TCAGTGCAGG CTCTGAGCTA
AGAAGCGACC AGGAGGTTAC TGAGATCATG ATCGACATTA ACGACGCTAG ACTGGCACCT
CTGGCTTTGG CACCTCCATT GCAACTAATT GCGTATTATG CAGCGAAGGA AAGGGGTCTG
AATCCCGATA GACCCAGAAA CTTGGCTAAG ACGGTGACTG TTAGATGA
 
Protein sequence
MCGIIGVASI GAKDAALANI TLDALKSLEY RGYDSVGMAS MDSSHLEVRK AAGNVEKFER 
LKNPLNMKGN IFLGHTRWAT HGEPNDINAH PHLDCTGEIA VIHNGTVLNF LELKQDLIAK
GHRFVSDTDT EVIAHLIEHY RKLGMDNFTA FKQAIFSIQG DHAVLAIIKG DNRIFFSKKN
NPLVIGLGDD MNLISSDVWS LVKVTNRTIT IGDDELGYIT ATSVYAEKIT GEKVNLTSRL
IIQQIDSSAT SLQGYESFMM KEIRESWGAV RDTIVGLMND MEKLGKAVKA MDKARRILVV
AAGTSYHAGL IFASRLMRTG KTVIPVIASE YENVKAGGED VVLVISQSGE TMDSLLAMKS
FKNSGSFVVS LTNTLGNSIS YYSDIALHTR AGPEIGVAAT KTFTSQVGAL LLISSLMIGE
NLDYLKEAES TVSSSFSKSI GYAEKIGVDV SKKQSLYYLG KGLGVPMAME GALKIKEIAY
IHAEAYPAGE SKHGPIALVE KDFPVIFVNT GEHVDELRNN LKEMQSRKAR VYVVSAGSEL
RSDQEVTEIM IDINDARLAP LALAPPLQLI AYYAAKERGL NPDRPRNLAK TVTVR