Gene Msed_0547 details

Gene Information

Locus tag: Msed_0547
Symbol:
ID: 5103707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 503238
End bp: 504647
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 49%
IMG OID: 640506451
Product: exsB protein
Protein accession: YP_001190646
Protein GI: 146303330
COG category: [M] Cell wall/membrane/envelope biogenesis
    [R] General function prediction only
COG ID: [COG0449] Glucosamine 6-phosphate synthetase, contains amidotransferase and phosphosugar isomerase domains
    [COG0603] Predicted PP-loop superfamily ATPase
TIGRFAM ID: [TIGR00364] exsB protein
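
The coordinates and lengths listed above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the 1410 bp gene length includes the terminal stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length_bp = gene_length(503238, 504647)
print(length_bp, protein_length(length_bp))  # 1410 469
```

Both values match the Gene Length and Protein Length fields of this record.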


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTGTAGCG TAACAGGAGT TCTTATCATG GACCCCAGCA AATACGCTGA AGTTAATCAG 
AAGTTAAAGT CGATCCTGAT CAGGGCTGAG GATAGGGGTA GAGACAGTTT TGGAGTTATT
GCCGTTGAGG AGGACGGACA TGTTAGGTCT GTCAAGTCGC TTGGGAGACC CTCACTTAAC
CAGGAGAAAC TGGATGGAAT AATCACCGAG AAAACTAGGG TTCTGGTGGC AAATAACAGG
GCAGAGCCAA CTACCGAGTT CGTTAGGTTC AAGATGGAGA GGGATATCCA ACCTTTCCTT
GGGGATAGAT TCATCGTGTC CCACAACGGC ATCATTGCAA ACGACAAGGA GATTGAGAAA
AAGTATGAAA TAAAGAGACT CACCACGATT GACAGTGCCA TCCTCCCTCC TCTCCTTGAT
AAGAAGTGGG ATGGTAAACT TGAAACCCTT AGGGACATCC TGAAGGAATT GAGGGGGAGC
TACGCTCTGG TCATCGCTGA CAGGGAAAGG CCTGACAGAA TATTCCTGGG ACAGAACTTT
AAACCCCTGT ACATGGCCTA TGACGTAGAT TTGAACGCTG TGTTCTTCAC CTCGCTTGAT
GACTATTTTG ACGTTAAGCC CTTCGACCAC GTTAACGTTA GGAAGCTAGA ACCATACTCG
GTGGTGGAGG TAACACTGAA CAAGGAATTC AGGACACTGT CCCTTTATGC ACAACCTAGG
AGGAGGGCCC TCGTTATAGC CAGCGGTGGT CTAGACTCTA CCGTCGCTGC AACCAAGATG
ATCAGGGAAG GTTATCAGGT CACACTCCTT CATTTCAACT ACCATCACAA GGCAGAGGAA
AAGGAAAGGG AAGCAGTCAG GAAGATAGCC TCATACCTCA ACGCGGATCT AATGGAGATA
AACACGGATC TATTTTCCTT GATCGGGAAT GCCACCCTAC TCAAGGGAGG AGGGGAGATA
GTCAAGGACA GGAAAGGAGA AGAGGGGGCT GAGTTCGCCC ACGAGTGGGT TCCAGCAAGA
AACGCGATCT TCTTCACAGT TGCCATGGCC ATAGCTGAGG CCAAGGGTTA CGACGCCATA
GTTTCAGGGA TAAACCTTGA GGAGGCTGGA GCTTACCCAG ATAACGAGAT GGAGTTCGTG
AGGATGTTTC AGAGGCTTTC TCCATATGCC GTAGGTCCAG AGAAACGGGT CGACGTATTA
ATGCCCGTGG GGAATCTGGT GAAGCACGAA ATAGTCAAGT TAGGGCTGGA GATAGGTGCA
CCACTTCATC TAACCTGGAG TTGTTACGAG GGAGGAGAGA AACATTGCGG AAAGTGCGGT
CCATGTTACA TGAGAAGGGT AGCATTCGAG ATAAATGGAG TTAAGGATCC TGTAGAGTAC
GAGTCACTTG ATGATCAGAG CAGGCACTAG
 
Protein sequence
MCSVTGVLIM DPSKYAEVNQ KLKSILIRAE DRGRDSFGVI AVEEDGHVRS VKSLGRPSLN 
QEKLDGIITE KTRVLVANNR AEPTTEFVRF KMERDIQPFL GDRFIVSHNG IIANDKEIEK
KYEIKRLTTI DSAILPPLLD KKWDGKLETL RDILKELRGS YALVIADRER PDRIFLGQNF
KPLYMAYDVD LNAVFFTSLD DYFDVKPFDH VNVRKLEPYS VVEVTLNKEF RTLSLYAQPR
RRALVIASGG LDSTVAATKM IREGYQVTLL HFNYHHKAEE KEREAVRKIA SYLNADLMEI
NTDLFSLIGN ATLLKGGGEI VKDRKGEEGA EFAHEWVPAR NAIFFTVAMA IAEAKGYDAI
VSGINLEEAG AYPDNEMEFV RMFQRLSPYA VGPEKRVDVL MPVGNLVKHE IVKLGLEIGA
PLHLTWSCYE GGEKHCGKCG PCYMRRVAFE INGVKDPVEY ESLDDQSRH
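
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is given on the coding strand in the 5' to 3' direction, uses the codon assignments of NCBI translation table 11, and translates the initiator codon (here GTG) as methionine. The helper names `clean`, `translate_cds`, and `gc_fraction` are hypothetical and written for this example only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)
# Initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a CDS; the start codon becomes M and translation ends at a stop."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a table 11 start codon")
    residues = ["M"]
    for codon in codons[1:]:
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTGTAGCG TAACAGGAGT TCTTATCATG"))  # MCSVTGVLIM
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_fraction` would allow the same comparison against the complete protein sequence and the reported GC content.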