Gene Msed_1885 details

Gene Information

Locus tag: Msed_1885
Symbol:
ID: 5104153
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Metallosphaera sedula DSM 5348
Kingdom: Archaea
Replicon accession: NC_009440
Strand:
Start bp: 1828444
End bp: 1830384
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 50%
IMG OID: 640507771
Product: glutamate synthase (NADPH) GltB1 subunit / glutamate synthase (NADPH) GltB3 subunit
Protein accession: YP_001191949
Protein GI: 146304633
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0067] Glutamate synthase domain 1
        [COG0070] Glutamate synthase domain 3
TIGRFAM ID:
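
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields above.
length_bp = gene_length(1828444, 1830384)
print(length_bp)                           # 1941
print(expected_protein_length(length_bp))  # 646
```

Both results agree with the Gene Length (1941 bp) and Protein Length (646 aa) fields listed in the record.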


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.273779
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCTCGC CATCGGGTTG TGGAGTTTTC GGGGTTCTGA GGAAACCACA CGCCCCTAAG 
ATACAGGGAG ATTCCGTGGT AAGATCCATA GACGCTGTAA GCTATAGGGG AAGCGATAAG
GGGGCAGGGT TTGCTGTATT TAACCTGGAG GAGGGCAACT ACTACTACCT AAAGGCGTTC
TTCCTAGATG ACCCCGGAAA GATGAGGATA GCGATGGAGA GCCAAGGTCT TCAGGTGATA
GAGGAGAGCA TCGAGGCCGA GGATAGGGGG GTATGCAGTT GTAGCTATAA GGTTTCCATC
GGGAACATAG CCCAGTTAAG GAAGGCAGTG AGGAACCTCA ACGAGATGCT ATGGCCAGAG
AGGAAGGGTA GGATATACAG CATGGGTAAA TCCCTTCAGG TGTTCAAGGG AGTCGGCTAT
CCCAAGGATA TAGCCAAGAT ATATGACGTA AACAAGTACG AGGGAGATAT GTGGTTGGCC
CACACAAGAC AGCCAACCAA CTCCCCAGGT AGTTATCCCT ATTGGTCTCA CCCCTTCTCC
TCCTTTGATG TGGCCATCGT TCACAACGGT GACGTTAGCT CCTTCGGTGC CAACCTGGAG
TTCCTTCAAT CCAGGGGATG GGGAGGTTTC GTGGGGACGG ACAGTGAGGT CATGGCGTTC
CTCTTCGAGG AGTTGATCAG CGAAGGTCTC ACTGTGGAAG AAGTTGCAAA GATCCTGGTT
AATCCTTCCA GGAGGACCAG TGCCATATCC CCGCATCATG ATTACCTTTA TAGGAACGCG
AGACTCGATG GACCCTTCAC TGCGGTGATT GGGTATGACT CTGGGGATGA CCTATATCTT
GTGGGTTTGG CCGATAGATC CAAGTTCAGG CCTGTGTTGA TTGGCGAGGA CGATTACTAC
TACTATATTG CGAGCGAGGA AAGTCAGATA AGACTCATGA GTCGCGAGGC GAGAGTTTGG
ACCCTCAGCC CCGGATCTTA CTTCATTGCT TCCTTAAGGA AAGGGATACT TAGTCATGGG
AGAGAGCTAG AGGAGATAAG GAATTTCTCT CCTCCTCCTA CCTTTGTTTC CCCAAATTAT
GATATAGACG CTACTGCAAT TGGTTACAAA GACCTGGACA AGGAAATTCT TAGGACTGGG
AAGAAGGAGG TCAAGGTTGT AAACGTCCTG GGACACAGGT TCATTGGTAT AAAGTTTCCC
AGGGGAGGGC TTAAGGTCAG GTTATATGGG GTTGTGGGAA ACGCAATGGC TAACCTCAAC
GAGAACAACG AATTCTACGT TTACGGTAAC GTTGCAGACG ACTGTTGTGA CACCATGCAT
GGGGGAAAGG TGGTGATTAC CGGGGACGCA AGGGACGTTC TCGCGCAAAC TCTTCAGGGA
GGGAAAGTTT TCGTTGGCGG AAATGCGGGC AACAGGGTCG GCATACAGAT GCGTGAATAC
GCCAACAAGA GACCCTACCT GGTGATAGGT GGAAGGGTGG ATGACTATCT TGGGGAATAC
ATGGCAGGAG GGGTGATCAT GGTTCTTGGA ATGAGGGAGA AGGGTGAAAA AACGGGCAAC
TTCGTGGGAA CAGGAATGGT TGGGGGAAAG ATATACGTAC GTGGTAGGGT AGACCCTGGA
AGGATAGGGA TGCAACCCAA TAGGCTAGAG GTCATGAGAC TTCTAAAGGC ACTCGCCATG
GAGGGTTACG TGCACGACGT GGACTATAAC ATGTCATATA TTGACGTGAT GAAAAAATTG
GAGGGCGAAG CTAAGAAGTA CGCCAAGAGG CTTTTCGAGG AAAAGGTGGG AATACCGACT
TACGAGTACA GGGAGTTGAG TGACTCGGAG TTTAAGGAAG TTGAGCCTAT CATAAGGGAG
TACGATCAAG ATCTAGGTAC AAGAGCTACT GAACTTTTGA GCGAGAAATT CACCGTTATT
TACCCGTCCA AGGAGAAATA A
 
Protein sequence
MISPSGCGVF GVLRKPHAPK IQGDSVVRSI DAVSYRGSDK GAGFAVFNLE EGNYYYLKAF 
FLDDPGKMRI AMESQGLQVI EESIEAEDRG VCSCSYKVSI GNIAQLRKAV RNLNEMLWPE
RKGRIYSMGK SLQVFKGVGY PKDIAKIYDV NKYEGDMWLA HTRQPTNSPG SYPYWSHPFS
SFDVAIVHNG DVSSFGANLE FLQSRGWGGF VGTDSEVMAF LFEELISEGL TVEEVAKILV
NPSRRTSAIS PHHDYLYRNA RLDGPFTAVI GYDSGDDLYL VGLADRSKFR PVLIGEDDYY
YYIASEESQI RLMSREARVW TLSPGSYFIA SLRKGILSHG RELEEIRNFS PPPTFVSPNY
DIDATAIGYK DLDKEILRTG KKEVKVVNVL GHRFIGIKFP RGGLKVRLYG VVGNAMANLN
ENNEFYVYGN VADDCCDTMH GGKVVITGDA RDVLAQTLQG GKVFVGGNAG NRVGIQMREY
ANKRPYLVIG GRVDDYLGEY MAGGVIMVLG MREKGEKTGN FVGTGMVGGK IYVRGRVDPG
RIGMQPNRLE VMRLLKALAM EGYVHDVDYN MSYIDVMKKL EGEAKKYAKR LFEEKVGIPT
YEYRELSDSE FKEVEPIIRE YDQDLGTRAT ELLSEKFTVI YPSKEK
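
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration only, not the tool the database used. It assumes the gene sequence is given 5' to 3' in the coding frame, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard table (the two differ only in permitted start codons). The `translate` helper is a hypothetical name.

```python
from itertools import product

# Codon -> amino acid mapping shared by NCBI translation tables 1 and 11,
# built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence listed above.
print(translate("ATGATCTCGC CATCGGGTTG TGGAGTTTTC"))  # MISPSGCGVF
print(translate("TACCCGTCCA AGGAGAAATA A"))           # YPSKEK
```

The last line of the gene sequence starts at position 1921, which keeps it in frame, so it can be translated on its own. The two outputs match the first ten and last six residues of the protein sequence in the record.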