Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_1885 |
Symbol | |
ID | 5104153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1828444 |
End bp | 1830384 |
Gene Length | 1941 bp |
Protein Length | 646 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507771 |
Product | glutamate synthase (NADPH) GltB1 subunit / glutamate synthase (NADPH) GltB3 subunit |
Protein accession | YP_001191949 |
Protein GI | 146304633 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0067] Glutamate synthase domain 1 [COG0070] Glutamate synthase domain 3 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.273779 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCTCGC CATCGGGTTG TGGAGTTTTC GGGGTTCTGA GGAAACCACA CGCCCCTAAG ATACAGGGAG ATTCCGTGGT AAGATCCATA GACGCTGTAA GCTATAGGGG AAGCGATAAG GGGGCAGGGT TTGCTGTATT TAACCTGGAG GAGGGCAACT ACTACTACCT AAAGGCGTTC TTCCTAGATG ACCCCGGAAA GATGAGGATA GCGATGGAGA GCCAAGGTCT TCAGGTGATA GAGGAGAGCA TCGAGGCCGA GGATAGGGGG GTATGCAGTT GTAGCTATAA GGTTTCCATC GGGAACATAG CCCAGTTAAG GAAGGCAGTG AGGAACCTCA ACGAGATGCT ATGGCCAGAG AGGAAGGGTA GGATATACAG CATGGGTAAA TCCCTTCAGG TGTTCAAGGG AGTCGGCTAT CCCAAGGATA TAGCCAAGAT ATATGACGTA AACAAGTACG AGGGAGATAT GTGGTTGGCC CACACAAGAC AGCCAACCAA CTCCCCAGGT AGTTATCCCT ATTGGTCTCA CCCCTTCTCC TCCTTTGATG TGGCCATCGT TCACAACGGT GACGTTAGCT CCTTCGGTGC CAACCTGGAG TTCCTTCAAT CCAGGGGATG GGGAGGTTTC GTGGGGACGG ACAGTGAGGT CATGGCGTTC CTCTTCGAGG AGTTGATCAG CGAAGGTCTC ACTGTGGAAG AAGTTGCAAA GATCCTGGTT AATCCTTCCA GGAGGACCAG TGCCATATCC CCGCATCATG ATTACCTTTA TAGGAACGCG AGACTCGATG GACCCTTCAC TGCGGTGATT GGGTATGACT CTGGGGATGA CCTATATCTT GTGGGTTTGG CCGATAGATC CAAGTTCAGG CCTGTGTTGA TTGGCGAGGA CGATTACTAC TACTATATTG CGAGCGAGGA AAGTCAGATA AGACTCATGA GTCGCGAGGC GAGAGTTTGG ACCCTCAGCC CCGGATCTTA CTTCATTGCT TCCTTAAGGA AAGGGATACT TAGTCATGGG AGAGAGCTAG AGGAGATAAG GAATTTCTCT CCTCCTCCTA CCTTTGTTTC CCCAAATTAT GATATAGACG CTACTGCAAT TGGTTACAAA GACCTGGACA AGGAAATTCT TAGGACTGGG AAGAAGGAGG TCAAGGTTGT AAACGTCCTG GGACACAGGT TCATTGGTAT AAAGTTTCCC AGGGGAGGGC TTAAGGTCAG GTTATATGGG GTTGTGGGAA ACGCAATGGC TAACCTCAAC GAGAACAACG AATTCTACGT TTACGGTAAC GTTGCAGACG ACTGTTGTGA CACCATGCAT GGGGGAAAGG TGGTGATTAC CGGGGACGCA AGGGACGTTC TCGCGCAAAC TCTTCAGGGA GGGAAAGTTT TCGTTGGCGG AAATGCGGGC AACAGGGTCG GCATACAGAT GCGTGAATAC GCCAACAAGA GACCCTACCT GGTGATAGGT GGAAGGGTGG ATGACTATCT TGGGGAATAC ATGGCAGGAG GGGTGATCAT GGTTCTTGGA ATGAGGGAGA AGGGTGAAAA AACGGGCAAC TTCGTGGGAA CAGGAATGGT TGGGGGAAAG ATATACGTAC GTGGTAGGGT AGACCCTGGA AGGATAGGGA TGCAACCCAA TAGGCTAGAG GTCATGAGAC TTCTAAAGGC ACTCGCCATG GAGGGTTACG TGCACGACGT GGACTATAAC ATGTCATATA TTGACGTGAT GAAAAAATTG GAGGGCGAAG CTAAGAAGTA CGCCAAGAGG CTTTTCGAGG AAAAGGTGGG AATACCGACT TACGAGTACA GGGAGTTGAG TGACTCGGAG TTTAAGGAAG TTGAGCCTAT CATAAGGGAG TACGATCAAG ATCTAGGTAC AAGAGCTACT GAACTTTTGA GCGAGAAATT CACCGTTATT TACCCGTCCA AGGAGAAATA A
|
Protein sequence | MISPSGCGVF GVLRKPHAPK IQGDSVVRSI DAVSYRGSDK GAGFAVFNLE EGNYYYLKAF FLDDPGKMRI AMESQGLQVI EESIEAEDRG VCSCSYKVSI GNIAQLRKAV RNLNEMLWPE RKGRIYSMGK SLQVFKGVGY PKDIAKIYDV NKYEGDMWLA HTRQPTNSPG SYPYWSHPFS SFDVAIVHNG DVSSFGANLE FLQSRGWGGF VGTDSEVMAF LFEELISEGL TVEEVAKILV NPSRRTSAIS PHHDYLYRNA RLDGPFTAVI GYDSGDDLYL VGLADRSKFR PVLIGEDDYY YYIASEESQI RLMSREARVW TLSPGSYFIA SLRKGILSHG RELEEIRNFS PPPTFVSPNY DIDATAIGYK DLDKEILRTG KKEVKVVNVL GHRFIGIKFP RGGLKVRLYG VVGNAMANLN ENNEFYVYGN VADDCCDTMH GGKVVITGDA RDVLAQTLQG GKVFVGGNAG NRVGIQMREY ANKRPYLVIG GRVDDYLGEY MAGGVIMVLG MREKGEKTGN FVGTGMVGGK IYVRGRVDPG RIGMQPNRLE VMRLLKALAM EGYVHDVDYN MSYIDVMKKL EGEAKKYAKR LFEEKVGIPT YEYRELSDSE FKEVEPIIRE YDQDLGTRAT ELLSEKFTVI YPSKEK
|
| |