Gene Information |
Locus tag | Msed_1665 |
Symbol | |
ID | 5104870 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | + |
Start bp | 1607505 |
End bp | 1608995 |
Gene Length | 1491 bp |
Protein Length | 496 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640507559 |
Product | glycine dehydrogenase subunit 2 |
Protein accession | YP_001191744 |
Protein GI | 146304428 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | |
|
|
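The coordinate and length fields above can be cross-checked with a short calculation. This is a minimal sketch, not part of the original record: the function names are hypothetical, and it assumes inclusive 1-based coordinates and a single terminal stop codon, which is consistent with the reported values for this unspliced CDS.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for inclusive, 1-based coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp / End bp fields of this record.
length = gene_length_bp(1607505, 1608995)
print(length, protein_length_aa(length))  # 1491 496
```

Both results agree with the Gene Length (1491 bp) and Protein Length (496 aa) fields.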
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00835735 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGGAGGC AGGCGTATTG GGATGAACCT CTAATCACGG AGTATAAGGG GAAGGGAAGA CAAGGCTTCC TCGTACCTAA GGAAGACCTA GACGTGGAAA TTAAGTTACC TGAAAAGATC AAGAGGGGAA AGGAGCCTGA ACTTCCCGAG GTTTCGGAGC TTGAGGTCGT TAGGCACTTC GTGAGACTAT CCCAGATGAG CTTTGGCGTT GACACTGGAA TGATGCCTTT GGGATCATGC ACCATGAAGT ACAATCCAAA GATAGAGGAA GAGACAGGGG TTGCAGATAG GACTCACCCA TTACAGGATC AGGACACTGT TCAGGGGAAC CTAGAGGTAA TGTACGAAAT GCAAAGGTGG CTTGCTGAGG CAACGGGAAT GGACGAATGT AGCTTACAGG TTCCGGCGGG ATCAGCTGGC GAACTGGCTG GCGTGCTCAT GATCAGGAAA TACCACAGGG ATCAGAATAG GAGGAGGGAG GAGATGCTTG TTGCTGACTC AGCCCACGGA ACAAATCCGG CAAGCGCAGC AATGGCTGGC TTCTCAGTGA TCTACATCAA GTCTAACCAG GAGGGCTTGG TTGACCTCAA CGTGCTCAAA GGGACAATAT CAGATAACGT TGCAGGGTTC ATGTTAACTA ATCCTAATAC CTTGGGACTC TTTGAGGAAA ACATCAAGGA GATAGCTGAG CTGGTTCACT CGGTAGACGG AGTCCTCTAC TATGATGGCG CTAACCTAAA CGGAATCCTG GGAATAGTGA GACCAGGGGA CATGGGATTT GATATAGTTC ATCTCAATCT TCACAAGACC TTCGGAGTCC CCCATGGGGG TGGAGGTCCA GGGGCTGGAG CCGTGTGTGC CAAGGGTAAG ATGACTAAGT ATCTCCCGTA CCCCATAGTG TCCAAGGGAG AGAGGTACTA TCTTGTTAAG CCTGAGAGGT CCATAGGGAA GATCTCGGTG TTTAACGGAA ACTTTGGTAA CCTGATGAGG TCCTATGCCT ACATTCTTGG CCTTGGCGGA AAGGGAGTGT CCATGATTGG AAGAATGAGT ACATTGGCCA CAAACTACCT GATAGCGAAA CTTAGAGGAG TGAGGGGACT GGAGTTGATG GCTCCTCACC GGTTCAGAAA ACATGAGGTA GTATTCAGCG CAAAGAAACT GGCAGAGGAA ACTGGGGTGA CTGCGTTTGA TATAGCCAAG GCTTTACTCG ACAGGGGCTT CTATGCGCCC ACCATATACT TCCCGCCCAA TGTGGAGGAG GCTCTGATGA TCGAGCCCAC AGAAACTGAG CCCATAGAAG TCCTGGATCA GTACGCCAAC GCAATCAAGG ATATTGTGGA GAAAGCATAT TCCAACCCTT CCTCCATTAC TTCGGCTCCC CAAAACACGT CAGTGGGTAG ACTTGATCAG GTTAAGGCAA ATCATCCGAG CACTATGACC CCAACCTATA GGGTTCTCAA GTCTAGGTTA GCGAGCCAAG GAAGAAAGTA G
|
Protein sequence | MWRQAYWDEP LITEYKGKGR QGFLVPKEDL DVEIKLPEKI KRGKEPELPE VSELEVVRHF VRLSQMSFGV DTGMMPLGSC TMKYNPKIEE ETGVADRTHP LQDQDTVQGN LEVMYEMQRW LAEATGMDEC SLQVPAGSAG ELAGVLMIRK YHRDQNRRRE EMLVADSAHG TNPASAAMAG FSVIYIKSNQ EGLVDLNVLK GTISDNVAGF MLTNPNTLGL FEENIKEIAE LVHSVDGVLY YDGANLNGIL GIVRPGDMGF DIVHLNLHKT FGVPHGGGGP GAGAVCAKGK MTKYLPYPIV SKGERYYLVK PERSIGKISV FNGNFGNLMR SYAYILGLGG KGVSMIGRMS TLATNYLIAK LRGVRGLELM APHRFRKHEV VFSAKKLAEE TGVTAFDIAK ALLDRGFYAP TIYFPPNVEE ALMIEPTETE PIEVLDQYAN AIKDIVEKAY SNPSSITSAP QNTSVGRLDQ VKANHPSTMT PTYRVLKSRL ASQGRK
|
| |
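The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the database's own tooling: the helper names (clean, translate, gc_content) are hypothetical, and the codon assignments are those of the standard genetic code, which translation table 11 shares. Table 11 additionally allows alternative start codons, but that needs no special handling here because this gene begins with ATG.

```python
# Codon table built from the usual compact TCAG ordering.
# Assumption: standard codon assignments, which translation table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence in this record.
prefix = "ATGTGGAGGC AGGCGTATTG GGATGAACCT"
print(translate(prefix))  # MWRQAYWDEP
```

The output matches the first block of the protein sequence. Passing the full gene sequence to translate and gc_content would give values to compare against the Protein sequence and GC content fields.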