Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_5031 |
Symbol | |
ID | 4246686 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7687850 |
End bp | 7690774 |
Gene Length | 2925 bp |
Protein Length | 974 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638109840 |
Product | glycine dehydrogenase |
Protein accession | YP_724416 |
Protein GI | 113478355 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0403] Glycine cleavage system protein P (pyridoxal-binding), N-terminal domain [COG1003] Glycine cleavage system protein P (pyridoxal-binding), C-terminal domain |
TIGRFAM ID | [TIGR00461] glycine dehydrogenase (decarboxylating) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTAGCAA CATATAAATC TAATATACAG TCAAGTTATC AAATACAACT GGCCAACCAA AACCAAGGTC AGAGGCCAAT AGACTTTTCC CAACGACATA TTGGTCTTAC CTCATCAGAA ATCCAACAAA TGCTGGAAGT ATTAGGTATT TCCTCCCTAG AAGACTTAAT TGACAAAACA GTTCCCGAAA AAATTCGATT CCAGAAACCA CTCAATTTGC CCAAGTCTCT GAGTGAAAAT GCGGCACTTG CTCAAATTAA AGAAATAATC TCTAAAAATC AGATATTTCG TTCCTTTATT GGAATGGGTT ATTATGACTG CATTACCCCA CCAGTTATCC TTCGCAATAT ACTAGAAAAC CCTGGTTGGT ACACAGCTTA TACTCCCTAT CAAGCAGAAA TAGCTCAAGG CCGGATGGAG GCGTTGCTGA ATTTTCAAAC CATGATTACA GACTTAACAG GTTTAGAAAT AGCTAATGCT TCACTACTTG ATGAAGCTAC AGCAGCAGCG GAAGCGATGA GCATGACTTA TGGTTTATGT AAAACTAAAG CCGAAGTTTT CTTTGTTGAC TCTGCCTGCC ATCCTCAGAA TATTGAAGTT GTCAAAACTA GGGCACAACC ATTGGGAATA GAAGTAATAG TCGGGGACTT CCGAACCTTC ACTTTTGACA AACCAATTTT TGGTGCACTC CTGCAATATC CCGCCACTAA TGGAGCAATT TATGACTATC GCGAATTTGT GGAAAAAGTT CACAAGGTAG GAGGTTTAGT AACTGTTGCT GCTGAATTAC TAAGTTTAAC CTTACTCACA CCCCCTGGAG AATTTGGTGC TGATATTGCT GTAGGTAATA CTCAACGTTT TGGCGTGTCT CTTGGATACG GAGGCCCCCA TGCTGCTTAC TTTGCTACTA AAGAAGCTTA TAAACGACAA ACTCCTGGAC GTATTGTTGG GGTTTCTCAA GATGCTAATG GTAACCCAGC CTTACGTTTA GCACTACAAA CCAGGGAACA GCATATTCGC CGGGAAAAGG CAACTAGCAA TATTTGTACT GCTCAAGTTT TATTGGCAGT AATAGCAGGT ATGTACGCAG TCTATCACGG TCCGGGTGGT TTAAAACAAA TTGCAGAAAA TATCCATAAT TTGACTTTTA AGTTGGCAAC AGGTTTAAAA CAGCTTGGTT ATCAAATTGG TGCAGAGTTA TTTTTTGATA CTATTGAGAT CAAATTGGGT GCTGACTCTC CTGTGAAAAG TGCAAAAGAA ATTATTGATG CGGCTGAAAA TTTAGGTATT AATCTCCGAA CTTTTGATGA ACAAACGGTT GGTATTTCTC TAGATGAAAC TACCACTGAA GTAGATGTAC AAAATCTGTG GCAAATTTTT GCTAGTGGAG AAAAGTTCCC AAACATCGAA AATGAAAATA TTTCTACCCT TTCTCAAAGT TATTATGCTC GCACTAGCAA TTATCTAACT CACCCAGTAT TTAAAAGTTA TCATTCTGAA ACCAATCTTT TGCGTTATAT TCATCGTTTG CAGTCTAAAG ATTTATCTTT GACAACATCA ATGATACCTT TGGGTTCCTG TACAATGAAA CTTAATGCTA CAGCAGAAAT GATACCTGTA ACTTGGCCTG AATTTGCCAA TATTCACCCT TTTTCTCCCA TTTCTCAAAC TCAGGGTTAT CAAATAATAT TTCAGCAATT AGAAGAATGG TTGGCAGAAA TCACAGGTTT TGCAGAAATT TCTCTACAAC CAAACGCAGG TTCACAAGGA GAATATACGG GTTTATTAGT AATTCGCGAA TATCATGCTC ACCGTGGGGA AGCACATCGT GATATTTGTT TGATCCCTGA ATCTGCCCAC GGTACAAACC CTGCTTCTGC AGTAATGAGT GGGTTAAAGG TTGTGGTTGT TAAATGTGAT GCCCAAGGCA ATATAGATAT TGCAGATTTA CAGACAAAGG CAGAAAAGCA TAAAGATAAT TTAGCTGCAA TAATGATTAC ATACCCCTCT ACTCACGGTG TCTTTGAGGA AGAAATTCTT GATATTTGTG AAATTATCCA TGCTCATGGT GGGCAAGTTT ATATGGATGG GGCAAATATG AATGCTCAAG TAGGATTATG TCGTCCCGCG GAAATAGGTG CTGATGTTTG TCATTTGAAT TTACATAAAA CCTTTTGTAT TCCTCATGGT GGCGGAGGTC CAGGAATGGG TCCGATAGGG GTTAAGTCTC ACTTGGCACC GTTTTTACCA GGGCATTCTG TTATTAATTT GGGAGGGGAA AATTCTAGTG GAGCTGTATC TGCTGCACCC TGGGGTAGCG CTAGTATTCT GCCTATTTCT TGGATGTATA TTGCGATGAT GGGGACGGAT GGTTTGACTG AAGCAACTAA GATAGCGATT TTGAATGCTA ATTATATTGC CCAACGTTTG GGAAGTTATT ATTCAGTTTT GTACAAGGGT AAGTATGGGT TTATTGCTCA CGAGTGCATT TTGGATTTGC GTCCTTTGAA AAAGTTGGCT GGTATTGAGG TGGAGGATAT TGCTAAACGT TTGATGGACT ATGGTTTTCA TGCGCCGACT GTCTCTTGGC CTGTGGCGGG TACAATTATG GTGGAACCGA CAGAGAGTGA GTCTAAGGAT GAGTTAGACC GTTTTTGTGA CGCGATGATT TCTATTCGTC AGGAAATAGA GGAGATCGAA ACTGGTAAGG CAGATAAAAA TGATAATTTG TTGAAAAATG CGCCTCATAC TGCTGAGAGT TTGATGGTGG ATGAGTGGAA GCATGGTTAT TCTCGACAAC GTGCTGCTTA TCCTGCGCCT TGGACGCGAG AGCATAAATT TTGGCCTGCT GTAGGACGGG TTGATAATGC TTTTGGGGAT CGCAATTTTG TTTGTTCTTG TTTGCCGATA GAGGCTTACA GTTAA
|
Protein sequence | MVATYKSNIQ SSYQIQLANQ NQGQRPIDFS QRHIGLTSSE IQQMLEVLGI SSLEDLIDKT VPEKIRFQKP LNLPKSLSEN AALAQIKEII SKNQIFRSFI GMGYYDCITP PVILRNILEN PGWYTAYTPY QAEIAQGRME ALLNFQTMIT DLTGLEIANA SLLDEATAAA EAMSMTYGLC KTKAEVFFVD SACHPQNIEV VKTRAQPLGI EVIVGDFRTF TFDKPIFGAL LQYPATNGAI YDYREFVEKV HKVGGLVTVA AELLSLTLLT PPGEFGADIA VGNTQRFGVS LGYGGPHAAY FATKEAYKRQ TPGRIVGVSQ DANGNPALRL ALQTREQHIR REKATSNICT AQVLLAVIAG MYAVYHGPGG LKQIAENIHN LTFKLATGLK QLGYQIGAEL FFDTIEIKLG ADSPVKSAKE IIDAAENLGI NLRTFDEQTV GISLDETTTE VDVQNLWQIF ASGEKFPNIE NENISTLSQS YYARTSNYLT HPVFKSYHSE TNLLRYIHRL QSKDLSLTTS MIPLGSCTMK LNATAEMIPV TWPEFANIHP FSPISQTQGY QIIFQQLEEW LAEITGFAEI SLQPNAGSQG EYTGLLVIRE YHAHRGEAHR DICLIPESAH GTNPASAVMS GLKVVVVKCD AQGNIDIADL QTKAEKHKDN LAAIMITYPS THGVFEEEIL DICEIIHAHG GQVYMDGANM NAQVGLCRPA EIGADVCHLN LHKTFCIPHG GGGPGMGPIG VKSHLAPFLP GHSVINLGGE NSSGAVSAAP WGSASILPIS WMYIAMMGTD GLTEATKIAI LNANYIAQRL GSYYSVLYKG KYGFIAHECI LDLRPLKKLA GIEVEDIAKR LMDYGFHAPT VSWPVAGTIM VEPTESESKD ELDRFCDAMI SIRQEIEEIE TGKADKNDNL LKNAPHTAES LMVDEWKHGY SRQRAAYPAP WTREHKFWPA VGRVDNAFGD RNFVCSCLPI EAYS
|
| |