Gene Information |
Locus tag | P9515_11521 |
Symbol | guaB |
ID | 4719045 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | - |
Start bp | 1002210 |
End bp | 1003511 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640080833 |
Product | inosine 5'-monophosphate dehydrogenase |
Protein accession | YP_001011466 |
Protein GI | 123966385 |
COG category | [C] Energy production and conversion |
COG ID | [COG1304] L-lactate dehydrogenase (FMN-dependent) and related alpha-hydroxy acid dehydrogenases |
TIGRFAM ID | [TIGR01304] IMP dehydrogenase family protein |
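The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1002210
end_bp = 1003511
gene_length_bp = 1302
protein_length_aa = 433

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons = span_bp // 3

# Assumption: the last codon is the stop codon and is not counted
# in the protein length.
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)
```

Under these assumptions the span matches the reported Gene Length, and the codon count minus the stop codon matches the reported Protein Length.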
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAA TAATGCTAAT AATTAATTTT CAAATACTCA TATTTCTACA AAATTATAAA AAAACTCTTT TGTTTTTAAG AAATTATTGG CTTATTACAC CTTTAGTTGT TAACTTAATC AGAATAATTA AAAAAACCGT GAATATTGAA ATTGGCTTAA ACAAAAAAGT TAGAAGGGCT TACGGCATTG ATGAAATTGC ATTAGTGCCA GGAACAAGAA CCCTAGATTA CGAATTAACA AATCCTTCTT GGTCAATTGG AAATATTGAA AGAGATATTC CAATTATCGC CAGTGCAATG GATAGTGTTG TTGATGTGAA TACAGCTGTA GATCTCTCTA AATTAGGAGC TATTGGTGTT CTCAACATGG AAGGTATACA AACTAGGTAT GAAAATCCAA AGGAAATACT AAGTCAAATC TCCTCCGTAG GAAAAAATGA ATTCGTACCT TTGATGCAAG AAATATATAA AGAACCCATC AAGCAAGAAC TTATTTTGCA AAGAATTAAC GAAATTAAAG AAAAAGATGG AATAGCAGCA TTAAGTGGAA CACCGCAAGC AGCTATCAAA TTTAAAGAAA CACTTGTGAA GTCGAAAATA GATTTATTCT TTCTTCAAGG TACTGTAGTT TCAACCGAAC ATTTAGGTAT GGAGGGGAAT GAGACATTAA ATATCAAAAG CTTATGTCAA TCTTTAAAAG TACCAGTTGT TGCAGGTAAT TGTGTAACTT ATGAAGTTGC AGAACTTCTT ATGAAATCAG GTGTTGCAGG TCTTATGGTG GGAATCGGCC CAGGAGCAGC TTGCACTTCG AGAGGAGTAT TGGGAATAGG AATCCCCCAA GCAACAGCAA TCTCTGATTG TAGTTCAGCA AGAGATGATT ATTTTCAAGA AACTGGTCGT TATGTCCCCA TAATTGCTGA TGGAGGAATT GTTACTGGTG GTGACATTTG CAAATGCATC GCCTGTGGTG CTGACGCAGT TATGATTGGT TCTCCAATAG CTAAATCAAC AAGTGCTCCG GGCAATGGAT TTCATTGGGG TATGGCCACA CCAAGTCCTA TATTACCTAG AGGTACAAGA ATTGAAGTCG GCTCTACAGG TTCCTTAGAG AGAATATTAA AAGGACCCGC AATACTTGAT GATGGGACAC ACAATTTACT TGGAGCTATT AGGACATCAA TGAGTACTCT TGGAGCTAAA AATATCAAAG AGATGCAAAA TGTTGATATT GTAATTGCGC CATCTCTTTT AACAGAGGGA AAAGTATATC AAAAAGCTCA ACAGCTTGGA ATGGGTAAAT AA
|
Protein sequence | MNKIMLIINF QILIFLQNYK KTLLFLRNYW LITPLVVNLI RIIKKTVNIE IGLNKKVRRA YGIDEIALVP GTRTLDYELT NPSWSIGNIE RDIPIIASAM DSVVDVNTAV DLSKLGAIGV LNMEGIQTRY ENPKEILSQI SSVGKNEFVP LMQEIYKEPI KQELILQRIN EIKEKDGIAA LSGTPQAAIK FKETLVKSKI DLFFLQGTVV STEHLGMEGN ETLNIKSLCQ SLKVPVVAGN CVTYEVAELL MKSGVAGLMV GIGPGAACTS RGVLGIGIPQ ATAISDCSSA RDDYFQETGR YVPIIADGGI VTGGDICKCI ACGADAVMIG SPIAKSTSAP GNGFHWGMAT PSPILPRGTR IEVGSTGSLE RILKGPAILD DGTHNLLGAI RTSMSTLGAK NIKEMQNVDI VIAPSLLTEG KVYQKAQQLG MGK
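The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares; table 11 differs only in which codons may act as start codons, and that does not matter here because the gene starts with ATG. The helper names `clean`, `translate`, and `gc_percent` are hypothetical and chosen for this example only.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces that split the record's sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 and last 12 bases of the gene sequence, copied from the record.
gene_prefix = "ATGAATAAAA TAATGCTAAT AATTAATTTT"
gene_suffix = "ATGGGTAAAT AA"

print(translate(gene_prefix))  # MNKIMLIINF
print(translate(gene_suffix))  # MGK
```

The two printed fragments correspond to the first ten and last three residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the complete protein and the GC content field to be checked in the same way.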