Gene Information |
Locus tag | Tery_0214 |
Symbol | |
ID | 4241810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 325107 |
End bp | 326174 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638105559 |
Product | GHMP kinase |
Protein accession | YP_720176 |
Protein GI | 113474115 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | |
|
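The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 325107
end_bp = 326174

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1068
print(protein_length)   # 355
```

Both results agree with the Gene Length (1068 bp) and Protein Length (355 aa) reported in the record.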
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0700927 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.111663 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTTT TTGTACCAGG TCGCCTTTGT TTATTTGGAG AACATAGTGA TTGGGCAGGA GGGTATCGTT CTATGAACCC CCAAATAGAT AAAGGCTATA CAATTGTAGT AGGAGTTGAT CAAGGAATTT ATGCAGATGT CAAACCCCAT CCTACTCATT TAATTATCAA GACAACTTTG AATAATGAAA GTCTGATACA GTCGTTTAAT TTGCATAAGC ATAAAGAAAT ATTATTAGCA GAAGCTCGTC GGGGAGGAGT TTTTAGTTAT GTAGCTGGGG TTGTATATCA AGTTATTAAT AATTATTCAG TAGGTGGTTT AGAAATAGAT AATTATTTCA CAGATTTACC AATTAAAAAG GGTCTATCAT CAAGCGCGGC TATTTGTGTA TTGGTGGCTA GAGCTTTTAA CCTATTGTAT GATTTAAAAC TAAATATTCG TTCGGAAATG GAGTTAGCTT ATCAGGGGGA AGTTACTACT CCTTCTCGTT GTGGCAAGAT GGACCAAGCT TGTGCTTATG GTAAGCAAGC AATTATGATG ATATTTGACG GGGAAAAAAC GGATATTATT GAACTTAATC CACCAAAAAA AGATTTATTC TTTGTTATTG TTGATCTAGG CGCGAGCAAA GATACTCAAG AGATATTGAC TAGATTAAAT AAATGTTATA TAGAAGAAGC TAATAAAGTA AATGAGAATG TACAATATTA TCTTGGAGTT ATTAATGCTG ATATTACTAA ACAAGCAGCA TTAGCCTGGC AAAAAGGGGA TGGAGAAAAA ATTGGTAGTT TGATGCTCAA AGCACAAATT GAATTTGATA AATATATGAT ACCAGCTTGT CCTTCACAAT TAACATCTCC AGTACTCCAT TTATTACTAA ATTATTCACG TCTCCAAGAA TATATTTGGG GTGGTAAAGG AGTTGGTTCT CAAGGTGATG GAACAGCTCA ATTCATTGCT AAAGATGAGA ATAGTCAACA GAAGTTAATT GAAATAATTA ACCTAGATTT TCCTAAAATG CAATGTTTTA AATTAGTAGT TAGGGCTGAA TATAGCAATT CTAAATGA
|
Protein sequence | MKLFVPGRLC LFGEHSDWAG GYRSMNPQID KGYTIVVGVD QGIYADVKPH PTHLIIKTTL NNESLIQSFN LHKHKEILLA EARRGGVFSY VAGVVYQVIN NYSVGGLEID NYFTDLPIKK GLSSSAAICV LVARAFNLLY DLKLNIRSEM ELAYQGEVTT PSRCGKMDQA CAYGKQAIMM IFDGEKTDII ELNPPKKDLF FVIVDLGASK DTQEILTRLN KCYIEEANKV NENVQYYLGV INADITKQAA LAWQKGDGEK IGSLMLKAQI EFDKYMIPAC PSQLTSPVLH LLLNYSRLQE YIWGGKGVGS QGDGTAQFIA KDENSQQKLI EIINLDFPKM QCFKLVVRAE YSNSK
|
| |
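The sequence fields above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that spacing, compute a GC fraction, and translate a nucleotide string with the amino-acid assignments of translation table 11 (which are the same as those of the standard code). The function names `clean`, `gc_content`, and `translate` are hypothetical helpers, not part of any database tool, and the example uses only the first and last few blocks of the gene sequence rather than the full record.

```python
# Build the codon table in the conventional TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES), AMINO
    )
}

def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()

def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)

def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First blocks of the gene sequence from the record.
print(gc_content("ATGAAACTTT TTGTACCAGG"))            # 0.35
print(translate("ATGAAACTTT TTGTACCAGG TCGCCTTTGT"))  # MKLFVPGRLC
# Last nine bases of the gene sequence, ending in the TGA stop codon.
print(translate("TCTAAATGA"))                         # SK
```

The translated prefix matches the first ten residues of the protein sequence, and the final codons give the closing residues SK. Applying `gc_content` to the full gene sequence would be the way to compare against the 34% GC content reported in the record.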