Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | NATL1_12741 |
Symbol | |
ID | 4780598 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. NATL1A |
Kingdom | Bacteria |
Replicon accession | NC_008819 |
Strand | + |
Start bp | 1089326 |
End bp | 1090126 |
Gene Length | 801 bp |
Protein Length | 266 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640084553 |
Product | putative imidazoleglycerol-phosphate dehydratase |
Protein accession | YP_001015097 |
Protein GI | 124025981 |
COG category | [R] General function prediction only |
COG ID | [COG0546] Predicted phosphatases |
TIGRFAM ID | [TIGR01548] haloacid dehalogenase superfamily, subfamily IA hydrolase, TIGR01548 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.281535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATA ATATTGGCCT ACTTTTATTT GATATAGATG GGGTTATTCG TGACGTGACC AACAGCTATC GATTAGCTAT CCAAGAAACT GTAAACTTTT TTAGTGGGTG GAGACCCTCA ATAGAAGATA TTGATTCTAT TAAAAGTGAA GGCTGTTGGA ACAACGATTG GGATTTGAGC CTAGAAATGA TTAATAGACA TGTACAAAAA AACAATCTTT CTTTTTCAGC TCCGTCTAGA AAAAATTTAA TTGAATGCTT TGAAAACTTT TATTTTGGTG GAGATCCAAA TCATGACTCT AGTGAATGGT CAGGCTTTAT TAAAGATGAG ACGTTATTAG TCAAACAAAC ATTTTTTGAG GAATTAACTC AGCGAAGAAT TGGTTGGGGT TTCGTAAGTG GAGCTGAATT ACCATCAGCG AAATTCGTCT TAGAGCAGAG GCTCGGCTTA GCCTCTGCTC CCTTGATTGC AATGGGTGAA GCACCTGAGA AACCTGATCC AACAGGTTTT ATTTCATTAT CATCAAAACT TTCAAAAAAA CCCTTAGGTT TTTCCAATCC CCCAATTGCA TATATTGGAG ATACTGTTGC AGACGTTAAA ACAGTTATAA ATGCTCGAAT TAAGATCCCT GATCAGAAAT TTATCAGTCT CGCTATTGCC CCCCCACACT TACATGTAGA TTCAAGTAGA GAGAAGCGTC TCAGATATGA GGCAGAGTTA AAAAAAGCGG GTGCAGACTT GATAATTAAA TCGATGGACA ATTTAAAAAA TGAAACATTA AATTTGTTCA TAAACCAATA G
|
Protein sequence | MMNNIGLLLF DIDGVIRDVT NSYRLAIQET VNFFSGWRPS IEDIDSIKSE GCWNNDWDLS LEMINRHVQK NNLSFSAPSR KNLIECFENF YFGGDPNHDS SEWSGFIKDE TLLVKQTFFE ELTQRRIGWG FVSGAELPSA KFVLEQRLGL ASAPLIAMGE APEKPDPTGF ISLSSKLSKK PLGFSNPPIA YIGDTVADVK TVINARIKIP DQKFISLAIA PPHLHVDSSR EKRLRYEAEL KKAGADLIIK SMDNLKNETL NLFINQ
|
| |