Gene Information |
Locus tag | Tery_4113 |
Symbol | |
ID | 4245627 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6345000 |
End bp | 6345869 |
Gene Length | 870 bp |
Protein Length | 289 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638109014 |
Product | histidine triad (HIT) protein |
Protein accession | YP_723594 |
Protein GI | 113477533 |
COG category | [F] Nucleotide transport and metabolism; [G] Carbohydrate transport and metabolism; [R] General function prediction only |
COG ID | [COG0537] Diadenosine tetraphosphate (Ap4A) hydrolase and other HIT family hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.489073 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTACCA ATAAACAACC AAACCAATAT ACCCACCTTA CAGCTATAGA CCGTAAATTT ATCTCATTTC CAACTCAAGT AATACTAAAA CAAAACTTGC TAGTAGGTAA AATTCTTGAT TTTGGCTGTG GTCTCGGTAA AGACGTTAAA CTACTACAAA AAAAAGGCTT TGATATTATT GGTTACGACC CTTATTATTT TCCAGAATAT CCCCAGGGAA AATTTGATAC TATCCTTTGT TTTTATGTCT TAAACGTCTT ATTTGAACAA CCACAAATCG AAGTGTTGAT GCAAATTTCC CAGTTATTAA ACTCTGAAGG AAAAGCTTAT TATGCTGTGA GAAGAGGTTT GAAAGGAGAA GGTTTTCGAG AACATTACAT ACACAAAAAA CCAACTTATC AGTGTTTAGT TAATTTACCT TTTAAATCTA TTTATACTGA TGAACTTCGT GAAATTTATG AATATACTCA TTATAATCAG CAAAAAAATT CAGAGAACAA ATGCTTATTT TGTAATCCTC GTAAAAATCT ACAATTAATT ACAGAATCAG CAAAAGCTTA TGCAATACTT GACGGTTACC CAGTCACAAA AGGCCATACA TTAATTATTC CCAAACTTCA CCAGGAAAAT TATTTTGAAT TGCCTATAAA TGACCAGTTA GAATGTTGGT TAATGGTAAA TCAAGTACAG AAGATTTTAC AGGAAAAATA TCAACCTGAC GGATTTAATA TAGGAATTAA CGTTAATCAT GCTGGAGGAC AAAAAATGAT GCATACTAAT ATTCACGTTA TTCCTAGATA TCAAAAGAAC GAGTTAGGAA CTAAAGGGGG AATGAGGTCT GTTGTTCCAA AGAGGAGAGG CAAAGTATAA
|
Protein sequence | MFTNKQPNQY THLTAIDRKF ISFPTQVILK QNLLVGKILD FGCGLGKDVK LLQKKGFDII GYDPYYFPEY PQGKFDTILC FYVLNVLFEQ PQIEVLMQIS QLLNSEGKAY YAVRRGLKGE GFREHYIHKK PTYQCLVNLP FKSIYTDELR EIYEYTHYNQ QKNSENKCLF CNPRKNLQLI TESAKAYAIL DGYPVTKGHT LIIPKLHQEN YFELPINDQL ECWLMVNQVQ KILQEKYQPD GFNIGINVNH AGGQKMMHTN IHVIPRYQKN ELGTKGGMRS VVPKRRGKV
|
| |
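The coordinates and lengths reported in the record can be cross-checked with a few lines of arithmetic. The Python sketch below is illustrative only: the helper names (`gene_length`, `protein_length`, `clean_sequence`, `gc_fraction`) are hypothetical and not part of any database API, and it assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon.

```python
def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(grouped):
    """Strip the whitespace that splits a sequence into blocks of ten."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Values taken from the Start bp / End bp / Gene Length rows of the record.
print(gene_length(6345000, 6345869))  # 870
print(protein_length(870))            # 289
```

Passing the full contents of the Gene sequence field to `gc_fraction` gives a value that can be compared with the GC content row; `clean_sequence` handles the space-separated blocks as printed.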