Gene Information |
Locus tag | Tery_1108 |
Symbol | |
ID | 4242189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 1743770 |
End bp | 1744945 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638106333 |
Product | hypothetical protein |
Protein accession | YP_720945 |
Protein GI | 113474884 |
COG category | [H] Coenzyme transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2; [COG0607] Rhodanese-related sulfurtransferase |
TIGRFAM ID | |
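The coordinate and length fields above are mutually consistent, which can be checked with a few lines of Python. This is only a sketch: it assumes the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon. The helper name cds_lengths is hypothetical and is not part of the record or of any database API.

```python
# Hypothetical helper; the coordinate convention (1-based, inclusive) is an assumption.
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    gene_len = end_bp - start_bp + 1       # inclusive coordinates
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1        # subtract the terminal stop codon
    return gene_len, protein_len


# Start bp and End bp copied from the record above.
print(cds_lengths(1743770, 1744945))  # (1176, 391)
```

Under those assumptions the result matches the Gene Length (1176 bp) and Protein Length (391 aa) fields.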
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000685928 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.228939 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTAAATC CTAATCTGGA ACAAATCCAG TTAAATACAG AAGAATATCA ACGTTATTCG AGGCACCTGA TTCTACCGGA AGTAGGATTA GATGGTCAAA AACGTCTCAA GGCAGCTAGT GTTCTATGTA TAGGCACGGG AGGTCTTGGT TCTCCACTAT TGTTATATCT AGCAGCAGCA GGAATTGGAA ATATTGGAAT TGTAGATTTT GATATTGTCG ATAGTTCCAA TTTACAACGA CAGGTTATTC ATGGTACTTC CTGGGTGGGT AAGCCAAAAA TTGAATCTGC TAAAAATCGG ATTCATGAAA TTAATCCTTA CTGTCAGGTT GACCTTTATG AAACCAGGTT AAGTGCTGAA AATGCCCTTG ACATTCTCAA GTCTTATGAT GTGATTGTTG ATGGTACTGA TAATTTCCCG ACTCGTTATT TGGTTAATGA TGCCTGTGTT CTTTTGAATA AACCTAATGT CTACGGCTCA ATTTTCCGCT TTGAGGGTCA GGCAACTGTG TTTAATTATG AAGGTGGACC GAACTACCGT GACCTTTACC CTGAACCTCC ACCCCCAGGA ATGGTACCTT CTTGTGCAGA AGGTGGGGTG TTGGGTATTT TACCAGGAAT AATTGGGGTG ATCCAAGCAA CGGAAACTAT CAAAGTTGTT TTGGGTAAAG GTAAGACTTT GAGTGGTAGA TTGTTACTTT ATAATTCCCT AGATATGACT TTCCGAGAAT TGAAATTGCG TCCTAATCCG ATACGACCAA TTATTGAAGA GTTGATTGAT TATGAGCAGT TTTGTGGTAT TCCTCAAGCT AAAGCACAGG AGGCAGAAAC TAAAATGGCT ATTCCAGAAA TGACAGTTCA AGATTTGAAG CAATTATTTG ATAGTGGGAA GAAGGATGAT TTTGTTTTAG TTGATGTACG GAACCCCAAT GAATATGATA TTGCCAAAAT TCCTGGGTCT GTTTTAGTAC CATTGCCAGA TATTGAGCAG GGCCCTGGTG TGACAAAGGT GAAGGAGTTA ATGAATAATC GCTCTTTAAT TGCTCATTGT AAGATGGGGG GGAGATCGGC TAAAGCTTTA GGTATTCTTA AAGAACATGG TATTGAGGGT ACTAATCTCA AGGGTGGAAT TACTGCTTGG AGTAAGGAAA TAGATTCTTC TGTACCTCAA TATTAA
|
Protein sequence | MLNPNLEQIQ LNTEEYQRYS RHLILPEVGL DGQKRLKAAS VLCIGTGGLG SPLLLYLAAA GIGNIGIVDF DIVDSSNLQR QVIHGTSWVG KPKIESAKNR IHEINPYCQV DLYETRLSAE NALDILKSYD VIVDGTDNFP TRYLVNDACV LLNKPNVYGS IFRFEGQATV FNYEGGPNYR DLYPEPPPPG MVPSCAEGGV LGILPGIIGV IQATETIKVV LGKGKTLSGR LLLYNSLDMT FRELKLRPNP IRPIIEELID YEQFCGIPQA KAQEAETKMA IPEMTVQDLK QLFDSGKKDD FVLVDVRNPN EYDIAKIPGS VLVPLPDIEQ GPGVTKVKEL MNNRSLIAHC KMGGRSAKAL GILKEHGIEG TNLKGGITAW SKEIDSSVPQ Y
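The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is illustrative only. It assumes the gene sequence is already shown in coding orientation (it begins with ATG and ends with TAA even though the gene lies on the minus strand), and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code. The names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Amino acid assignments in TCAG codon order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()     # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":                      # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTTAAATC CTAATCTGGA ACAAATCCAG"))  # MLNPNLEQIQ
```

Passing the full gene sequence to translate in the same way should yield the 391-residue protein listed above, provided the orientation assumption holds.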