Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2236 |
Symbol | |
ID | 4243329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3481790 |
End bp | 3483193 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638107338 |
Product | hypothetical protein |
Protein accession | YP_721938 |
Protein GI | 113475877 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.555143 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.628114 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGATGA ATCAATGTTT TTGTAATGTT ATTGTTTCTA ATGAAAATAC GGTTAAACTC ATAAAATTAA CTCCTCAGTT AGCTACTAAA GAGCAGTTAA TAGGTAAGTT AAATCTTGAC AAAATTAACG ATAAGATTAG AGAAATTATT AAAGCTCCTA CTAATTTGAA AGGTGAACAA ATTACTAAGG TGGGAGAAGC TTTGTTTGAG GCTTTGTTTG ATGCTCAGTT AAGAGAATAT TTTTTGGCAT ACTATCAAGA TATTCTCAAA CAACAGGATG CAAATTTGGG AATTATTTTA GAAATAAATG AGGAAGCTAT GCCGGAAGTT GTAGCTTATC CTTGGGAGTT AATGTGTTTA CCTGAGAAAT ATAATCAAGG TCAAATTTAT TTCGCTACTA ATCGTAAATT AAGTTTTTAC CGTCGTAGAT ATCAATTAGA AGAATCAAAT AAATTATCTA TTAAAATTAA TCAGGATGAA CAGTTAAAAA TTGCTTTAGT TGTCTCGAAA CCGACGGCTG ATTCTGGATT AAGTGATGTG GAATATTATG AGGTACATAA ATACTTAAAA AGTATAGATA CTAAACAGGA GCAAGTTAAA TTTTTACCTG TTATGAATTC TCTAGATTTC TATAGTATTG TAGAAAGATT AGAAACTGAG AAACCGGATA TTTTTCATTT TATTGGTCAC GGTCAATTAG TTGAGAAAAA TGGAGAAGAT TTAGGACAGG TTGCTTTTGT TAATGAGTTT GGTGAAGCTG ATTGGAAAGA TGCTAAAATG TTTTGTCAAT TGTTTACTGG TCATCAACCT AAAGTTGTGA TTTTACAAGC TTGTGAAACT GGAAAACAAT CTGAAACTAA TGCTTTTAGT AGTCTGGCTT CTCGCTTAAT GCTTGAGGGA ATTCCGGTGG TTATAGCTAT GCAATATAAG GTTTCTAATC AAACGGCGAT TATTTTTGTG AAGGAGTTTT ATTCAAAAAT AATAGATGGT AATTCTGTTG AGATGGCGGT ACAGAAGGCT CGGTTTAAGT TGCGTATTGA GAAGGGTTAT GAGGCAAGAG ATTTTGCAAT TCCTGTTGTG TTTATGAATG CTTTAGATGG TTATTTATTT GCCAAAGAAT CTATGGAGAT AACTCAGAAA AAAAATGGTT TCTTGAGTTT AACGGGGGAA CAAAGAAGAA AATTTCGTGA AGAGATTGAA AGGGTATTAA GTGAGGATAA ACTGACAAGT ATATTTTCTG AATATCCAGA GAAATTTGGA GATAATTTTT ATAATCAAAT TCCAGGAGGT GACTATCGTA CAAGGTTGAT TAATTTAATC ATTGAGCTAA ATAATAGAGA ACTTATTTTT GATTTTATTA AAGTTGTTAG GTATGAGTAT CATAATTTTG CTCAAGATTT ATAA
|
Protein sequence | MTMNQCFCNV IVSNENTVKL IKLTPQLATK EQLIGKLNLD KINDKIREII KAPTNLKGEQ ITKVGEALFE ALFDAQLREY FLAYYQDILK QQDANLGIIL EINEEAMPEV VAYPWELMCL PEKYNQGQIY FATNRKLSFY RRRYQLEESN KLSIKINQDE QLKIALVVSK PTADSGLSDV EYYEVHKYLK SIDTKQEQVK FLPVMNSLDF YSIVERLETE KPDIFHFIGH GQLVEKNGED LGQVAFVNEF GEADWKDAKM FCQLFTGHQP KVVILQACET GKQSETNAFS SLASRLMLEG IPVVIAMQYK VSNQTAIIFV KEFYSKIIDG NSVEMAVQKA RFKLRIEKGY EARDFAIPVV FMNALDGYLF AKESMEITQK KNGFLSLTGE QRRKFREEIE RVLSEDKLTS IFSEYPEKFG DNFYNQIPGG DYRTRLINLI IELNNRELIF DFIKVVRYEY HNFAQDL
|
| |