Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2222 |
Symbol | |
ID | 4243256 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 3464417 |
End bp | 3465613 |
Gene Length | 1197 bp |
Protein Length | 398 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638107324 |
Product | hypothetical protein |
Protein accession | YP_721924 |
Protein GI | 113475863 |
COG category | [S] Function unknown |
COG ID | [COG4370] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03492] conserved hypothetical protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.917387 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.0837623 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAA AAAATATACT TTTTTTAAGT AATGGTCATG GAGAAGATGC CCATAATTGT CAAATTATTA AAGCTTTTAC AAAAATTTCT CCAGATACAA ATATATCAGC TTTACCTATT GTTGGTGTTG GTAATAGTTA TGAAAATTTG AATATACCAA TTATTGGTCC TCGAGTAAAT ATGCCATCAG GAGGATTTTT ATATCTCAGT CCTTTATTAT TATTTGAAGA TTTGGGAAAA GGTTTAATTA GTTTAACTTG GCAGAAGTTA CAAACTATTT GGACATTTGC GAAAAACTGT GATTTAATTA TGGCTACTGG AGATATTGTA GTGGCAGCTA TGGCTTATTC GACAAGGCTT CCTTATATGA TATTTCTTTC GGCTGATTCT AGTTATTATG AAGGTCGGAT TAATTTGGGT TTAATATTGC CAAAGTTACT TCATAATTCT CGATGTTTAA AGGTTTTTGC TAGGGATGCT TTGACGGCTA AAGATTTAAA AAGACAAGGA GTTACAAAAA CAGAATTTGT TGGTACTCCA GTGATGGATA ATTTAATTTC AACTGGAAAA AATTTACGGC TTAAAACGGA ATTGTTTACT ATTGCTATTT TGCCTGGTTC TCGGTTGCCG GAAGCTGGTA AAAATTTATG TTTGCTGTTG AAACTGGTTA GAGAAATTGT CAAAGTTATG GGAGTAAATG TTTGTCAGTT TCGAGCTGCA ATTGTTCCTA TTTTAATGTT TGAATTAGAG GCGATCGCTA TTTCTGAAGG TTGGGAATGT CAAGGAAGTA AGCTAACATT TTTTACTCAG GAATATACAA TAGAAGTAAT TTGTTATGAG GATGCTTTTG CAGATATTTT ACAACATTCA AGTTTGGTAA TTGGTATGGC TGGAACTGCA ATAGAACAAG CTGTGGGTTT AGGCAAACCT GTAATTACTA TTCCTGGTGA AGGTCCTTCA TTTACCTATC GTTTTGCGGA AGCTCAAACT AGACTTTTAG GTTCTTCTGT ACAGGTTATT GGTAAAAGAA TGGCTAATAG TTTTATTCTC CAAGAAGCAG CTAGAAAAGT TAAAGAAATT TTGGCAGATG AAGAGTATTT ACAAAGTTGC ATTAATAATG GTTTAGAAAG GATGGGGAAG CCTGGTGCTA GTGAAAAAAT AGCTAATTAT CTTGTTAAGT ATCTGAGTTC AGACTAA
|
Protein sequence | MKTKNILFLS NGHGEDAHNC QIIKAFTKIS PDTNISALPI VGVGNSYENL NIPIIGPRVN MPSGGFLYLS PLLLFEDLGK GLISLTWQKL QTIWTFAKNC DLIMATGDIV VAAMAYSTRL PYMIFLSADS SYYEGRINLG LILPKLLHNS RCLKVFARDA LTAKDLKRQG VTKTEFVGTP VMDNLISTGK NLRLKTELFT IAILPGSRLP EAGKNLCLLL KLVREIVKVM GVNVCQFRAA IVPILMFELE AIAISEGWEC QGSKLTFFTQ EYTIEVICYE DAFADILQHS SLVIGMAGTA IEQAVGLGKP VITIPGEGPS FTYRFAEAQT RLLGSSVQVI GKRMANSFIL QEAARKVKEI LADEEYLQSC INNGLERMGK PGASEKIANY LVKYLSSD
|
| |