Gene Information |
Locus tag | Tery_2158 |
Symbol | |
ID | 4244131 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3368523 |
End bp | 3370517 |
Gene Length | 1995 bp |
Protein Length | 664 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638107265 |
Product | hypothetical protein |
Protein accession | YP_721865 |
Protein GI | 113475804 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
| |
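The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a stop codon that is not counted in the protein length. The record does not state either convention, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length_aa(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
START_BP = 3368523
END_BP = 3370517

length_bp = gene_length_bp(START_BP, END_BP)
length_aa = expected_protein_length_aa(length_bp)
print(length_bp, length_aa)  # 1995 664
```

Under these assumptions the computed values agree with the Gene Length (1995 bp) and Protein Length (664 aa) fields.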
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.653369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGAAA CTCAACAACT ACCTATATCA ATTCAACCTA TAATTAACTA TCCCCATTCG GCAGAAGTGG GTAAAACTTA TTTGATGGAA ATTGATATTA AACAAACTGA GGATTTTGAA AAATGGAATT ATGAGGAGGA AGAATATCCT ATTTATTTTC GGGTAGATAC CTATTGTCAA GGAGATACTA CTCCATTATT TAAGATTCAA ATTATCGGCG AACCTGCAGT GGTTTTACAT CGCTTTGGGG GGACTTACGG CCCGGCAAAA TTTTTATTAA CTGCTGCTCA AAAGGAGATG GAAGGAGAGA TTAGAGTAAC TTTGGTTAAT GGTTGGGGCG TACCTCTCAA AAGGTTGCGT TTGGAGAATG TGGCTGTAGT TGCGGATAAA CAAGATGACT CAATTATTGC TGGAGTTAAG GTTAGAAATA ATCACGCGAT CGCTATGGGC ATTAACCAAT ATAATTATCT ACCATCTCTA AGATATGCTG TTAAAGATGC AGAGGTAATA AAAAATTGGT TTGAGCAGAA AAACTTTAGC AAGGTTGATT TATTTACAGA AGTAATTTCA GTGAGACTTA AGAGGTTTTT GGCAGAGAAA TTTGAGAAAC CTTTCCTCTC AGCAGAAGAT AATTTTTGGT TTTTCTTTAG TGGCTACGGT GAGTGTATAG AAGATGTAGA CTATCTCATC CCAGCAGATA GTCAAAAGGA TGACTTCAGA ACAACTGCTA TTTCTGTTAA TTCTCTTGTA GAAAAATTAT TTTCTTCTGG TGCAGGAAAA GTAATTTTGT TCTTAGATAC TAACTATATC TCTGGTGAAA TTTCTTTCTT TTCCGGTTTT TCTTTTTCTG TTAAAGATGG GCAGGAACTG ATAGTTTTCT TGAGTAAAGA AGCTAATGAA GTTGAGCAAC TTCAGCAGGG AAGTTTTACT TATGCTTTAC ACGAAGCACT CCGGTCGAGT GAAGGGAGTT TGAGTATAAA TCAACTTCAG AAGTTTTTGA GCGATCGCGG TCGGGAACTT AACAAACGGA ATAACCAGCA GAGGCAAATT CTACAAAAGT TTATTTCTCC CAAAAATTTG GGAGAATGGG TTCCTTTTCC TTCAGACTTT CAGGTTTTTC AGTATACAAC ACCTACAGTT GACGGAGGGG GAAAAATTAT CAAGCAAGAT ACTAAGCTTA CCCAATACTT TAGGGAAACT ATTGCGCAAC AGTTAGAACT AGAAATGGTA ATTATTCCTG GTGGTAACTT TACCATGGGT TCTCCTGAAA GTGAAGAGGG TAGCTATCAT GATGAACGCC CCCAAAACGA TGTGACTGTC TCTCCCTTTT TTATGGGCAA ATATCCGGTT ACTCAAGGAC AGTGGAGAGC GATCGCTTCT CAGACAAACT TGAAAGTAAA TTTAGACCTA GACCCAGAGC CATCATACTT TAAGGAACCA TATCAAGATA TAGATAGATG GGAGAGACCA GTTGAGAAGG TAAACTGGTA CCAAGCAGTA GAGTTCTGTG AAAGACTATC GAAATTAACA GGAAGGAATT ATAGACTACC CAGTGAAGCA GAGTGGGAAT ATGCTTGCCG TGCAGGAACC ACTACACCTT TCTACTTTGG AGAAACTATA ACATCTGAGT TAGTTAACTA TAATGGCAAC TATTACGGGA ATGGGCCCAA AGGAGAATAT AGAAACCAAA CTACTCCTGT AGGTCAATTT CCACCAAATG CTTTTGGATT ATATGATATG CACGGAAATG TATGGGAATG GTGTGCTGAT AATTGTTCCC GTGATGGTTA TCATTATGCT 
CCTACATATG GAAGTCCTTG GGTTGCTAGT AATAAAAAAT ATTATACCGA AATTGGGGCC TTAGTACGGG GCGGTTCCTG GGTCTCCTAT CCTAGTTATT GCCGTTCTGC GTTTCGCAGC GGCTCTCTTA GGCGCGACGA CCACTTCATC TTTATCGGTT TTCGTGTTGT CTGCGATGGC GGGATAACTC TCTAA
|
Protein sequence | MNETQQLPIS IQPIINYPHS AEVGKTYLME IDIKQTEDFE KWNYEEEEYP IYFRVDTYCQ GDTTPLFKIQ IIGEPAVVLH RFGGTYGPAK FLLTAAQKEM EGEIRVTLVN GWGVPLKRLR LENVAVVADK QDDSIIAGVK VRNNHAIAMG INQYNYLPSL RYAVKDAEVI KNWFEQKNFS KVDLFTEVIS VRLKRFLAEK FEKPFLSAED NFWFFFSGYG ECIEDVDYLI PADSQKDDFR TTAISVNSLV EKLFSSGAGK VILFLDTNYI SGEISFFSGF SFSVKDGQEL IVFLSKEANE VEQLQQGSFT YALHEALRSS EGSLSINQLQ KFLSDRGREL NKRNNQQRQI LQKFISPKNL GEWVPFPSDF QVFQYTTPTV DGGGKIIKQD TKLTQYFRET IAQQLELEMV IIPGGNFTMG SPESEEGSYH DERPQNDVTV SPFFMGKYPV TQGQWRAIAS QTNLKVNLDL DPEPSYFKEP YQDIDRWERP VEKVNWYQAV EFCERLSKLT GRNYRLPSEA EWEYACRAGT TTPFYFGETI TSELVNYNGN YYGNGPKGEY RNQTTPVGQF PPNAFGLYDM HGNVWEWCAD NCSRDGYHYA PTYGSPWVAS NKKYYTEIGA LVRGGSWVSY PSYCRSAFRS GSLRRDDHFI FIGFRVVCDG GITL
|
| |
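The sequence fields are printed in space-separated blocks of ten, so they need cleaning before any calculation. The following is a minimal sketch of how the Gene sequence could be normalized, its GC content computed, and its translation compared with the Protein sequence. It assumes the gene sequence is given in coding orientation (it begins with ATG even though the gene is on the minus strand) and uses the standard amino-acid assignments that NCBI translation table 11 shares with table 1. The helper names are hypothetical, and only the opening and closing fragments of the record are used here.

```python
# Amino-acid assignments common to NCBI translation tables 1 and 11,
# listed for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises ValueError on characters other than A, C, G, T.
    Alternative table 11 start codons are not handled; this gene starts with ATG.
    """
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last two blocks of the Gene sequence field.
head = clean_sequence("ATGAATGAAA CTCAACAACT ACCTATATCA")
tail = clean_sequence("GGGATAACTC TCTAA")

print(len(head), gc_percent(head), translate(head))  # 30 30.0 MNETQQLPIS
print(translate(tail))  # GITL
```

The two fragments translate to the first ten and last four residues listed in the Protein sequence field. Running the same functions on the full cleaned gene sequence would allow a comparison with the reported 39% GC content and 664 aa protein.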