Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0807 |
Symbol | |
ID | 4241770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 1282672 |
End bp | 1283964 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638106085 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_720697 |
Protein GI | 113474636 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATT ATTCTAACTT CTCTCCAAGT AATAATCCTA ATTCCCACTC TACATTAGAA AGTCCAAATG GTACCCCCGA AGCAATTACT TCTGTAAAAA AGTTAAACTT CATAGATAAT GAAATAGAAC AAATGCTTCC CACCCCTGCG ACACCCAATA CTTTAGTGGA GCATCAAAAA TCTGAAGGGC CAAAAAATCC TGTAGCATCT ATTGTTACTC TAGTAGCGAT CGCATTGATG ATTCTGGGTT TAGCCATAGA TAATGTGCTG CTTGGTTATA CCAGTGCTAT TATAGTAATA CTATCTTCAG TCAAAATGAT TTGGCCTAGT TGGGGTAAAG TCTGGAAAAC TTTGATTCCT TCAGTTTGGC GTAATCTAAT CATTGCCTGC TTTGGTCTCC TGGCAGCTAT TGTTGGTTTG CTGATGTTAA GTGGGGCAAA CCAACAACCT GGTAGTAGAA ATATCCACAT TAACTGGGAT GCTATTGGAG CGGTGGGTGA ACTTATTGGA GCTTTGGGTC AAATTTTAAT TGCAATAATT GCTGTATATG TAGCTTGGCG ACAATACGTT ATTTCTAAAG ATTTGACAAT TCAACAAAAC CGCATTACTC AACAACAAAC TATTGATGCT TACTTTCAAG GGGTTTCTGA TTTGGCAATG GATGAAAAAG GTTTCTTGGA AGATTGGCCA CAGGAACGAG CGATCGCTGA AGGTCGTACA GCGGCTATTA TTAAAAGTGT AGATGAAGAA GGGAAAGCTA AAATTCTCAG ATTTTTATCT CAGTCTAGAC TGGTAACACC AATTAAACGT GATAGACTGC TAGGCCGTCC CATATTTGAT GGTCAAGGTG GTTATGCTGA AGATAGGGAA CATGGTACTC GTGTTATTGA TTTAGGAGTA ATGTTAGCAG GTGCTGACCT GAAAAACACA GATTTGCGGT GGACAGAGTT AAGTGATGCT AATTTAGTGA GAGCTAATCT TAGTGGCTGT GATTTAGTCA AGGCTAATTT CTCTCGTACT ATTCTATATG AAGCAAGTTT GGTAGGTGCT GATTTGAGGG GAGTCAGATT TTTCTATGGT ACTGCTGAAT ATGCTACTCC CCGCAGTCGT ACTCATATAC CTAACTATCA AACTGGTGCT TATACTGGTG CTGTGGTAGA AAATGTTGAC TTTACGGAAG TGAAGCGGTT GTCTGATGAA CAACATTATT ATTGTTGTGC TTGGAGTGGA GAAAGAAGTA GAAAAACTAT ACCAAATGGT TGTGGAGGTA TTCCAAATAA GTTAGGGCGT TAA
|
Protein sequence | MNNYSNFSPS NNPNSHSTLE SPNGTPEAIT SVKKLNFIDN EIEQMLPTPA TPNTLVEHQK SEGPKNPVAS IVTLVAIALM ILGLAIDNVL LGYTSAIIVI LSSVKMIWPS WGKVWKTLIP SVWRNLIIAC FGLLAAIVGL LMLSGANQQP GSRNIHINWD AIGAVGELIG ALGQILIAII AVYVAWRQYV ISKDLTIQQN RITQQQTIDA YFQGVSDLAM DEKGFLEDWP QERAIAEGRT AAIIKSVDEE GKAKILRFLS QSRLVTPIKR DRLLGRPIFD GQGGYAEDRE HGTRVIDLGV MLAGADLKNT DLRWTELSDA NLVRANLSGC DLVKANFSRT ILYEASLVGA DLRGVRFFYG TAEYATPRSR THIPNYQTGA YTGAVVENVD FTEVKRLSDE QHYYCCAWSG ERSRKTIPNG CGGIPNKLGR
|
| |