Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3529 |
Symbol | |
ID | 4244355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5437935 |
End bp | 5440724 |
Gene Length | 2790 bp |
Protein Length | 929 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638108501 |
Product | hypothetical protein |
Protein accession | YP_723090 |
Protein GI | 113477029 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTATAC CTTTACTTCC ATTGCTTGCT CCCGCCGCAG CTTCAGCATG GAATGGAGTA GTTAATGCCT TTAATGGTGC CTCCGGTAGA ACAATGCAAC AAAATTTAGA AGGGCAACGA CTAGCTCATC AAAAAGAATT ACAAGAAAAA CAAATAGTCG CTCAATTTGA GATTGAATAT ATGCGTACAA TGTCTCAGAT AAAGTTGCAG CAGAATAATC AACAATTTCA ACAAAAACTA GAAATTGCCC GTCAAGAATT TCAAGGAAAA ATAGTTGAAT ACCAATGCCA AGAAAATCGG AAACTTCAAG AATTTATTAA AGCCGTAGAT ATGCAAATAG CTAAAAGTAA CCAAGAATTT CAAACCTGGT TATTTCAACA ACAAAAACAA CTTCAAATTG AATCAGCTCA ATATAACCGA GAAACCCAAT ATCTCAACGC GGTATATCAA CGAGAAACTA CCCTAGAAAT TAAAGAATTA GATAACTGGC CTCTGAAAAA TTATTCTTGG CAAATACTCC AATATCATCA AGGTCGCGAC CCCATTCCTG TACAAATAAT ACTAGCACCT CCAGAAATAG ACTTTGACCG TTTTGAACAT CTAAATAAAA CTTCAACTTC CACATTTCCT AAAGTGGAAA AACGCCTTTC TCAACACTTA CGAAACTTCT TAGAAAAACA CTATCCCCTA GAAAACCAAC AACGACCCAC AGAACTAATA GACCATACTT GGGATAGTAA CCGTTTTGCC GGAGGTAGCG CAGTTAAAAC TATATTTAGT CGCTTAAAAT CTGAACCAAT ACTATTAATA GAATCAGAAG TAGATGGAGA TTTAATTAAT ATCAAAGTTG CCTATTGGAG TGGGGGACAA GAAATATCTC CCTTCTACAA AACCATAATT TCTCAGCTAC ATTATCCGAA AGTTCTCTAT GAATTTGCTA TTCAACGTGC CTTACATTGG GAAACTAATG TTAAACAAAA ACTCTTAGCA AGAGGTAAAA CTGAACAAGA GATAAATCAA AAATTTGCTG GAGATAATGC TTTAAATTTA AGTATTTATA GAGAAGATAA AGAGCTACAA GCGGAAGGAA TAGAATTTGA AATAGCCTAC AAAACCAATA GCAAAGACTT TGGTAAATTA TCAGCTTTGT TAGGAGCATA TCATAATGTT ATTGCTGCCC TATTTACTGA TATTCATTAT TTAATTAATA CTAATTTATC GCCGAAGTTG CCAGAATTAT TGGCAGAGTT GGAAACAGAA TTTTCTATAG AAAAATGTCT GGCAGATGAC CTATTTCAAA TGGTAGTTGA AAGTTATATT GGTAGTCTCA AAGCAATGGA ACAAGACCGC TCCGAATTAG TACCAGGATT TGCTGCGGAT ATTGCTTTTG GTTTGACTGG TTTATCTAAT TCTAATTTGG CAGAAAAAAT GTTAGATTTT TGCTTCGAGT CTTGGCTAAG TTCTCGTTAT CTTTCTGTGG AGAATGGTAA GGAAAAATTT GATATGGTGG CTAAAAATTT ACTGCCTCTG GATCAGGAAT TTATAGGTAA AGTTAATCGC TGTTTGGTTG GTCTGGGTCA GCAGAAAAAA TTGAGTTTGA TAAATTCTTG TTACCTCCGG GGAATTGATA AATTTAATCT CGAAAAATAT GAGGAAGCAA TTATTGATTT TGGCTATGTT ATAAGTTTAA ATCCTAAATT TGCTGATGGT TATTATCGGC GAGGTTTGGC TTATATTCAG TTGGAAAATT ATGGAGATGC GGTTGATGAT TTGACTCAGG TTATTAGGTT AGATCCTAGT CACGCTGTGG CATTTAATTA TCGGGGTTAT GCTTACTATA AATTAGGTGA ATATCAATGG GCAGTTGATG ATTATAACCG GGCGATAAAT TTGGGTTTGA CTGAGGCGGT GAAAAATCGA GATATTGTTC TGGGTGTTTG GGAAGAAATT AAAAGGCAGC AGGAGTCGAA AGGGGATCTC TTTACTTTCG AGGTAATTAC TGTTAACAAT ACTGGCAAAG TTATTAGTTC TAACCAGGGA AGTGCTAGGC AAAAATTTGA AGACTTGGGT AGTGGAATTA AATTAGAAAT GGTTTATATT CCTGGGGGAA GTTTTCTCAT GGGGTCGGCG GGAAATGAGG GGAGCAGAAG ATCTAATGAA GGTCCCCAAC ATGAAGTAAC TCTCCAACCT TTTTATATGA GCAAATATCC TATTACTCAA GACCAGTACC AAGCCATTAT GGGAAATAAC CCTTCGTATT TTAAGGGAGG AAGCCGTCCA GTAGAACAAG TAAACTGGCA CAATGCTACA GAGTTTTGCC AAAAGCTATC ATCAAAAATT GGGAAAATTT ACAGGCTACC CAGCGAGAGT CAGTGGGAAT ACGCCTGCCG TGCCGGGACT ACTACTCCTT TTTATTTTGG AGAAACTATA ACCTCTGAGT TAGTTAACTA CCGTGGTGAC CATTCTTATG GCAATGCTCC TAAAGGAATA TATCGGGGAG AAACAACAGA TGTGGGAAGT TTTCCTCCTA ATGCCTTTGG TATATATGAT ATGCACGGGA ATGTTTGGGA ATGGTGTGCT GATGATGGGC ATGAAAACTA TAATGGTGCG CCTACAGACG GCAGTGTTTG GCTAGATGGA GAAAAAAATG GATCACCGCT GCGGGGCGGT TCTTGGTACT TCATTCCTAA TTATTGCCGT TCTGCGGTTC GCGACTACTA TTTTAGGCGC GACGACCCCT ACTACTTTAT TGGTTTTCGT CTTGTCTGCG ATGGCGGGAG AACTCTTTAA
|
Protein sequence | MPIPLLPLLA PAAASAWNGV VNAFNGASGR TMQQNLEGQR LAHQKELQEK QIVAQFEIEY MRTMSQIKLQ QNNQQFQQKL EIARQEFQGK IVEYQCQENR KLQEFIKAVD MQIAKSNQEF QTWLFQQQKQ LQIESAQYNR ETQYLNAVYQ RETTLEIKEL DNWPLKNYSW QILQYHQGRD PIPVQIILAP PEIDFDRFEH LNKTSTSTFP KVEKRLSQHL RNFLEKHYPL ENQQRPTELI DHTWDSNRFA GGSAVKTIFS RLKSEPILLI ESEVDGDLIN IKVAYWSGGQ EISPFYKTII SQLHYPKVLY EFAIQRALHW ETNVKQKLLA RGKTEQEINQ KFAGDNALNL SIYREDKELQ AEGIEFEIAY KTNSKDFGKL SALLGAYHNV IAALFTDIHY LINTNLSPKL PELLAELETE FSIEKCLADD LFQMVVESYI GSLKAMEQDR SELVPGFAAD IAFGLTGLSN SNLAEKMLDF CFESWLSSRY LSVENGKEKF DMVAKNLLPL DQEFIGKVNR CLVGLGQQKK LSLINSCYLR GIDKFNLEKY EEAIIDFGYV ISLNPKFADG YYRRGLAYIQ LENYGDAVDD LTQVIRLDPS HAVAFNYRGY AYYKLGEYQW AVDDYNRAIN LGLTEAVKNR DIVLGVWEEI KRQQESKGDL FTFEVITVNN TGKVISSNQG SARQKFEDLG SGIKLEMVYI PGGSFLMGSA GNEGSRRSNE GPQHEVTLQP FYMSKYPITQ DQYQAIMGNN PSYFKGGSRP VEQVNWHNAT EFCQKLSSKI GKIYRLPSES QWEYACRAGT TTPFYFGETI TSELVNYRGD HSYGNAPKGI YRGETTDVGS FPPNAFGIYD MHGNVWEWCA DDGHENYNGA PTDGSVWLDG EKNGSPLRGG SWYFIPNYCR SAVRDYYFRR DDPYYFIGFR LVCDGGRTL
|
| |