Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1701 |
Symbol | |
ID | 4244086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 2583999 |
End bp | 2586362 |
Gene Length | 2364 bp |
Protein Length | 787 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638106830 |
Product | hypothetical protein |
Protein accession | YP_721439 |
Protein GI | 113475378 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAGTC GTGAGTGGAG GACAAGGCGG AATCGCTTTC GACGAATGTA TGATGACCGT TGGCGCGATC TAACCCCCGA TTTAATGTTT CCCATAGACG CGCCTGAAAG AACTCGTGAG TTATCTTTTA ATGTAAAAAA TAATACTGCA ATGCCTGGAA TTGGTGAATC CATGCGTTTT GCTGCACGCT TAGGAATTGG TCGTCATGTA CCATGTTTTG TTTTTTTCAC TGATATAGGT GAATTAACGA TTGATGTTTT TCCTGTGGGA AATTTATCAG CCGATGAGGC TTTCTATCAA ATGAGATATT GGATTGACGA CTTCTATCAA GAAAACCAGG TATCACTGAA CAAATGGAAT CAAGTTGAGC AAGATATTAT TTCCTTTATA AGTTCCATTG ATCGATCACT CACAGACATT AAGGATTGGA TTAATAAAAG CGAGAGATTA TGGGATGAAC TGATATTGGT CGCCCAAATT ATTGAAAAGC TTCGCAAATT AGGACAGCAG ACAACAGACT ATAAATCTTT GATTGATAAT CTTAGTACTT CTTCTTGGAG ATGCAACAGG ATTGTGTCTG ATTGTCGCGT TCGCTTGGAA AGTATATCCA AAAAACGAGA AAAGCATCAA CTTGAACAAG AAAACCTTAA GGTTGCTATT AACAAACTCA AGACAATTTC CATCTCCACC AATATTTATG ATGAATGTCT TCTTGCAGCA AGCCAACTTC TTACTTCAAA AGCATCAAAA ATCTTAGAAA AAGCAGCAAA ACGTATAAAG CAAAAAAGCA ACTCAAAATT CATCTCATTA GAAAATCAAT TATTTCAGTG GTGGGGAAAT ACCAAGAAAT GTATTCCATC ATTTAATAAG TTTAAGAAAG CCCATAAAAA ACAATCTGAA TTGCATACTC AGTCTCATGA AATTCTCAAA TATAAATATA ATAATTATAA TAATTTTATT GCATCTATTT TTGAACTACC ATTTTCAGAT AGGCAAGAAA TATTTTTAGA GAAAGCTAAA CTATTATGTG ATGAATCTGA TATAGATTTC TCTGAATACT CTTTGCAATT AACTGAATTT TTTACACAGT TACATACTCA AGTTCCTAAA TGGATAGATG GAACTAATCT AAAAATCTCT ATTCTTTTTC CCTTTAAGAG CCGAGATAGT ATTAGTTTTG ATACAGTTAT GGCCTCAATT GGATATGAGC ATCCTATTAG CCAAATGATC AGAGAAAATA TGACGGTTGA GCAAAAAAAG AAAAAAGAAA AATTAGTATC AGAAACTGAA AGAATAGTAT TGCAATATAG AGATGAGGCA TTAGCTGAAC TAATCAAGCT AAGAAAACAA CCATTGGATG TATCTACTGA AGAAATAGAC ACATACTCAA CCTGCCTTGA CAATATGTAT AATCTACGCA ATGAAATTGA AAATGAACTA ATCAATTTGG CTAACTCTAG CTCTAGTCTG GAAAAATCTT TGCGTTTAGT TGAACCAAAA GATATTGAAA ACTTTCTAAA GCTATTGAAT GAGTATAGAG AGACTACTAA CAAGTTTTTT TATCCATACA AGAAAAATCC CAGAATTCAA CAAGTGAATA TTGACCAACC GCTGCCACAA ATTTTTGAAT TAAAATTACG AGAAAATCAT CTAAATACAT CAAACACAAG AGCTAGAGAA TTAGAACAAA AATTAGCAAA AACCATACCC AATTCTGAAA ATGGGGTCAA ATTGTTACAA AATGTCCAAC AGAAGTCTTA CACAGTGACC CCTAAAGCAC GGCTTGTAAG TGAGATCTTG AAAATCAAGG AAAGCCCTCA GAATAGCTCA CGACCAAATT CTGTTCCTCC AATATTTTCT GATTCAAATT ATCCAGAAAA TTTTGAAGAA ATACTATGCG GATTAAATGA TCAAGAGTTG AGAATTCTAT CAAACTCAAT AGCAAAGTTA GATTTAGATG CTGTAGTAGG CTCTAGGGAA GAGATTATTA ATATAATTCT TACTATTGTT GGTTTACTTC CTAGTAGAGA ACTTACTACT CATCGTCAGT ATGCAAATCG CCCAATTAAT TTAGAGGTAA GTACCATGAC AGAATCCAAA AATGTTGAAG TTGAAATGAA CTTTAACAGT CAAGTTATTG GAGCAACTGG AAAAAATGAA GGTATTATCA ATATTAATAC CTCTGATCGC CAAATTCTTG CAGAGGCTGC TGAGGAGATC CAAAAACTTC TAAGGCAGCT TGAGAAAACT AACCCTTCTG CAACTGAGCT TGAACAAGTA GCTTATGTTG ATGTTACAGT TTATCCAAGT CACCAAGCAA CGAACCATTG CTGCTTTAAG GGCGGGAGGT GGAACTGCAA TTGA
|
Protein sequence | MRSREWRTRR NRFRRMYDDR WRDLTPDLMF PIDAPERTRE LSFNVKNNTA MPGIGESMRF AARLGIGRHV PCFVFFTDIG ELTIDVFPVG NLSADEAFYQ MRYWIDDFYQ ENQVSLNKWN QVEQDIISFI SSIDRSLTDI KDWINKSERL WDELILVAQI IEKLRKLGQQ TTDYKSLIDN LSTSSWRCNR IVSDCRVRLE SISKKREKHQ LEQENLKVAI NKLKTISIST NIYDECLLAA SQLLTSKASK ILEKAAKRIK QKSNSKFISL ENQLFQWWGN TKKCIPSFNK FKKAHKKQSE LHTQSHEILK YKYNNYNNFI ASIFELPFSD RQEIFLEKAK LLCDESDIDF SEYSLQLTEF FTQLHTQVPK WIDGTNLKIS ILFPFKSRDS ISFDTVMASI GYEHPISQMI RENMTVEQKK KKEKLVSETE RIVLQYRDEA LAELIKLRKQ PLDVSTEEID TYSTCLDNMY NLRNEIENEL INLANSSSSL EKSLRLVEPK DIENFLKLLN EYRETTNKFF YPYKKNPRIQ QVNIDQPLPQ IFELKLRENH LNTSNTRARE LEQKLAKTIP NSENGVKLLQ NVQQKSYTVT PKARLVSEIL KIKESPQNSS RPNSVPPIFS DSNYPENFEE ILCGLNDQEL RILSNSIAKL DLDAVVGSRE EIINIILTIV GLLPSRELTT HRQYANRPIN LEVSTMTESK NVEVEMNFNS QVIGATGKNE GIININTSDR QILAEAAEEI QKLLRQLEKT NPSATELEQV AYVDVTVYPS HQATNHCCFK GGRWNCN
|
| |