Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3463 |
Symbol | |
ID | 4244463 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 5301467 |
End bp | 5304766 |
Gene Length | 3300 bp |
Protein Length | 1099 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638108438 |
Product | hypothetical protein |
Protein accession | YP_723027 |
Protein GI | 113476966 |
COG category | [S] Function unknown |
COG ID | [COG1649] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.639127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAAC AAAAACAAAA AAAAGCTTTG TCTAAAAAAA ATTTACTACT TCTACCAAGT TTGTTAGGTG CATTACTAAG TTTAATGGTA CTGTTACCAA AAGCTGGTAG AGCTAGTGTT TTATTAGGGA TAATTCGAGA CTTCCAGAAT ACAGATGAAT GGAATCAAGT TATTAGACGA ATAGATGCTC TCGGTATTAG TTATGAACCA ATTGATTTAA GACAAATAAA AACAGTAGAT GAGCTATCTG GGGTAAGGGT GATCTTTTTA CCAAATATTG AAGTTTTAAC TGAACCTCAA GTACAAATTA TTGAAGAGTG GGTGAAGGGT GGAGGAAAAT TAATAGCTAG TGGTCAAATA GGTCAAAAAT CTCAGTTAGG AGTAAGGCAA AAGTTGCGAT CGCTACTTGG TTCTTACTGG GCTTTTCCTC TAAGTCAACC GACAATACCA GAACCCAAAT ATCGTTGTTT AGATCTAACT TGTACAAAAT CTACAAATTG GGCACCAAAA ACCAATAATG TAGGTACAGT CACAGGTGGA ATATTAATTC CAGCCGGTTT AAATAGCACA ACTGCAGCTA CTTGGAAGGG AACTTCTGGT TCTTCAGCAG TAGTAATTAC TCCTCAAGTT ACTTATTTGG GTTGGCATTG GGGAAATACT GAATCTGCTG CTGTGGATAG TATTTGGTTG CAGGCTATTT TGAATCGTTA TCAAGGTCAA CCAGAATTTA GTGCCAGAAA TAATAATATT TTTTCTTTAG AAAATAATAG ACGGAGTGTA TCTGATGCTA ATAACTCCAA CCCGGTAAAA ATTCACCCAA GAAGTAACCC TTCGCCGAGT TCAAGGAAGG AAAGTGTGAG TCCAGTTAGA GTTCAACCTA AAAGGAATTC TGTAAATACT GGGGAAAATT CAAATATAAA TCCAACTGCT CCTATAGCTG AAAATTCAAT TGTAAATGAA GATAATATTT GGCTGAGAAG GCAGGAAAAG AAGAGCGGTA CTGAAAACGA ACCAGAAAAT TTGATTCAAG CAAGGGAAGA ATTAGGTAGC CAAAATAGTA ATTTAGGATC AAAAATAACA ACGGTTCCTG AAAGTGAAAT TGTTGGTGGG AAAGATACTG CTGTTAGTGA GTCTGAAAAT TCAACTACAG AGACTACTAA TAATAGAGGT AGGAATATTT GGCAAAGGCT CCAACAAGAA AAAGAAGAAC AAAATCAGGC TAGTGAAGTT GTTGGTGAGA AAGATCAAGT TATTGACTCA AGTACTAACT CTACTCTACA AAATAATAAT AGGGGGAGTA TTTGGAACCG GACTCAAGAA GAAACAAAAA CAAGGGCATC ATCTTCTCGC CGTCATCCTA TAATGCGCTT ACTCAGGTCT CTACCACCAA TAGAAGTACC AAGTGCTCAA AGAGACCCTT CATCAGGATC AGCATCTCCT GGTTTAGATA TTCGACAAGG AAATTATCCA ATTAGCAGGG CGAAAGCTTA TGCCATGTTA GAGGAATTAA ATAATCTCCT TGGTAGATTT GAAAGCGCAT TGATAGCAGC TAAATCTGCA AATGTAAAAG TTGATCTGGC AGCAGATGAT GTGAGTTTGT TGGCTGCGAG TACTGGCAAT GCTAGGTTTA TTGCTCAAAG AAATCAAAAA ATTAGGGGTG GTCAACAAGT TATTGTTAAA GTGCGTCAGG TAATTCAAAA TTTTCCCCAA CAGGTAAAAG CTAAACAATA CGCTGCTGCG AGAAACCAAT GGCTACAAGC AAGGCAAATG TTGTGGAATA ACTACCCGAC TGATGGTCAA AGAGCGGGAG CTGAGATTCG AGCTGTTTGG TTAGACCGGG GAACAATTGT GAGGGCGAGG TCTGAAAGAG GTTTGGCTGG GGTATTTAAC CGACTTGCTG CTGCTGGTAT TAATACTGTT TTCTTTGAAA CCATTAATGC TGGTTATACG ATTTATCCTA GTAATGTTGC TCCAAGACAA AATCCTTTGA CAACTTCTTG GGATCCTCTG AAGGCGGCGG TGAAGTTAGC CCACGAAAGG AATATGGAGT TACACCCTTG GATTTGGGCG TTTGCAGTGG GGAACAAAGC TCATAACCAG GCTCTTGGTC AAGGAGATAG TTATTTGGGT CCGGTAATTT CGGCTCATCC TAGTTGGGTG ATGACTGATA AAAGGGGTCG CAAAAGACAT CCTTTAGATG GCAAGGTTTA TATGGATCCT GCGAATCCTG AGGTGAGGCA ATATTTGCTG AATATAATAG ATGAAATTGC TAGTCGGTAT GAGGTTGATG GGATTCACCT TGACTATATT CGCTATCCTT TTCAAAATCC TGAACGGAAT TTTTCTTATG GTTATAGTAC AATAGCGCGT AATCAGTTTC GGCAGTTGTA TGGGATAGAT CCGATGAAGA TTTCGTCACG GGATCGCCAG AATTTGTGGA GGTGGACTGA GTTTAAGATT AACCAAGTTA ATAGTTTTGT TGCTAATACT TCTAGTTTTC TCAAAAAGAA GTATCCAAGG TTAATTTTTT CGGTAGCGGT GTTTCCTTTT CCTCGTCATC AACGCTTTGA TCAAATTCAG CAAGACTGGG AAAGTTGGGT TATGAATGAG GATATTGATT TGTTGACTCC TATGACTTAC GCTTTAGATA CAAATCGTTT TCAGCGAATA ACTCAACCGC TGACAAATAC TGGAGTGTTA GGTAGTACTT TGATAACGCC GGCGGTTAAG CTTTTGAATA TTCCTGAAAT TGTAGCAGTA GACCAAATTC AAGCAGCTCG AGATTTACCT ACTGGGGGTT ATATTATTTT TGCGGCGGAA AGAATTACTG GTGGTTTCCA TGGATTTTTA ACTCGTACTC AAGGTGGGGT AGAAATGGAT AATACTAGAA GAGCTTCATT AAATACTTTT GGTAAAACAG CTAAAGTTGT GGAGGGTGTA ATTCCTTACC GTCAACCATT TATTGCTGCT GCAGATCGTT TTCAAGCTTT GAAAAAAGAA TGGAGTTTTT TGTTGGGAAA TGAGCAACTT TTCCTGAGGG AGCTTCAGCT TGAGAGTTGG GGTAATGAGG TAGGGGAGTT AGCAACAGCT TTGGAAAATT TGGCTGATAG TCCTGATCAT AGTAATTTTA ATATTGCTAA GAGAAAGCTG AGGAAATTCC AGTTAAAATT TAGAAGTAAT ATGAGTGATC ATGCTAGGCA AAATGCTTAT CAAGTTCAGA CTTGGCAAAA TCGTTTGACA GCTTTGGAGA TGTTGTTAAA TTATGGGGAA AGGGTCAAGT TAAATGATCA GAGGTTTTAA
|
Protein sequence | MNQQKQKKAL SKKNLLLLPS LLGALLSLMV LLPKAGRASV LLGIIRDFQN TDEWNQVIRR IDALGISYEP IDLRQIKTVD ELSGVRVIFL PNIEVLTEPQ VQIIEEWVKG GGKLIASGQI GQKSQLGVRQ KLRSLLGSYW AFPLSQPTIP EPKYRCLDLT CTKSTNWAPK TNNVGTVTGG ILIPAGLNST TAATWKGTSG SSAVVITPQV TYLGWHWGNT ESAAVDSIWL QAILNRYQGQ PEFSARNNNI FSLENNRRSV SDANNSNPVK IHPRSNPSPS SRKESVSPVR VQPKRNSVNT GENSNINPTA PIAENSIVNE DNIWLRRQEK KSGTENEPEN LIQAREELGS QNSNLGSKIT TVPESEIVGG KDTAVSESEN STTETTNNRG RNIWQRLQQE KEEQNQASEV VGEKDQVIDS STNSTLQNNN RGSIWNRTQE ETKTRASSSR RHPIMRLLRS LPPIEVPSAQ RDPSSGSASP GLDIRQGNYP ISRAKAYAML EELNNLLGRF ESALIAAKSA NVKVDLAADD VSLLAASTGN ARFIAQRNQK IRGGQQVIVK VRQVIQNFPQ QVKAKQYAAA RNQWLQARQM LWNNYPTDGQ RAGAEIRAVW LDRGTIVRAR SERGLAGVFN RLAAAGINTV FFETINAGYT IYPSNVAPRQ NPLTTSWDPL KAAVKLAHER NMELHPWIWA FAVGNKAHNQ ALGQGDSYLG PVISAHPSWV MTDKRGRKRH PLDGKVYMDP ANPEVRQYLL NIIDEIASRY EVDGIHLDYI RYPFQNPERN FSYGYSTIAR NQFRQLYGID PMKISSRDRQ NLWRWTEFKI NQVNSFVANT SSFLKKKYPR LIFSVAVFPF PRHQRFDQIQ QDWESWVMNE DIDLLTPMTY ALDTNRFQRI TQPLTNTGVL GSTLITPAVK LLNIPEIVAV DQIQAARDLP TGGYIIFAAE RITGGFHGFL TRTQGGVEMD NTRRASLNTF GKTAKVVEGV IPYRQPFIAA ADRFQALKKE WSFLLGNEQL FLRELQLESW GNEVGELATA LENLADSPDH SNFNIAKRKL RKFQLKFRSN MSDHARQNAY QVQTWQNRLT ALEMLLNYGE RVKLNDQRF
|
| |