Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1787 |
Symbol | |
ID | 4243769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2728673 |
End bp | 2730601 |
Gene Length | 1929 bp |
Protein Length | 642 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638106911 |
Product | hypothetical protein |
Protein accession | YP_721519 |
Protein GI | 113475458 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.799835 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACTA ACAATCATTT ATCCATAAAT CCTTATGTTT TTGGCAAACC AATTTATGAA TACAACAACT TATTTGGTAG AAAAAATGAT GTTGATAAAA TTAAAGATCA CATCATTAAC AAAGATATAA AAATAACTTT ATTGCATGTC CAAAGACGTA TTGGTAAAAC TTCATTGATA ACTTGTTTGC CTCAGTCTTT CACTGAGGAG CAGAATGGTG TTAAGTTTGT TACTTTTTCA TTTCAAGGTT ATAAAGATAA GCCAATCCCT GAAATACTAA ATTATCTTGC TGATGAGATC GCTGGTACTA TTCAACTTCC TCAAAAAGTA AGAGATCAGG CTGATACTAC GCACAACTTT TTTGAACTTT TTTTGCCGAA AGTTATCGAT CAATATTTGT CAGGTCAAAA GTTGGTTCTT CTCCTTGATG AATTTGATGT TTTAGAAGAA AAAGATAAGA AAGGAAAAGT ATTATTTGAT TACTTAAAAA AAGCTGTTAA GGAACAAAAA AAACTATTTG CTATTCTGGT TTTTGGTAGA CCCTTAAAGG ATATGAAGTA TCTAGAAACA TTTTTACAAG AAGAAGGTCA AGAGACTATA GAAGTTGGTT TACTGGATTA TGAAGGTACA CAAGATTTGA TTGTTGAGCC ACTTAAGAGA ATACAGAGCG TATTTAGATA TGAAAAAAGT GCAATAGATA GAATTTGGGA ACTATCTGCT GGCCATCCTT CTTTGACACA ACTGCTGTGC TCAAATGTTT TTGTTCATTG TAGGAATAAG CAACAGAGTG TAGTTCGGAA AGACGATGTA GACTCAATTT TAAGTCAAGC AATGGAAGAA GGCCAGGCAA TATTACAAGG CTTTCTAGAG CCTTTAAGCG ATATTGAAAA GTTATTTTTT TTTGCAGTAG CAGAAGCTCA AGAACAAGGC ACAGACCCAT TAAAGATTCT GAAAATAATA CAAAAAACTA CAATAACACC GGCAGATTTT AGAAGAGCAC GAGAGCGTTT AATAGAGTTA GGCTTTGTAG AAAAAAATGG CAAAGGTCTT AAAATAAAAG TTGAATTAGT CCGGCTCTGG CTAATAGAAA AAAACCCCTT ACCCAACAAT AAACAGAGGA AACCGAAGGA AGGAAGAAAA AAGATCAAAC GTCATATAAC CTCAAGTCAA CCCAACAGAC CTAACCCAGT TGCACAATTC ATTGCTTTTA TTGCGTTGAT AAGCGTTATA GTTTTTATTG GACAGAAGTT ACTCTCTAGG ATTGATAGCT CCAAAAACTA TGAACGCTTT CAGTCTGACT GTTACAGACT ATCAGAGGAA ATAAGCAACG CTTTAGAAGA GAAAAAAGAT ACAACGCAGT TGCAAGTCAT CAAAAAAGTT AGAACTGAAT GGTCGAGAGA AAAAAAAGGC TTATTAGACA AACAATGCCC ATATTCTTAT GAACTAGATG CAAAATATAA TGCATTACTA CAGTACTATG GACAAAGTAA AGTAGATACT GGAAACTTTG ATGAAGGTAT AGAAGCATTT TGTGAGATTA CCAGTGAATA CAAGAATTTT TCTGACATTA AAAAAATCTT TGAAAGATGG GTACTAATAG ACAAAAGATT ATCTAATGAA AGTACAAAAA GGGTGCTGAA GCAAATAATT AAGCAAAATC AATCAGGAAA TGATTGCCTT GTTTATTCAT TTAAAGACGA TAGAAATAAA AATGATCTGT ATGACCTGAA AGCTCAAGTT CATGCTGATG ATTATGAGTA TGGCGAAGCG GTCGAGTCAT ATTGCAAAAT TACAGAAAAC TATTATAAGT TTGAGACTGT TGTTAAACAG CTAAAAAAAT TGAAACGAGA AAATGTAGAG AAAGTAGAGG AAAAACTCAA AGAATTAAAC GATCCGTGTC CAGCATTTCC TCCCTCACCA GACAATTAA
|
Protein sequence | MQTNNHLSIN PYVFGKPIYE YNNLFGRKND VDKIKDHIIN KDIKITLLHV QRRIGKTSLI TCLPQSFTEE QNGVKFVTFS FQGYKDKPIP EILNYLADEI AGTIQLPQKV RDQADTTHNF FELFLPKVID QYLSGQKLVL LLDEFDVLEE KDKKGKVLFD YLKKAVKEQK KLFAILVFGR PLKDMKYLET FLQEEGQETI EVGLLDYEGT QDLIVEPLKR IQSVFRYEKS AIDRIWELSA GHPSLTQLLC SNVFVHCRNK QQSVVRKDDV DSILSQAMEE GQAILQGFLE PLSDIEKLFF FAVAEAQEQG TDPLKILKII QKTTITPADF RRARERLIEL GFVEKNGKGL KIKVELVRLW LIEKNPLPNN KQRKPKEGRK KIKRHITSSQ PNRPNPVAQF IAFIALISVI VFIGQKLLSR IDSSKNYERF QSDCYRLSEE ISNALEEKKD TTQLQVIKKV RTEWSREKKG LLDKQCPYSY ELDAKYNALL QYYGQSKVDT GNFDEGIEAF CEITSEYKNF SDIKKIFERW VLIDKRLSNE STKRVLKQII KQNQSGNDCL VYSFKDDRNK NDLYDLKAQV HADDYEYGEA VESYCKITEN YYKFETVVKQ LKKLKRENVE KVEEKLKELN DPCPAFPPSP DN
|
| |