Gene Information |
Locus tag | Tery_3216 |
Symbol | |
ID | 4243811 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 4920861 |
End bp | 4922648 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 638108217 |
Product | hypothetical protein |
Protein accession | YP_722808 |
Protein GI | 113476747 |
COG category | |
COG ID | |
TIGRFAM ID | |
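
The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4920861
END_BP = 4922648
GENE_LENGTH_BP = 1788
PROTEIN_LENGTH_AA = 595


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1788
print(expected_protein_length(GENE_LENGTH_BP))  # 595
```

Under these assumptions both computed values agree with the Gene Length and Protein Length rows.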
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00990527 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTAATA GCTATATTTA TGGTACTGGA GAAGACGATC TGCTATATGG TACCGGAGGA AACGATAAGG CGTATGGAAG CTCCGGAAAT GATATTATCG AAGGTAAAGG AGGTGATGAT GAACTTTATG GTAATGGTGG GAATGACACA ATTAAAGGAG AAGAAGGCGA TGATACAGTT TTAGGTCATC CTGGAGATGA CTCAATAGAA GGAAACGCAG GTGATGATAA GCTTTATGGT GGTGATGGGG ATGACCTAGT CTCAGGGGGA GAAGGAAACG ATATGCAGAC TGGAGGAGGA GGAAACGATA CCATAGAAGG AGGAGAAGGT GATGACAAAC TTTATGGAGG GCCTGATGAT GACTTGATCA AGGGAGAAGA CGGTCAAGAT AAGCTTTATG GTAATGATGG GAATGACGCA ATGGAAGGAG GTGCGGATGA TGACACAGTT TTAGGTCATG CTGGAGATGA CTCAATAGAA GGAAATGCAG GTGATGATAA GCTTTATGGT GGTGATGGGG ATGACCTAGT CTCAGGGGGA GAAGGAAACG ATATGCAGAC TGGAGGAGGA GGAAACGATA CCATAGAAGG AGGAGAAGGT GATGACAAAC TTTATGGAGG CCCTGATGAT GACTTGATCA AGGGAGAAGA CGGTCAAGAT AAGCTTTATG GTAATGATGG GAATGACGCA ATGGAAGGAG GTGCGGATGA TGACACAGTT TTAGGTCATG CTGGAGATGA CTCAATAGAA GGAAATGCAG GTGATGATAA GCTTTATGGT GGTGATGGGG ATGACCTAGT CTCAGGGGGA GAAGGAAACG ATATGCAGAC TGGAGGAGGA GGAAACGATA CCATAGAAGG AGGAGAAGGT GATGACAAAC TTTATGGAGG GCCTGATGAT GACTTGATCA AGGGAGAAGA CGGTCAAGAT GAGCTTTATG GTAATGATGG GAATGACGCA ATGGAAGGAG GTGCGGATGA TGACACAGTT TTAGGTCATG CTGGAGATGA CTCAATAGAA GGAAATGCAG GTGATGATTT ACTTTCTGGT GATTCTGGGG ATGACGTAGT CTCAGGCGGA GAAGGAGAGG ATATACTGTA CGGAGGAGCA GGAAACGATG AACTTTATGG TGGTGATGAT AATGATACCA TTGATGATGG TGGTGGGGAT GACATACTAG AAGGAAATAC GGGTAATGAT ATACTTCGTG GTCGTGGAGG TCAAGACGAG CTCATAGGAG GTGATGGTGA TGACCTCATC ATTAGTTATA GTGATGCAGG AGAACCTGAA ATTGCCCAAC AGACAAATGA ATCTAAGGTT TATCCAGATC AACCCTTTTT TGAAGCTAAT GATACTTTAA CTGGAGGATC AGGGGCTGAT ACTTTCCTAT TTAGACCGTT GATGAATGCC AAGCCTGAGA TTATTGAGAA ACACAATATG CACAATATGC ACCAAGATAT GCATAATATG AACTTGCATA AACAAATCGC TGGCGAAAAT GGTTCAGTCC ATGATCACTG GGTTGATAGT ATTGGTGATG ATATCATCCT TGACTTTAAT AGAAGTGAAG GGGATAAAAT TGAGATTAAG GGTCATACAG TAAAAGTAGA AAAAATTGAA GAGCTTGATG ATGGAAGTGG ATCTATTGTT CATTTAATTA GTGATCAAGG GGCTAATGGT GGAGCCCACC ACCTAGATAA ACTAGGAACA ATTACTGTTT ATGGTGATCT GGTTACAGAA TCAGATTTAA TAGCAGATAT CTTTGGGGAT GGTGCTGGTA TGGGTTGA
|
Protein sequence | MSNSYIYGTG EDDLLYGTGG NDKAYGSSGN DIIEGKGGDD ELYGNGGNDT IKGEEGDDTV LGHPGDDSIE GNAGDDKLYG GDGDDLVSGG EGNDMQTGGG GNDTIEGGEG DDKLYGGPDD DLIKGEDGQD KLYGNDGNDA MEGGADDDTV LGHAGDDSIE GNAGDDKLYG GDGDDLVSGG EGNDMQTGGG GNDTIEGGEG DDKLYGGPDD DLIKGEDGQD KLYGNDGNDA MEGGADDDTV LGHAGDDSIE GNAGDDKLYG GDGDDLVSGG EGNDMQTGGG GNDTIEGGEG DDKLYGGPDD DLIKGEDGQD ELYGNDGNDA MEGGADDDTV LGHAGDDSIE GNAGDDLLSG DSGDDVVSGG EGEDILYGGA GNDELYGGDD NDTIDDGGGD DILEGNTGND ILRGRGGQDE LIGGDGDDLI ISYSDAGEPE IAQQTNESKV YPDQPFFEAN DTLTGGSGAD TFLFRPLMNA KPEIIEKHNM HNMHQDMHNM NLHKQIAGEN GSVHDHWVDS IGDDIILDFN RSEGDKIEIK GHTVKVEKIE ELDDGSGSIV HLISDQGANG GAHHLDKLGT ITVYGDLVTE SDLIADIFGD GAGMG
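
The sequences above are printed in space-separated groups of 10 characters. A minimal sketch of how such text might be joined into a continuous string and its GC fraction computed is shown below. The helper names `normalize` and `gc_fraction` are hypothetical, and the sample is only the first three groups of the gene sequence, so its GC fraction is not expected to match the GC content reported for the whole gene.

```python
def normalize(grouped: str) -> str:
    """Remove whitespace between sequence groups and uppercase the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three groups of the gene sequence above, used as a short sample.
sample = normalize("ATGTCTAATA GCTATATTTA TGGTACTGGA")
print(len(sample))                    # 30
print(round(gc_fraction(sample), 3))  # 0.3
```

Applying `normalize` to the full Gene sequence field and checking that its length equals the Gene Length row is a quick way to catch copy or truncation errors.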