Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_3840 |
Symbol | |
ID | 4242291 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 5931920 |
End bp | 5933398 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638108772 |
Product | hypothetical protein |
Protein accession | YP_723355 |
Protein GI | 113477294 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.83902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.693476 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGTAA AATTAAAACG ACCAATTTTA GTAGGGGGAA TAGGTTTATC TCTACTGTTA TGGCTACTCT CAGAGGTCCA AAATTTTATA ACAGACAATA GTGAACCTAC TATTTTAGGA ATTATAGTAG TCAGTCTGGG GGTTTGGTTA TTAAAAAGAC CAAAATATCT ACCTTCCCAT AAACCAAGTA TAATCGTATC ACCTACAAAA CAAGCAGCAG AAGAGGCGAT CGGTTTATTA TCTATTACTA TAGATAAGGT TTCTATGACA GTAGAAGGTA TTAATAACTC TGAAAAAATA AGTAATGATA TTACTCAACT ACATCAGCAA GTAAAAATTA TTACTCAGGA ATTAGAAAGG CAAGAATTAA GTATAGTTAT TACAGGCAAT AAAGGAGTTG GCAAAACTAC TTTTACTGAA ATATTAAAGT CTCAATGGAA CTCTAAAAAA TCACCAAAAA TAAAAGTTGT TGATATAACT TGGGAGCCAG AATGGGCAAC AAACAATGAA TATGCTAAAG TTAATCCTCT ATCGCCTTAC GATTTGATAT TATTTTTGAC TACAGGTGAC TTAATAGATT CAGAATTTCA AGCTTTATCA AAACTAACAA CTCTTGGTCA ACGTTTTATC TTAATTTGGA ATAAACAAGA CTTATATTTA CCAGACCAAA AACCACAAGT CATCCAAAAA ATTAAAGAGA CATTATCTAC TATTAACTCA GAGAAAAATT TAGTAGGAAT TTCTGTAAAA CCTAACCCTA TTAAAGTGCG AAAATATCAG CAAGATGGCA CTATTCAAGA ATCTATAGAA CAACCATTAC CAGAAATATC TCAATTAACA GAAAAACTAA ACCAACTATT AGAGGAAGAA AGAGAAAAGT TAGTCTGGGC AACTACTATA AGAAAAGCGG AAATATTTAG ATTAGAAGCT CAAAATATTT TAAATAAAAT CAGAAAAGAA CGAGCGCTTC CTGTCATTGA AAAATATCAA TGGATAGCTG CTGCTACAGC ATTTGCTAAT CCAGTTCCAG CTTTAGACTT ATTAGCAACA GCAGCTATAA ATACTCAATT AGTAGTAGAC TTAAGTGCTA TATATGAGCA AAAATTTTCT ATAGAAAAAG GTAAACAAGT AGCAGGTACT ATGGCAGAGT TAATGTTAAA ACTAGGACTA GTAGAACTAT CTACCAAAAC ATTAACTACT CTACTTAAAA GTAACAGTTT AACTTTTGTT GCTGGTGGTG CATTTCAAGC AGTAAGTGCT GCTTATTTGA CAAGAGTAGC AGGTATGAGT TTAGTAGAAT ATTTGACTAC TCAAGCAGAT ACTAATTCCG TGAATATTGA TCAATTAGGA ACAATTATTC AAGGTGTATT TAGTAAAACC CAAGAAAATA ATTTCTTGAA GTCTTTTGTT ACTCAGGTAA TGAGTCATAT TTTGCCACAG GGAAAACAAT TAGAATTTGT CTCATCTCCG GCGCAATAA
|
Protein sequence | MAVKLKRPIL VGGIGLSLLL WLLSEVQNFI TDNSEPTILG IIVVSLGVWL LKRPKYLPSH KPSIIVSPTK QAAEEAIGLL SITIDKVSMT VEGINNSEKI SNDITQLHQQ VKIITQELER QELSIVITGN KGVGKTTFTE ILKSQWNSKK SPKIKVVDIT WEPEWATNNE YAKVNPLSPY DLILFLTTGD LIDSEFQALS KLTTLGQRFI LIWNKQDLYL PDQKPQVIQK IKETLSTINS EKNLVGISVK PNPIKVRKYQ QDGTIQESIE QPLPEISQLT EKLNQLLEEE REKLVWATTI RKAEIFRLEA QNILNKIRKE RALPVIEKYQ WIAAATAFAN PVPALDLLAT AAINTQLVVD LSAIYEQKFS IEKGKQVAGT MAELMLKLGL VELSTKTLTT LLKSNSLTFV AGGAFQAVSA AYLTRVAGMS LVEYLTTQAD TNSVNIDQLG TIIQGVFSKT QENNFLKSFV TQVMSHILPQ GKQLEFVSSP AQ
|
| |