Gene Information |
Locus tag | Tery_5018 |
Symbol | |
ID | 4246673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 7669368 |
End bp | 7671188 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 638109827 |
Product | hypothetical protein |
Protein accession | YP_724403 |
Protein GI | 113478342 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2942] N-acyl-D-glucosamine 2-epimerase |
TIGRFAM ID | |
|
|
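The coordinate and length fields above are mutually consistent, which a short sketch can confirm. It assumes 1-based inclusive coordinates and a CDS length that includes one terminal stop codon; the function names are illustrative and are not part of any database API.

```python
# Hypothetical helpers; names are illustrative, not taken from the source database.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(7669368, 7671188)
print(length_bp)                  # 1821
print(protein_length(length_bp))  # 606
```

Both values match the Gene Length and Protein Length fields of this record.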
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACTC CAGTAGAGTT TACTTTTTCC GACTTGATCG CCGGGTATGT TACAAACTTC GACTCAGGCA CAGATATTTT TGGGCTCAAA ACAACAGATG GCAGAGAATT TAAAGCCAAG TTAACTCCTA CTAGTTATGC TAAGCTAGTA CAAAACTTAG ACGAAGCATA CCCAGATGCT ACAGGTGCTA TGCGATCGAT GCTGGTGCCT GGTAGGTATG TATTCACTTA TGGCGTTTTT TACCCAGATA GTTCCATATT TGAAGCTAAA CAAATAGTAT TTGTCGGTCG TCAAGCAGAT GACTATATAT TTGAAAAATC AAACTGGTGG GTACACCAGG TTCGCTCCCT AGCTAATTTT TATGTCAAAG CTCAATTTGC TGGTGAAGAA ATAGACTACC GCAACTATCG CACAACTTTA AGTCTTTCTG GTGTAAGGTC TCAGGTGAAT TTCCGACAAG AAACAGATAC TATCTCCCGG ATGGTTTATG GGATGGCTAC AGCATATATG ATGACTGGGG AAGAAATTTT CCTAGAAGCA GCTGAGAAAG GTACTGAATA TCTGAGAGAC CACATGAGAT TTGTGGACTT AGATGAAGGT ATAGTTTATT GGTATCACGG TATCGATGTG CAAGGGGAAC GAGAGCAGAA AATTTTCGCT TCAGAATTTG GTGATGACTA TGATGCTATT CCCGCTTACG AACAAATTTA TGCCTTGGCA GGTCCATTAC AAACTTATCG CATTAATGGC GACCCTAGGA TAATGGACGA TACCGAGAAA ACTATTAAGT TATTCAACGA CTTTTTTCTA GATAAAACTG ATCGCGGTGG TTATTACTCC CACCTAGATC CTATTACTCT AGATCCTCTG AGCGAGTCAT TAGGTCGGAA CAAAGGTACT AAAAACTGGA ACTCAGTTGG TGACCATGCA CCAGCATATC TAATTAATCT CTGGTTAGCG ACAGAAAAAC CTGAATATGC AGATATGCTC GAATACACTT TCGATACTAT TGAAAAGCGT TTTCCAGATT ACGAAAACTG CCCCTTTGTC AATGAAAAAT TCTTTGAAGA CTGGAGTGCA GATCATACTT GGGGATGGCA GCAAAACCGG GCAGTAATTG GTCACAATAT GAAAATTGCT TGGAATTTGA TGCGGATGAA TAGCCTCAAA CCTAAGGATA CTTATGTTGA ACTGGCAAAG AAAATTGCTG AGGTGATGCC AGCAGTAGGG AGTGATCAAC AACGAGGTGG TTGGTATGAC GTGGTGGAAA GAGCATTAGG AGAAGATGAA AAAAATCATC GTTTTGTTTG GCATGATCGC AAAGCTTGGT GGCAGCAAGA ACAGTCTATT CTGGCTTATT ATATTCTTGC AGGAACTCTT AAAGATCAAG AATATCATCG TTTAGGGCGG GAAGCTGCAG CTTTTTACAA CGCCTGGTTT CTGGATACAG AAGATGGTGG GGTTTATTTC AATGTTCTTG CTAATGGTAT CCCTTTCTTG GCAAGTGGTA ATGAACGAGG TAAAGGTTCC CACTCTATGA GCGGTTATCA CTCTACAGAA TTATGCTATC TAGCTGCAGT TTATACTAAT CTATTGGTTA CTAAGCAGCC AATGGATTTT TATTTTAAGC CTATTCCTAG TGGTTTCCCT GATAATATTT TACGAGTATC ACCAGATATT CTGCCTCCTG GTAGCATTAA AATTGGTTCT GTAGAAATTG ATGGTAAACC TTACAGTGAT TTTGATGCGG ATAAACTTTT TGTGAAGTTG CCTGATACTA AGGAACGGGT GAAAGTTAAG 
GTCAATATTG TACCTAATTA A
|
Protein sequence | MTTPVEFTFS DLIAGYVTNF DSGTDIFGLK TTDGREFKAK LTPTSYAKLV QNLDEAYPDA TGAMRSMLVP GRYVFTYGVF YPDSSIFEAK QIVFVGRQAD DYIFEKSNWW VHQVRSLANF YVKAQFAGEE IDYRNYRTTL SLSGVRSQVN FRQETDTISR MVYGMATAYM MTGEEIFLEA AEKGTEYLRD HMRFVDLDEG IVYWYHGIDV QGEREQKIFA SEFGDDYDAI PAYEQIYALA GPLQTYRING DPRIMDDTEK TIKLFNDFFL DKTDRGGYYS HLDPITLDPL SESLGRNKGT KNWNSVGDHA PAYLINLWLA TEKPEYADML EYTFDTIEKR FPDYENCPFV NEKFFEDWSA DHTWGWQQNR AVIGHNMKIA WNLMRMNSLK PKDTYVELAK KIAEVMPAVG SDQQRGGWYD VVERALGEDE KNHRFVWHDR KAWWQQEQSI LAYYILAGTL KDQEYHRLGR EAAAFYNAWF LDTEDGGVYF NVLANGIPFL ASGNERGKGS HSMSGYHSTE LCYLAAVYTN LLVTKQPMDF YFKPIPSGFP DNILRVSPDI LPPGSIKIGS VEIDGKPYSD FDADKLFVKL PDTKERVKVK VNIVPN
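As a spot check, the first and last codons of the gene sequence can be translated and compared with the ends of the protein sequence. This is a minimal sketch, assuming the listed gene sequence is already the coding strand (it begins with ATG) even though the record is on the minus strand. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares; `translate` is an illustrative helper, not a library function.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same assignments; it differs only in
# which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in this record.
print(translate("ATGACTACTC CAGTAGAGTT TACTTTTTCC"))  # MTTPVEFTFS
print(translate("GTCAATATTG TACCTAATTA A"))           # VNIVPN
```

The two outputs agree with the first ten and last six residues of the protein sequence listed above.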