Gene Information |
Locus tag | Tery_3937 |
Symbol | |
ID | 4244020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6090836 |
End bp | 6093529 |
Gene Length | 2694 bp |
Protein Length | 897 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 638108859 |
Product | glycosyl transferase family protein |
Protein accession | YP_723441 |
Protein GI | 113477380 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family |
TIGRFAM ID | |
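The coordinate and length fields above agree with one another, which can be checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one untranslated stop codon; the record does not state either convention, but the numbers are consistent with both.

```python
# Values copied from the Gene Information fields above.
start_bp = 6090836
end_bp = 6093529
reported_gene_length = 2694     # bp
reported_protein_length = 897   # aa

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 2694 0 897
```

A remainder of 0 indicates the gene length is a whole number of codons, as expected for an unspliced CDS.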
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0634036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000739311 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGCCC AAGAAAAAAA ACAACCCAAT ATTAATCATA TTCTCACCCT AGCTATCCTG TGGTCATTAG GGGCCGTGAG CGATCGCCTC TGGTTTACCT TCGATAAATC AGTTCCAGCT TGGGATCAGG CCGACTATCT CACAAGTTCC CTAACCTACT GGCGTGCTTT GCAAAACCTA CAATTATTCT CTGGAGAATG GTGGAAAAAC CTTTGGCAGC TTTCCCCAAA AGTACCCCCC TTAACTTATA TTCTTGCAGT TCCCTTTCAA AATATTTTTG GTAGAGGAGC TGACCAAGCC ACCTTAGTAC ATCTATTATT TAGTGCTATT TTACTAACCT CAGTTTACAG TTTAGGCAGC AAACTATTTA ATCAAAAAGT AGGTTTTTGG GCAGCAGTAT TATGTATATT ATTTCCAGGG TTATATAGAT ATCGTTTACA ATTTTTACTC GACTATCCCC TAACAGCAAT AGTAACCTTA AGCTTCACTT GTTTAACACT TTGGTATTTT TCAAAACCGA ATCATAAACA GGAAATAGAC AACAGCGAAA AGACAACAAA AAAAACCGAA AAAAAAGCAA TTAACTTAGA CTTAGAAAAA ACCAACCTTA CCAAAACCAA CATCACAAAA ATATCTCCTC AAATAGATCA ACTATCTCAA TCCAAAAATC TCAAAAATTG GCTATTAGCA ACTGCTTTTG GCTTAACTTT TGGCCTAGCA ATTTTAGTTA AGCAAACCAC AATATTTTTT CTGTTTTTTC CCTTACTTTG GGTAACTATA AATATTCTCA AAAAAAAGCA ATGGAACCAA CTAATTCAAC TCACTTACTC TCTATGCTTA TCCATCACAA TTTTTTACCC TTGGGCAAGA ACAAATTGGC TATTAATGTT AACCTCCGGT AAACGAGCAA CAATAGACTC TGCCATAGCC GAAGGAGACC CCAACCTATT AAGTTTAGAT GCTTGGATTT ATTATGGAAA ACTACTCCCC AACCATATTT CCTTACCTCT ATTAATCATT CCAATTAGTG GATTAATTTT TTACTTAATC CGGCTCAATA AAAAACAAAA AAACCTTTCC ACTCAGCTCC ATTCTTTTCA ATGGCTATCC ATATTTTTAA TCGGCGGATA TCTAATTTGC TCCCTCAACA TCAACAAAGA CTTTCGTTAC ACCTTACCAT TATTGCCAAC CTTATCAATT TTATTAGCCT TTGGTTTAAT GCAATTTCCA AGAAAAGTAG GAAAACAAAT TCGCTATCTT ACTATTAGTT TAGCAATTAT TTTAATGCTA TTAAATATCT GGTCTGTAGG TGGAAACTTC CCCAGAAAAA TCACCGCTTG GTTAAGTCCT GGTGCAGCTT ATTCTGCTCA TTTAGGCAAA GAATGGCCCC ATAAACAAAT AATTGCTGAA ATCATCAAAA CTAACCCCTA TTTACGGTCA ACCTTAGGAG TTTTACCTTC CACTCCAGAA ATTAATCAAC ATAACTTAAA TTATTACGGT GCTCTGCAAA ACTTACAAGT CTATGGTCGG CAAGTAGGAA CCAACTTCAA ACAAGTAGAA CAAGATGCGC GATCGCTATC TTGGTTTATC ACAAAAACCG ATGAACAAGG TTCTGTCAAA AGAATCAAAA AAGCTCAAGC TGCTATTGTT AACCTCATCG AAAAACACCC AAACTTTCAA CTACAAAAAA GCTGGCAGTT ACCAGATAAT AGTAACTTAA ATCTATATCA TAATCAGTTT TTACCAGTAG AAGTTCATCC CCTGACCAAA CCACAAAAAA AAGTGAAACT AGACTATATA 
ATTTTATCCC CAAAAATTCG CTCAGGAACA ACCATACCTA TTACCTACAA ATGGTCCGGT TCTTGGCAAA AACTACAGTC AGGTTTAGTC TTATTAACCT ATCGCCGAGA AAATATTAAT CAAACAGAAC TTCAGCCATT AACTTCTAAC AATAATGCAT CAGAAAAACC TCTTACCAAA TTCATTCACG ACCACGGTAT CGGCATGGGC AGTTTACACC CTAGTCTACT TCAAGCAAAT AAGTCTGAAG TCGGGTTCGA AATTATTGAA CGAACAGGAA TGTTACTCCC ATCTAACATT GTACCGGGAA CTTATATATT AGAAGCCACC TATCTTAACC GAGAAACCGG GGAAAATTAT CCCATTTCTG TAGAACCAAC CCTGCGAGTA CAGGTTGAAC CCCAAGCACA AGCGTTACCA GCACCAGAAC TAGATCTAGT AACCCAGTTG CGGACGTGGG GCCAAAAGTT ACCCCAGGGA ATAAAAGGTT TGGAAACCAT ATTTGCGGAA GTAGCGCGAG TTAACCAATA CGATCCAGTT CAAGACTATA CTCAGCAAGC AGAAAAAACT TTAACCTATC GTTTGCAACA GGAACCAGAT AATTTAGAGT GGCTCTATAG TTTAGCTTTA GCCCAAGTTT TACAAAAAGA TGCCCAGGGA GCGATCGCTA CCTTAACTAG AGTAACTAAA CTAGATGCTC AAAACGCCTT TGCTCATGCT TATTTAGCTT TTGTCTATTT ATACGATCTT AATCCTGGAG CAGCGAAAAT AGCATTAAAG TCAGCTTTAG AAATTGATCC CCATCAACCA GAAATTAGGT CATTAAATGG TATTGCTGCT TTAATGCAAG GAAATTTTAT TCAAGCATGG CATAACTTGA TAACTCAAAA ATAA
|
Protein sequence | MTAQEKKQPN INHILTLAIL WSLGAVSDRL WFTFDKSVPA WDQADYLTSS LTYWRALQNL QLFSGEWWKN LWQLSPKVPP LTYILAVPFQ NIFGRGADQA TLVHLLFSAI LLTSVYSLGS KLFNQKVGFW AAVLCILFPG LYRYRLQFLL DYPLTAIVTL SFTCLTLWYF SKPNHKQEID NSEKTTKKTE KKAINLDLEK TNLTKTNITK ISPQIDQLSQ SKNLKNWLLA TAFGLTFGLA ILVKQTTIFF LFFPLLWVTI NILKKKQWNQ LIQLTYSLCL SITIFYPWAR TNWLLMLTSG KRATIDSAIA EGDPNLLSLD AWIYYGKLLP NHISLPLLII PISGLIFYLI RLNKKQKNLS TQLHSFQWLS IFLIGGYLIC SLNINKDFRY TLPLLPTLSI LLAFGLMQFP RKVGKQIRYL TISLAIILML LNIWSVGGNF PRKITAWLSP GAAYSAHLGK EWPHKQIIAE IIKTNPYLRS TLGVLPSTPE INQHNLNYYG ALQNLQVYGR QVGTNFKQVE QDARSLSWFI TKTDEQGSVK RIKKAQAAIV NLIEKHPNFQ LQKSWQLPDN SNLNLYHNQF LPVEVHPLTK PQKKVKLDYI ILSPKIRSGT TIPITYKWSG SWQKLQSGLV LLTYRRENIN QTELQPLTSN NNASEKPLTK FIHDHGIGMG SLHPSLLQAN KSEVGFEIIE RTGMLLPSNI VPGTYILEAT YLNRETGENY PISVEPTLRV QVEPQAQALP APELDLVTQL RTWGQKLPQG IKGLETIFAE VARVNQYDPV QDYTQQAEKT LTYRLQQEPD NLEWLYSLAL AQVLQKDAQG AIATLTRVTK LDAQNAFAHA YLAFVYLYDL NPGAAKIALK SALEIDPHQP EIRSLNGIAA LMQGNFIQAW HNLITQK
|
| |
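The sequence fields are grouped into 10-letter blocks separated by spaces, so they need cleaning before any calculation. The sketch below shows one way to strip the grouping and compute GC content; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs only on the first two and last two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}

# First two and last two blocks of the Gene sequence field, for illustration only.
head = "ATGACTGCCC AAGAAAAAAA"
tail = "TAACTCAAAA ATAA"

print(gc_percent(head))                          # prints: 35.0
print(clean_sequence(head)[:3])                  # prints: ATG
print(clean_sequence(tail)[-3:] in STOP_CODONS)  # prints: True
```

Passing the complete Gene sequence field to `gc_percent` would allow a comparison with the reported GC content; that full calculation is not reproduced here.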