Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4437 |
Symbol | |
ID | 4246090 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 6839236 |
End bp | 6842421 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 638109320 |
Product | glycosyl transferase family protein |
Protein accession | YP_723897 |
Protein GI | 113477836 |
COG category | [N] Cell motility [R] General function prediction only [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG1216] Predicted glycosyltransferases [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTCTTA CTTCTTTTGC TAAAGGAAAC AAACTCTTAC GGGAGGGAAA ATTCGAGTCG GCGATCGCTT ATTATCAAAA AGCCATAGAA GAAAATCCCC AGTTTACCTG GTCTTACCAA AATTTAGGGG AAGCTCTTGA GAAAACTGGG CGGATAGAAG AAGCGATCGC TTCTTTTCGT CAGGCTGTGG CCATAGATCC GCAATCCCAC TGTTTTCTAT ACAAATTGGG GATAACATTG AGTCGGCAAG GCCAGTTTCA GGAAGCTGTG GGTTACTTAC GTCGGGCGAT CGATTTAAAC AAAAATGTGC CTGAGTTTTA TCTAGGTTTG GGAGCTGCCT TGGTGAAGTT GAGGCAATGG TCTGAAGCAG TTGAATGTAT TCATCAGGCA TTGAGGGTGT TGGATGAAAA AGTAGGAACG TTATATGAAA GAGCTCTACA GGCAGAGGGT TATTTTTATT TGGCAGAAGC TAAGTCTGGT CAAGAGCAAT GGTCTGATGC AATAAAGTTA TATTCTCAGA GTTGGGAAAT TTATCCATAT CGGGTTAACT GCTGTATCAG TTGGGCAGTA GCTTTAGGTA AGTTAGGAAG ATGGAGTGAA GCGGTGGCTT TATATCGTCA AGCTGTAGCT TTCTCTGGGG AGTCTGGTGA GGTGTACTTT GGTTTAGGGA AGGCTTTGGG ACAGTTAAAA CAATGGGAGG AGGCTGTTGT TGAGTATCGA CGAGGGATAG ATTTTGGTTT TGATGGGGCG GAGGTGCGCC ATTCTCTGGG GTATGCTTTC CTACAGTTAA AAAAATGGGA GGAGGCTATT GTTGAGTATC GTTTGGTGGT AGAGGTTGAT CCTAAGTTCG CACCAGTTCG GCACCAGCTT GGGTATGCTT TGATGCAATT GGAGCATTGG GAGGAGGCTG TGATTGAGCT GCGTCAAGCT GTGGAGTTAT ATCCTAGGTC GGCTATAGTT TGGCAGCAGT TGGGAGATGT TTTATGGCAG TTGGAGGAAG ATGGGGAGGC TGAAGAAGCT TATCAAAAGG CGACAGAACT AAATCCTGAC ATGGCTAATT TGCCAAAAGC AAAAAGTGTG GCAAGTGCAC TTTCCAAGTC AGAGATAGGG ACAATAAATA ATACAGATCG GTTTGTTCGG TTAGTACACG AAGCTGATAA GCGAGCTTTT ATAGACTACT CATACACAGA ACTATCCGCC TCCCACAATT TCGACTTAAA TTGTCTTAAT CTACATTGGG TGACTTGTGA CTTTTCTCCC GGTGCTGGGG GACACATGAC AATATTCCGA TTTATCAAAT TACTAGGTCA ACTCGGTCAT CACAATACTC TCTGGATATA CCAACCAGTT GTTCATAAGT TTGAAACAGA GGCATTAGAG ACTATATTAA AGTATTTTCA AACTTTACAG GTAGAAGTCA AGTTTATTAT CGGCAGGGAA GAGTTTGAGA GTGCAGCAGG AGATGTTATT ATTGCCACAG ATTGGGGTTC GGTCCAGTTT GCTGTCTCCA ATCCTAATTT CCACAACAGA TTTTACTTTG TTCAAGACTA CGAGCGGTTC TTTTCTCCCC AAGGAACAAA AGCACTTTTG GCTGATTTGA CTTATAGTTA TAAACTAGAT TGCATTTGTG CAGGGCCGTG GCTAGAAAAA ATAATGTCTG AAAAATACGG TTCTTGGGCT TGTAAGTTTT GGCTAGCAGT GGATACTTCG GTTTATTTTC CTCAAACAGA TGAAAAAGTT AATGATGTTG TCAAAATTGC CTTTTATTAT CGTCGTGGCA CAGAGAGAAG AGCTGTTGAA TTGGGGTTGC TGGCATTAGA AAAGTTAGCA ACTTATAGAG AAGATTTTGA AGTGCATTTC TTTGGGGGGA ATACTAATTT TGATCGTGCA CCTTTTCAGT TTAAGTCTCA TGGAATTTTA ACTGCCCAAC AGCTAAGGGA ACTTTACCAA GATAGTGACA TTGGTATTGT ATTTTCATCA ACTAATTATT CATTAGTTCC TCAAGAGATG ATGGCTTGTG GTTTACCAGT GATAGAACTG GCTGGTGAAA GTACGGAAGT TGTATTTCCC CCAGGGGTAG TAAGGTTAGC CGGTCCGGCT CCTCTGGATA TCACTGATGC TATTGTAGAG TTAATGGACT CAAAAACTCC CCGTGAAGAA CAAGCACATT TAGCAACAGA ATGGGTAAAA CAGTTTACAT GGGAACAAGA AGTAGCTAAG ATAAACGGTT TTATTCAAAA TAGGCTTCTT GAAAAAAAGC CAAATTCCAT AGTTGTTAAG GCAAAACCCT CTAAACCAAA AGCTTCTGTT TTTATACCAA CCTTGAACGG TGGTGAGTTG CTTAAACAGG TAATAGAAAG GGTGAAGGAG CAAGTCACAC CGTGGTTGTT TGAGATTGTA GTTATTGATA GTGGCTCAAC AGATGGAACC TTAGAATGGA TGAAGGCAGA CCCAGTAATT AGACTTTACG AGATACCTAA GTCCGAGTTT CAGCATGGTA AGACTAGGAA CTTGGGGGCA TCTTTGTCTG AAGGGGAAAA TATTGCTTTT TTGACCCATG ACGCTCTGCC AGTGGATAAA AATTGGTTAT ATTATCTGGT GACTACACTA GAAAATTTTC CTAATGCGGC AGGGATATTT GGCAAGCATT TAGCATATCC TGATGCAGAT GCTTTTACTA AGCGAGACCT GGAAAACCAT TTTCAGATTT TTGATGAACT ACCAGTGTAC CTTGATAAGA ATACAAATTT CAAGCTATAT AAGAACAAAG ATTTGTCCTG GAAACAAAAA CTTCATTTTT ACAGCGACAA TAATTCCTGT ATGCGTCGAT GTGTTTGGGA AAAAATTCCA TATCCAGAGA TTAGTTTTGG AGAGGATCAG GCTTGGGCTT GGCAAGTTAT TGAGGCGGGG TATGGAAAGG TGTATGGGAG AGATGCAGTG GTGTATCATT CACATAATTT TTTGCCGGAG GAGATTTTCA GTCGCAGTCT GGAGGAGGCT TCATTTTTCC AAAAAACTTT TGGGTATGAG TTGGTTAATA AAGACAACAT TTACGAACAA ATAAAGTTGC TAAATGAACA TGATAGCCAG TGGGGAAGGG ATAAAAATTT AGATGAAAAA GTAATTATAA TGAGACAAAA AAATAATGAG GCAAGAATAC ATGGCTATAT GGCAGCTTTG AAGTAG
|
Protein sequence | MLLTSFAKGN KLLREGKFES AIAYYQKAIE ENPQFTWSYQ NLGEALEKTG RIEEAIASFR QAVAIDPQSH CFLYKLGITL SRQGQFQEAV GYLRRAIDLN KNVPEFYLGL GAALVKLRQW SEAVECIHQA LRVLDEKVGT LYERALQAEG YFYLAEAKSG QEQWSDAIKL YSQSWEIYPY RVNCCISWAV ALGKLGRWSE AVALYRQAVA FSGESGEVYF GLGKALGQLK QWEEAVVEYR RGIDFGFDGA EVRHSLGYAF LQLKKWEEAI VEYRLVVEVD PKFAPVRHQL GYALMQLEHW EEAVIELRQA VELYPRSAIV WQQLGDVLWQ LEEDGEAEEA YQKATELNPD MANLPKAKSV ASALSKSEIG TINNTDRFVR LVHEADKRAF IDYSYTELSA SHNFDLNCLN LHWVTCDFSP GAGGHMTIFR FIKLLGQLGH HNTLWIYQPV VHKFETEALE TILKYFQTLQ VEVKFIIGRE EFESAAGDVI IATDWGSVQF AVSNPNFHNR FYFVQDYERF FSPQGTKALL ADLTYSYKLD CICAGPWLEK IMSEKYGSWA CKFWLAVDTS VYFPQTDEKV NDVVKIAFYY RRGTERRAVE LGLLALEKLA TYREDFEVHF FGGNTNFDRA PFQFKSHGIL TAQQLRELYQ DSDIGIVFSS TNYSLVPQEM MACGLPVIEL AGESTEVVFP PGVVRLAGPA PLDITDAIVE LMDSKTPREE QAHLATEWVK QFTWEQEVAK INGFIQNRLL EKKPNSIVVK AKPSKPKASV FIPTLNGGEL LKQVIERVKE QVTPWLFEIV VIDSGSTDGT LEWMKADPVI RLYEIPKSEF QHGKTRNLGA SLSEGENIAF LTHDALPVDK NWLYYLVTTL ENFPNAAGIF GKHLAYPDAD AFTKRDLENH FQIFDELPVY LDKNTNFKLY KNKDLSWKQK LHFYSDNNSC MRRCVWEKIP YPEISFGEDQ AWAWQVIEAG YGKVYGRDAV VYHSHNFLPE EIFSRSLEEA SFFQKTFGYE LVNKDNIYEQ IKLLNEHDSQ WGRDKNLDEK VIIMRQKNNE ARIHGYMAAL K
|
| |