Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2707 |
Symbol | |
ID | 4244970 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4187462 |
End bp | 4191922 |
Gene Length | 4461 bp |
Protein Length | 1486 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638107770 |
Product | glycosyl transferase family protein |
Protein accession | YP_722369 |
Protein GI | 113476308 |
COG category | [M] Cell wall/membrane/envelope biogenesis [N] Cell motility [R] General function prediction only [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis [COG1216] Predicted glycosyltransferases [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.228337 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATCAA TAAATCCTCA GGAAACTGCA GCAGAAAATT TCCACAAAAA AGCAGAAGCT TATTTAGCAG AAAAGAAATT TGACGAAGCG ATCGCCTCCT GCGAATTAGC AATAAAAATA GAAGATAATT ATTTTCCTGC CTACAAAACT TTAGGGAATA CTTGGCAAGC TCAAGGAAAA TTAGCAGAAG CAGAAAATTG GTACAAAAAA GCGTTAGAAA TCAAATCAAA TTGGCCGGAA ATATATGCTA ATCTAGGCAG TTTATATGCC ATGCAGCAAA AATGGGAAGA AGCAATATCC TATTATCAAA AAGCTGTAGA TATCAAACCA GATTTTGCTG GTGCTTATCG CAATATGAGG AAAGTATGGT TAAGTTTAGG CAATCAAAAA TTAGCTACAT ATTGTCAGTA TAAAGTATTG TCTATAGAAC CAGAAAAAGC TACCTTTGAA GAATTTATGA ATGTAGGGAA AACTCTGGAA AAGGAAGGTA GCTTAAATGA CGCAATATCT TGTTATCGAA TGGCAACTAA ATTAAATAGT AATTCATCAG AAGCTTATCA AAATTTAGGA GAAGCTTTAA AAGAGCAAGG AAACTTAGAT GAAGCGACAG TCTGTTGTCG GAAAGCGATA GAATTAAAGG TAAATGGTAA GAAAATAGAA AATTCAGATG ATACTTTAGG AGCAGCGAGT AAATACAATT TAATTTCTAG TCAAGAGTTA AATTCTGGAG ATACTTTAGT TGGTAGTTTA AGTAGTAATA AAGTTAATGG TCATAACACT CTTGTGAGAG CAACTATCCC AACAACTTCT CCGACGGTTA ATGCTGAAGA TATAGAAACT TATATGGTAG AGGCAGAGAC TTACGTTAAC CAGAAAAAAT GGGAACAAGC AATTGCTGTA AGTAAGCAAG TTATTCAAAC TAAAACAGAA CCCAAAGCTT ATAAAATAAT TGGTAATTCT TTACAAGCTA TGGGTAAATT ACAAGAAGCA TTAGATTGGT ACAATAAAGC ATTAAAAATT AAGCCTGATT TTGGAGAAGT ATATGCAAAT ATAGGAACTA TATTTGCTCA ACAAAAACAG TGGGGACAGG CTATACAAAA TTATCTAAGA GCAATTGAAA TTAAGCCAGA ATTTGCGGGA GCATATCGAA ACTTAGCGAA AATTTATACT CAGGTTAATA AGTCACAAGA AGCAGCAGAA TACCTATATC AGGCAATAAG ATTAGAACCA GGAAAAGCTA CTGCACAAGA TTTTTTATTC ACTGGCAATA CTCTGAGTGA AAATGGTAAA TTAGAACAGG CGATCGCTTG TTATCAACAA TTAATATCTG CTGATCCAAA TAGTTTTGAA GCTTATGAAA AACTAGGAGA TAGTTTACTA AAGCAGGGGC AGTTAGAGCT AAGTTTACAG AATTATAAAA ATGCTCAAAA ACTAAAACCA TATTCTACAG AAATTAAGCA AAAAATTGGA GAAATTTATT ACAGATATGG AGAATATTTC CAAAAAAAAG AAAAAGTAGA AGAGGCAGTA AAAGCTTATC GTCAAGCAAT AGAAAATTAT CCACAATATG ATATACCTTA TGGAAAATTA GGAGAAGTTT TTTCCCAACA GGAAAAATGG GAAGAAGCTG TCAAAGTATA TGAAAAAGCA AGTCAAATAA AACCAGATAA TTCTTGGTAT TATAACAGCT TAGGAGAAGC TCTAAAGAAG TTAGAAAAAT GGGAAGAAGC TGTTATGGCT TACCGCAAAG CTATACAATT AAATCCTGAC TTTTCTTGGT CTCATAATAA CTTAGCAGAC TGTCTAGTAA AACTAGGGAA ACGGGAAGAA GCTGTTGTAG CCTATCGTCA AGCAATAAAA TTAAAACCTG ACTTTACTTG GTCTTATATT AATTTAGGAA ATACTCTTTG GGAAATAGGA AATTGGCAAG AGGCGATCAA CCCCTATAGT CGCGCTTTAG AATTAAAAGC AGATCTGCCA GAAACTTATC AAAAATTGGG ACATGCTCTG AAAAAACGTG CTGAGTTAGA TTTAGAAGAA TCAATTAAAT GGTATCGTAA AGCTATAGAA AATGACCCCG ATAATGAGAA GCTTTATCAC AAAGCGTTAG AGGTTAAGCC AGAAGACCAC ACACTTTATT TACAACTAGG GAATACATTA GTAAAAAAAG GAAAAACTCA TGGAGCGATC GCCTTTTATC AGTTAGGTTT ACAAATTAAT CCAGATGATT CAGAAATTCA AGGGCAACTA GAAAAAATAT TACCGAAAAA AAATACTTTA AGGGAAAAGA AAAAAGATAA TAGGAAAAAG AATAAAGTTC CTATTTTTCC ATTACCTTCT AGTCCTGTTA TTGAAAAAAA TGAAGAGAAA AAAGTTCCTA TTTCTCCATT ACCTTCTAGT CCTGTTATTG AAAAAGAAGA TACCTATCAA TTATGGCTTA AGGAAAACTT ACCTAACTCA GACCAACTCA AGAAAATGTC AGAAAATTTA AAGATTTTCC GATACCAACC ATTGATTAGT ATTATACTGC AAGAAAAATC AGGATCTCCA TACATTCAAA AGACGATTGA ATCATTAATA AATCAAATTT ATCCTAACTG GGAACTTTGC TATATTTCTG ATAAACTTCC GCAAAATATA GAAGCAAACA ACAAAATTAA ATTAGTCATA AAAAGTGAAA ATAGGGATAT TGCTACAGAC TTAAATGCGG CATTAGCATT AGCAACAGGA GATTTTGTTA CTCTATTAAA TCCTGAGGAT ATTTTAACAC CAGATGCACT TTATGAAATG GTGTTATTTC TGAATAGATA TCCAGATTCA GATATGATTT ATTCTGATGA AGATAAACTG ACCAGTGAAG GAAAATTAAT TGAACCATAC TTTAAACCAA ATTGGTCTCC AGATTCATTT TTATCCCGAA TGTATACAGG TCATTTGTGT ATATATCGCC GAGAATTATT GGAAAAATTG AATGGTTTTC GAGTCGGATA TGAAAGTAGT TATGAATATG ATTTAATTCT CCGTTTAAGC GAAGTAAGTC AAAAAATATT TCATATTCCT AAGGTACTTT ATCACAGCAG AATTCCGGAA AATGGAGGAG AAATAGATTG GGAAATATCA AAAGAAACTA CTAAAAAAAC ATTAACTGAA GCATTAGCCA GAAGAGGAGA AATGGGAACA GTTTATAGTG TGCCCAACCA TCCTGGTTTT TATCGAGTCC GATATCAAAT ATCAGAACAA AAATTAGTCA GTATAATTAT TCCCACTAGG AATTTAGGAA ATATTTTAGA TAGATGTTTG GAGTCAATAT TTACTCAAAC TACCTATCCA AATTATGAAG TAATAGTAAT AGATAATGGT AGTGATGAAC CAGAAACATT ATCTATTATT GAGAAGTGGA AAAATCAACA ACCAGAACGT TTCAAATGTT ATGAAAAGAA TATTCCTTTC AATTTCTCCA AACTCAATAA TTATGCCGTA GAGAAAGCAG AAGGTGATTA TTTACTATTC TTAAATAATG ATACGGAAGT CAAAACAGCC GACTGGTTAG AAGCAATGGT TGAGCAAGCA CAAAGAGAAA CAATAGGAGC TGTCGGAGCT TTATTATTAT ATCCAGACAA TACAATTCAA CACGCCGGAG TTGTACTAGG AATGCGAAGT GTTGCAGACC ATAGTCACCG AGGTTTTTCT CCCACCGACG CTGGATACAA AGGTCAAATT ATTTCTGTGA ATAATTATTC TGCCGTAACT GCAGCGTGTT TAATGTGTCG GCGAGAAGTA TTTGAGCAAG TGGGAGGATT TGATGAAGAG TTAGCTGTAG CATTTAATGA TGTAGATTTA TGTTTAAAAA TAATATATAA AGGTTATCGA AATATTTATT TACCTCACGC CGTTTTATAT CACCACGAGT CAAAAAGTCG GGGAGTAGAA AATACCGGAG AAAAACAACT ACGTTTCCAA CAAGAAATTC AGAATATGAA GCAAAGGTGG AAAGATTTAA TAGACGAAGA TCCTTGTTAT CATCCTCATT TAACTCGACA ACAAGAAGAC TTTAGTTTAA GAGTTCAAAC AAATGTAGAA GTTGCTCTTT CTATGTATGA TAAAGATCCA GAAATAGTTG GATGTTCTAT AGATGTACCA TCACCAGGAG TAAAGCAAAA TATTAGTTCT ATTTGTATAG GTGGGTGGGT AGTTGGAAAA AAATCTTCTC CAGTCACAGT TAAACTAATT TCTTTAACTA AATCAGGTAA AGTTTTGCGG GAAGTTCCTG CTAATTTACA TCGTCCAGAT GTAGGTAAAA TTCATCCTGA GTATCCTTAT GCTCAACATT CTGGTTTTTG GGGAGAAATA GAAGTAATTG AGATTGCACC AGAGTCGAAA ATTTCCTTAG AAGCTATTTT CAAAGATGGT TCTCATGTGC GACTGGGTAT GGTAAGTTTT AAGTGTCCTA ATTTGATTTA A
|
Protein sequence | MASINPQETA AENFHKKAEA YLAEKKFDEA IASCELAIKI EDNYFPAYKT LGNTWQAQGK LAEAENWYKK ALEIKSNWPE IYANLGSLYA MQQKWEEAIS YYQKAVDIKP DFAGAYRNMR KVWLSLGNQK LATYCQYKVL SIEPEKATFE EFMNVGKTLE KEGSLNDAIS CYRMATKLNS NSSEAYQNLG EALKEQGNLD EATVCCRKAI ELKVNGKKIE NSDDTLGAAS KYNLISSQEL NSGDTLVGSL SSNKVNGHNT LVRATIPTTS PTVNAEDIET YMVEAETYVN QKKWEQAIAV SKQVIQTKTE PKAYKIIGNS LQAMGKLQEA LDWYNKALKI KPDFGEVYAN IGTIFAQQKQ WGQAIQNYLR AIEIKPEFAG AYRNLAKIYT QVNKSQEAAE YLYQAIRLEP GKATAQDFLF TGNTLSENGK LEQAIACYQQ LISADPNSFE AYEKLGDSLL KQGQLELSLQ NYKNAQKLKP YSTEIKQKIG EIYYRYGEYF QKKEKVEEAV KAYRQAIENY PQYDIPYGKL GEVFSQQEKW EEAVKVYEKA SQIKPDNSWY YNSLGEALKK LEKWEEAVMA YRKAIQLNPD FSWSHNNLAD CLVKLGKREE AVVAYRQAIK LKPDFTWSYI NLGNTLWEIG NWQEAINPYS RALELKADLP ETYQKLGHAL KKRAELDLEE SIKWYRKAIE NDPDNEKLYH KALEVKPEDH TLYLQLGNTL VKKGKTHGAI AFYQLGLQIN PDDSEIQGQL EKILPKKNTL REKKKDNRKK NKVPIFPLPS SPVIEKNEEK KVPISPLPSS PVIEKEDTYQ LWLKENLPNS DQLKKMSENL KIFRYQPLIS IILQEKSGSP YIQKTIESLI NQIYPNWELC YISDKLPQNI EANNKIKLVI KSENRDIATD LNAALALATG DFVTLLNPED ILTPDALYEM VLFLNRYPDS DMIYSDEDKL TSEGKLIEPY FKPNWSPDSF LSRMYTGHLC IYRRELLEKL NGFRVGYESS YEYDLILRLS EVSQKIFHIP KVLYHSRIPE NGGEIDWEIS KETTKKTLTE ALARRGEMGT VYSVPNHPGF YRVRYQISEQ KLVSIIIPTR NLGNILDRCL ESIFTQTTYP NYEVIVIDNG SDEPETLSII EKWKNQQPER FKCYEKNIPF NFSKLNNYAV EKAEGDYLLF LNNDTEVKTA DWLEAMVEQA QRETIGAVGA LLLYPDNTIQ HAGVVLGMRS VADHSHRGFS PTDAGYKGQI ISVNNYSAVT AACLMCRREV FEQVGGFDEE LAVAFNDVDL CLKIIYKGYR NIYLPHAVLY HHESKSRGVE NTGEKQLRFQ QEIQNMKQRW KDLIDEDPCY HPHLTRQQED FSLRVQTNVE VALSMYDKDP EIVGCSIDVP SPGVKQNISS ICIGGWVVGK KSSPVTVKLI SLTKSGKVLR EVPANLHRPD VGKIHPEYPY AQHSGFWGEI EVIEIAPESK ISLEAIFKDG SHVRLGMVSF KCPNLI
|
| |