Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4771 |
Symbol | |
ID | 4246425 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 7329873 |
End bp | 7332980 |
Gene Length | 3108 bp |
Protein Length | 1035 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638109621 |
Product | glycosyl transferase family protein |
Protein accession | YP_724197 |
Protein GI | 113478136 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0463] Glycosyltransferases involved in cell wall biogenesis |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.201139 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCCAA AAGTAAGCGT AATTATTCCT GTATATAACT GCGAACTCTA CATCGCTCAA GCAATTGAAA GTGTCTTGAA TCAAACTTAT ACTGACTATG AAATTATTGT TATAAATGAC GGTTCTACAG ATAATACCCA TCAGGTATTA CAGCCATACA TGAAAAAAAT TCGTTATTTT TATCAAGAGA ATAAAGGATT ATCTGCTACT CGTAACCAAG GGATAAAAAT GGCAAAAGGA GAATTAATTG CTCTCCTGGA TGCTGATGAT TTATTTCTCT GTTATAAACT TCAAGAACAA GTAGCTATTT TTGATGCTCA ACCAAACATA GGTTTAGTGC AAAGTGGGTG GCGAGTTGTG AATGAAAAAG GAGAAAAAAT TGAAGATATT GAACCTTGGT ATAAGTCTCC AAAATTAGAC TTAGTAAGTT GGTTAAAATG GAAAGCTACT AATCCTAGTG CTATGATGTT TCGGAAAGAA TGGTTGGAAA AAGTAAATGG GTTTAATGAA AATTTACGCA GACTAGAAGA TTTTGATATT GTAATTCGAT TAGCTTTAGC AAGTTGTCAA GCTACTTGGT TTCCCAAAGT TGCTGTCTGT TATCGTCAAC ATAGCGGTAA TATGACTCGG AATTTACTTG CTCAAACAGA AGTAGAAGAA AAAATTTTAG ATGAACTTTT TTCTAATCCT AATCTCCCTG AAAAAATTCA ACATTTAGAA AGAGAATTAC GCTACGGTTC TCTAATTTGG AATTGTTGGT GTTTATATAA ATCTGGCTAC TTCGCTGAAA TGGCTGATTA TTTGCATAAA TCTTTAAAAT TTTCTCCTGT TTCTGTACTT AAAACTATAT GTGAGTGGCT ATATTATTTT CACTATTATG CTTCCTTAGA AAACTATTCA TTTAATTCAG TTTCTTTGAA TAAAATACCT GAGTGGCAAC TTTTAATAAA TCAAATCATG CCAAATGGTT TACCAGTAGT TAGCGTAGTT ATTCCGACTT TCAATAATGC TGATTATATT CTTAAAGCTA TCGAAAGTGT TTGTAATCAA ACCTATACTT TGTGGGAAAT TATTATTATT GATGATGGTT CTACCGATAA TACTTATCAA GTTTTAGAAT CTTATCTAAA TACAACAGAC TTCTTGCAGA ACTCCGCCGC AAGAAATAGA CAACAGAAAA CAAAAATAAA TACTCAGGAA AAAGAGAATA ATCAAGATTT AATTTTAGAT TTATTCCAGA ATTCTGCCAA AGGAAATAGA CAACAAAAAA TCGAAAAAAA TACTCAGGAA AAAGGCAGTA GTCAAGATTT AACTTTGACT TTTGCAAGAT GTCTAATTAA ATATCTATAT CAAGAAAATC AAGGACCATC AATTGCTCGG AATTATGGTA TAGAAATAGC TCAGGGAGAA TATATTGCTT TTCTCGATGC TGATGACTTT TTTCTACCCG AAAAATTAGC TGAACAAGTG GGATGTTTTG CAGAAGATTC AACTTTAACA ATGGTACAAA CTGGGTGGAG ATTTGTTGAT GAGAAAGGTA ATACTATTAA AGATGTAGAA CCCTGGGAGA ATAGTACGGA ATTAAACTTA GAAAATTGGT TGATTTGGCA AGCAACTTTG CCTAGTGCCA TGATGTTTAA AAGTAAATGG TTAAAAGTTA ATGGTGGGTT TGACAAACGA TATTTTGGGA TTGAGGATTT AGAAATAGTT TTGCGGTTAG CTTTAACCGG AGGAAAAGCT ACTTGGTTAC GGAAAGTTTG TGTCTGTTAT CGACAGCGGA ATAGTAGTGT CTCGGGTTTA AAAAACCGAG AAAAAATTAC TCAGGAATTA GAAAAATTAT TAGATGACTT TTTCCATAAA CCCAATTTAC CTACTGCCAT ACGTCAAATA GAAAATCAGG TAAAATATCG CTCTCTACTT TGGAATTGTT GGCGTTTATA TAAAGCTGGT AATTTATCAG AAATGACTAC TTATTTGCAG AAGTCTTTGA GTTATACTTC TGATTCTCCC ACAGTAGCAA TTTGTACTTG GATAGAATTA TTTACGAAGC TAGATAGTCA AGAAGGAAAT CAATTTAATG CTTATAGTTT TAGTCAGTTA CCAGAATGGA AAAAATTAAT TAGTAATATT TTTAATCCCG TTGTGCCAAG AGTTAGTGTA ATTATTGCAA CCTACAATAA TGCTCACTAT ATTTTGGAGG CGATCGCCAG TATTTTTAAT CAAACTTATA CCTCTTACGA AATAATAGTT ATTGATGATG GTTCTACTGA TAATACTCGT CAAGTTTTAG AACCTTATCT AGATAAAATT TGCTATGTTT ATCAAGAAAA TAAAGGAGTT TCCCACGCCA GAAATTTAGG CTTAGAAATA GCCCAAGGAG AATTTATTTC TTTTCTAGAT GCTGATGACT TTTTCTTACC AGATAAATTA GCAAAACAAG TAGCTGTTTT TGATGCTCAC CCCTCTCTGG GAATTGTCCA TAGTGGATGG CGTTTAGTTA ATAAAAAAGG AGAGAAAATT TCTGATATAG AGTTATGGCA TAGTTCCCCA GAATTAGACT TAGAAACCTG GGTAGTTTGG AAACCTGTAA CTATTTCTAT GATGTTTAGT AAAAGTTGGA TAAAAAGTGT CGGAGGTTTT GATACTCGTT GGCATCATGG AGAAGATATT GATTTAGTTT TGCGTTTATC TGTGGATGGT TGTGAAGCAA TGTGGTTGCC AAAAGTTACT TATTGTTATC GTCAACATCA TTGTAATGCA ACGAGAAAAT CTACTCAACA AGCTGCTAGC ATGATGTCTG TTTTAGATAG CTTTTTTAGT AGACTAAATT TACCAATATC TATTCGTAAA TTAGAGAGCC AATCTTATTA TTATACTTTG AGTTGGCTAG GATGGCAAGC CCATCGAAAT GGGAGTTTAA TTGAGATGGA TAAATATTTA CGACAAGCTT GGAAATATAC TCCTTTCTCT ATGACAGAAA CTATCGATTA TTGGGTCAAT AGTTTTAATA ATCTTTATCA ACAATATGGT TATGATTTTA ATGCTTATGC TTTGACTAAT TCAGATGTTT GGCAACAATT AATGTTAGAC ATGGTTATTA CTCTTTGA
|
Protein sequence | MNPKVSVIIP VYNCELYIAQ AIESVLNQTY TDYEIIVIND GSTDNTHQVL QPYMKKIRYF YQENKGLSAT RNQGIKMAKG ELIALLDADD LFLCYKLQEQ VAIFDAQPNI GLVQSGWRVV NEKGEKIEDI EPWYKSPKLD LVSWLKWKAT NPSAMMFRKE WLEKVNGFNE NLRRLEDFDI VIRLALASCQ ATWFPKVAVC YRQHSGNMTR NLLAQTEVEE KILDELFSNP NLPEKIQHLE RELRYGSLIW NCWCLYKSGY FAEMADYLHK SLKFSPVSVL KTICEWLYYF HYYASLENYS FNSVSLNKIP EWQLLINQIM PNGLPVVSVV IPTFNNADYI LKAIESVCNQ TYTLWEIIII DDGSTDNTYQ VLESYLNTTD FLQNSAARNR QQKTKINTQE KENNQDLILD LFQNSAKGNR QQKIEKNTQE KGSSQDLTLT FARCLIKYLY QENQGPSIAR NYGIEIAQGE YIAFLDADDF FLPEKLAEQV GCFAEDSTLT MVQTGWRFVD EKGNTIKDVE PWENSTELNL ENWLIWQATL PSAMMFKSKW LKVNGGFDKR YFGIEDLEIV LRLALTGGKA TWLRKVCVCY RQRNSSVSGL KNREKITQEL EKLLDDFFHK PNLPTAIRQI ENQVKYRSLL WNCWRLYKAG NLSEMTTYLQ KSLSYTSDSP TVAICTWIEL FTKLDSQEGN QFNAYSFSQL PEWKKLISNI FNPVVPRVSV IIATYNNAHY ILEAIASIFN QTYTSYEIIV IDDGSTDNTR QVLEPYLDKI CYVYQENKGV SHARNLGLEI AQGEFISFLD ADDFFLPDKL AKQVAVFDAH PSLGIVHSGW RLVNKKGEKI SDIELWHSSP ELDLETWVVW KPVTISMMFS KSWIKSVGGF DTRWHHGEDI DLVLRLSVDG CEAMWLPKVT YCYRQHHCNA TRKSTQQAAS MMSVLDSFFS RLNLPISIRK LESQSYYYTL SWLGWQAHRN GSLIEMDKYL RQAWKYTPFS MTETIDYWVN SFNNLYQQYG YDFNAYALTN SDVWQQLMLD MVITL
|
| |