Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_4433 |
Symbol | |
ID | 4246086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6832866 |
End bp | 6836132 |
Gene Length | 3267 bp |
Protein Length | 1088 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 638109316 |
Product | hypothetical protein |
Protein accession | YP_723893 |
Protein GI | 113477832 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1874] Beta-galactosidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0682487 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA TCTTGTATAT ACATATAGGA ACTGGTAAAA CAGGAACCAG TTCAATACAA AATTTTTTAT CTGAGAATCA AAAAAATGGC AGGTTATCAA AGCACTTTAA TACAAGGTAT TGTTTCTCGG GTCGTCCCAA TATTCATAAT ATATTGTTGT GTAAAAATTA CCAGAGGGGA AATGAGAAGA CTCAAGAGTT GATAGATGCA AAGATTCGTG AATTGAGAAA GGAAATTTAT CAAAGTTCAG AACAGTTATT TATTATTAGT TCTGAGTATT TTCCCGGAAA CACTCAAGAT GAGATTCATG AATTAGTAAA TACTGTAAGT GATATATGTC AAGTTAAAAT TATTGTTTAT TTAAGAAGGC AAGATGACTT TTTATGGTCT TGGTATACTC AACTTGTCAA GACAACTAAT ATATTTCTTG ATATATACAA ATTAAAAGAT AAATTGTATA AAGGCCCGCA GCAGGTTTTC AACTATGAAA CAAGTTTAAG AAAATGGGAG AATATCGTTG GGAAAGAAAA CATATTAGTC AATATTTACG ATAGGAATAA TTTTAAGGAA AATTCAATAT ACTTTGATTT TATTCACAAT TTAAATATTG ACTTAGATCT GAAAAAAATG ATAGTTCCAG AAGAGGTAAA TCTAAGCCTT CAATGTGAAC AAATATGTCT GATACGAACA CTTATTCCTT ATGTGAGTGA ATGGAATGAC GAACAACTAA AAATGTTGCA AAAACCACTT AATATCCCGT TTTTGTATAC CAAAAGTCTC CTTTCCCCAA GAGAACGAAA AGAAATTTTA AATGACTTTA GACAACACAA TGAATTTGTG GCTAGACGAT ATCTTAATCA AGAAAATTTG TTTGATGAAA ATATTGAAGA TGAAGACACA TGGAGAGAAC CAAATATATT TAGTTCTGGA TATTTTGAAA AATATATTGA TTATATCAAA ACCGAAAAAA AGACGTTGTT TGAATTACTG CAACCGATAT TAAAGAATGT CAACAGAATG AAAATCAAAA CAGTGAGAAA TATGTTATTA GTTAATAACA TATTTATAGT TGATTGGACG GAGAAGGCGG GTTGCACCAT AGTATGTAAA ATGTTTTTTG ATGTTATGGG GATACTCCAA GAAGCCTTAG ACTATAATAG TTGGATACAT AATTATCGAC AAGATAAATT TTATGAAAAA TTTGGTAAAG TAACGGAGGA TATGCTTTCT GGGGACAAGT TTGTGAAAAT GAAATTTGTC AGAAATCCAT ATACCCGTGC AGTTAGTTCT TATTTTGCTG CAAATGAAAA CAATAAACAA TATTTAAAAG GATATGAACA TCTTGATTTG TCATTTTATG ATTTCTTGCT TTTAGTTAAG CAGAAAAAAA TAGTTAATGA TCATTGGAGA ACTCAATATC AAGAAATTGA ATCCAGATTT AATTTTGATG AAATAATTAA AATTGAAAAT ATTGAACAAG AGGTAAAAAG ACTGAATCAA AAATATAGTC TTAATTTACA ACTCCATACA CATTCTAGCC ACCATCATAA AAAAGATAAA AATAAAAGTG AATTTGTTGG CAGAAAAAAG TATAGTGAGT TGGAAATTTC TGGTACTATT CCAGATTATA GAAACTTCTA TGATGATGAA ACTAAGTTAC TTGTGAGTGA AATTTATGAA AAAGATCTTA AAACTTATAA GTATACATTC GATCTGGAAA AAAATCAGGA AAGAATTAAA GCAGAGAATA CGGATAAAGA TAAAAAAATG AAACACAATA TCAATTACAT TACAATTTGG GAGAACGAAT GGAATTCGGT TTATGCTGAC GAATTTCTAA TTAAAGCTGC CAATTCTGAT ATTAATACTG TTGCCTTTGA CATGCGGTGG CACAAGCACG AAACAAATGA AGGAAATTTT GATTTTTCTC ATACTCAAGA CAGATGCGAT AAACTCATTG AGCATGGGTT CAAACTTATG CCATTAATTT CTATCTTTTA TTGTCCGAAT TGGGTTCATG AAAAATATCC TGATATAGTT GAGTTGAACG AAACCTTCGG TGCCATAGGC TCGGGTCATA AAGGTGTAAG TTCTGCATAC GAAAAGTCAT TGCCCTTGGC ACTCAATTTT ATAGATAAAT GCATTGATGC ACTGGAATGC TATAGACAGG ATATAACAGC AATAAGTGTT TCGTGGAATA ATGAACACGA AACAAAATTT ACTCAAACAC ATGATTTGTT TAGACCCTAT GAAAAGTCTG CTCAAGAAAA ATTCAGATTA TTTATTGAGA AGAAAAATAG TTCAATAAAA TATTGGAACG ACAGGTGGAA TACTTCCTTT GATTGTTTTG CAGAAGTAAC TTTACCCACA TTACGATGTG AGAACAAAAA ACAAATTGAG CAATTTCAAC TCCAGGGGCA ATTTATATTT GATTTTTACG CATTTAGAAG ATTTTTGCTT GTTAACATAT ATCAAGGTTG TTGTGAACGA ATAAAATCTC ATCAGTATAA AACATGGCTG CATTTTGGTG AAATATTTAC CTGTATTGAT GCAATATATC AAGGTGATGT TGTGTTTGAT ATGATCACAG AAAATTGGTT GGATATCGTG GTTATAGATT CAAATTTATC TAAGATAGGG GTTGAAACAA ATGACCCGTT CGTTTCTTAT GTAATTGTTT CTGCTTGTCA GCAATACGAT AAACCAGTGA TATTTGAAGT TGCGGTGGAG CGCGATAAAT CAATTGAGGT GTATAAAGAA AGTATTCTTT GGGCACAAAA AAAGAATGTA GACGGGCTAG GACATACAAA TTTTATGAAC CGGAAAAAAT TTCATTGTCT TTTTCAGAAT TCATCTCTTG CCGCAGGTAT GGATGGTGTT GAGTCAACAA AACATTTGCT TTTGATACAT CCTATTCGTG GATGTGGTGT ATTGAGGCAA AGACAACATA TTGTGGGAGC AATATCAGTA GATCCTGTTC AAGATTATTT GATGGGTGAA GTTAAGCACT GGTATGACAG AGGATTTAAT GTAGATATTA TCGGAGATCC TTTACTATTA AATAAGATTA ATCAAGATGT TTATAATGAA ATAGTTTATT TGGAACCATT GGCGTTATTG TCAGTTGACA AGAAGCATAT TTCAGATTTT GTGGAAAGGT TGCCGAGTGA CAAACCGTAT TATCATAAGA AGTTGGATCG AGCATTCTTT GAACATCAAA CCGGGTTTGA TGGTGAACCT TTGGAATTTT TATCTTTTGA AGAATAA
|
Protein sequence | MKKILYIHIG TGKTGTSSIQ NFLSENQKNG RLSKHFNTRY CFSGRPNIHN ILLCKNYQRG NEKTQELIDA KIRELRKEIY QSSEQLFIIS SEYFPGNTQD EIHELVNTVS DICQVKIIVY LRRQDDFLWS WYTQLVKTTN IFLDIYKLKD KLYKGPQQVF NYETSLRKWE NIVGKENILV NIYDRNNFKE NSIYFDFIHN LNIDLDLKKM IVPEEVNLSL QCEQICLIRT LIPYVSEWND EQLKMLQKPL NIPFLYTKSL LSPRERKEIL NDFRQHNEFV ARRYLNQENL FDENIEDEDT WREPNIFSSG YFEKYIDYIK TEKKTLFELL QPILKNVNRM KIKTVRNMLL VNNIFIVDWT EKAGCTIVCK MFFDVMGILQ EALDYNSWIH NYRQDKFYEK FGKVTEDMLS GDKFVKMKFV RNPYTRAVSS YFAANENNKQ YLKGYEHLDL SFYDFLLLVK QKKIVNDHWR TQYQEIESRF NFDEIIKIEN IEQEVKRLNQ KYSLNLQLHT HSSHHHKKDK NKSEFVGRKK YSELEISGTI PDYRNFYDDE TKLLVSEIYE KDLKTYKYTF DLEKNQERIK AENTDKDKKM KHNINYITIW ENEWNSVYAD EFLIKAANSD INTVAFDMRW HKHETNEGNF DFSHTQDRCD KLIEHGFKLM PLISIFYCPN WVHEKYPDIV ELNETFGAIG SGHKGVSSAY EKSLPLALNF IDKCIDALEC YRQDITAISV SWNNEHETKF TQTHDLFRPY EKSAQEKFRL FIEKKNSSIK YWNDRWNTSF DCFAEVTLPT LRCENKKQIE QFQLQGQFIF DFYAFRRFLL VNIYQGCCER IKSHQYKTWL HFGEIFTCID AIYQGDVVFD MITENWLDIV VIDSNLSKIG VETNDPFVSY VIVSACQQYD KPVIFEVAVE RDKSIEVYKE SILWAQKKNV DGLGHTNFMN RKKFHCLFQN SSLAAGMDGV ESTKHLLLIH PIRGCGVLRQ RQHIVGAISV DPVQDYLMGE VKHWYDRGFN VDIIGDPLLL NKINQDVYNE IVYLEPLALL SVDKKHISDF VERLPSDKPY YHKKLDRAFF EHQTGFDGEP LEFLSFEE
|
| |