Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_0380 |
Symbol | |
ID | 4241614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 588749 |
End bp | 591631 |
Gene Length | 2883 bp |
Protein Length | 960 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 638105707 |
Product | tetratricopeptide region |
Protein accession | YP_720321 |
Protein GI | 113474260 |
COG category | [S] Function unknown |
COG ID | [COG4995] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCCA AAAATACCCC AAATAATTGG AAATTCCTTA AAACACTAAT TAACCAACGG CATTTCCTCC AAAGTCTGCT GTTGTTAATT TTATTTTTCA TAGGCCTAGC AATTCCCCCA GTGGTTGCTC AGATGAGCCC CCAAACTCCC ATATTCCAAA TTCAACCTGA TAGCTTCAGC CTGGAAAAAC AGGCTAGAAG CCTCTATCAA ACCAGAAACT TTCCTGAAGC CGCCAGATTT TGGGAACAAG CGGTAGTCGC CTTTGCTAAA CAAGGGGATC AACTCAACCA TGCCATGGCA CTAAGCAACC TTTCCCTAAC TCGTCAACAA CTTCGAGAGT GGGAATTAGC ACAAAAGGCT ATTGACGAAA GCCTAAATAT CCTACAAACC CTAGAAAAAA CACCAGAAAC ACAGCGCATC CTCGCTCAAA CCCTGGATAT TCAAGGAAAG CAGTTGCGGG AGGTAGAAAA GCCGGAAAAA GCTCTAGAAA CTTGGCAACA AGCTGCTGAC CTGTATCGGG AAATTGACAG ACCAGAAGCA GCCGCCCAGA ATGAGTTTAA CCAAATTCAG GTACTCCAAG ACTTAGGACT TTATCCCCGC GCTTGCAAGA GTTTATTATC AATAATAGAG CTGAATGTTC AAAATTGTCA AGCATTAAGG AAATTAACCC CAGAAGATTT AAAGCAACAA CTCCAAGTAT TTGCTCAACG CCCCGCTTCC TTACTCGAAG TCCAGAAATT GCGAAATCTC GGTGATGTAC TGCGAGTTTT AGGTCAGCCA ACTAACTCGG AAGTTCTATT AGAAGCTAGT TTGGAGGGAG CTAAACAATT AGAACCTTTC CCAAGCAAAA ACGCCGAAAT TGCTGCTATC TATCTCAGCT TAGGCAATAC CGCTCGTGTC CAAGGCAAAA ATGACAATGC TAGAGAAGCT TTAACATTCT ACCAAGAGGC AGTTAAAGCA GCTACTGAGT CAGGAACAGC TACCCTCATA ATTCAAGCTC AACTCAATGA ACTGAGTCTG TTGGTGAAAA AGCAGTCCTG GCCTCAAGTG CCAGATTTGG TGTCTCAAAT TGAACCCCAA TTAAATAATT TGCCTCCTAG TCGCGCCGCT ATTAATGCCC GACTTAATTT CGCCCAAAGC CTATTCTGCT GGAAAGAGCC AACCCTCAGT CAGGAAGAAC GTCAACTCTC ATCTCCTATT ATTGAGCAGT GCAGTTTAGC CAGGGATAGA GAGAAAAATA ATCAACTTCA GCCCTCAGAT GTTCCGAAAT GGGAAGATAT TGCCGAAATA GTTACTACCG CTGTCAAGCA GTCTCAAACC TTGGGCAACA AGAGAGCAGA GGCTTATGCT CTCGGTTATC AGGGCAGCGT TGAGCAACAA ATGGAAAAAT TTTCAGAAGC TCAAGATTTA ACCATAAAAG CTCTGAACAT ATCATCATCT TTCCAGTTGC CAGACATTGC CTATCTCTGG CAGTGGCAAC TGGGGCGCTT GCGGGAAATT CAGGGAGAGG AAGATGACGC GATCGCTGCT TATAATACCG CCTTTGCCAC CCTTCAGTCC CTGCGGGGAG ACTTAGTTTC TATTGATCAG GAAGTTCAAT TTACTTTTCG CGACAGTGTA GAACCTGTCT ATCGAGAATA TGTCGATTTA CTCTTGCGAG GGGAAGATAT CTCTCAAGAC AACCTTAAAC GAGCTCGTGA GGTGATTGAA GCCCTGCAAC TAGCTGAACT GAATGACTTT TTTGGCAATG CTTGTTTGGA AGCAAAACCC AAGAAAATTG ACACAGTGAT TAAAGAAACT TCCTCCCCAA CAGCATTCTT CTACGCAGTT ATTTTGAAAG ACCGTTTAGA AGTAATTTTG GCTTTATCAG GAGGCAAAGA ATTACAACAT TACCATACTA ACAAATCTCA AGATGAAGTT AAAGCAATCA AAAAAACAAT AAAAGACCTA CGCACATTTC TCTCCAATAA GACGACGGCT TTGGAAGATG TGAAAAAACA ATCCCATAAA ATATATGATT GGCTAATCAA GAAAGCTCAA AAACAACTAG AAACAAATGG GATCCAGACC TTGGTTTTTG TTCTAGATAG TCCATTACGT AATATTCCTA TGGCAGTTCT ATACAATATC GAAACTGAGC GATATTTGGT AGAAGACTAC GCCATCGCCT TCACCCCAGG CCTGCAACTT TTGAGTCCTC AACCTGTCAA ACAATTTCGC TTAAATGGGT TAACAGGAGG TGTGAGCGAA GAGCGAGAAA CTGAGTCAAT AAATTTTGGT AGAACGAAAC CACAAGATTT TACTGAAATT CCCTTTGTCA AGGAGGAATT AAAAAAAATT CGCTCAGTAA TATCTAGTTC AACAGAAACT CTATTGAATG AGAAATTTCT TAAAGAGCGA CTGCAAAATG AACTCAATTC AGCTAACTTC AACACCGTTC ACCTATCAAC TCACGGCAAA TTTAGTTCTA ATATAGAAGA CACCTATCTT TTAGCTTGGA ATGAACTACT CAAAATGGAA GATTTAGAAA ACTTGTTCCA AATTAAACTG GCTAATCAGT CAATTCCCAT TGAATTACTC GTCCTGAGTG CTTGCCAAAC AGCCAAAGGC GATGAGCGAG CAATATTGGG AATGGCTGGG GTAGCAGTGA AAGCGGGTGC CCGTAGCACA CTAGCCACCT TGTGGCCAGT CTTTGATGAA TCCACGGCCG AATTCATGTT CTTATTTTAT CAGCAGTTAA TCCAGAACCA GGCTCAAAAT ATGACCAAAG CTGAAGCACT TCGTCAAGCT CAACTAAAAT TATGGGCTCA GAAAAAACCA GGAAAACGGT GGAATCACCC CTATTATTGG GCTCCCTTTA TTTTGCTAGG TAATTGGCTA TAA
|
Protein sequence | MKPKNTPNNW KFLKTLINQR HFLQSLLLLI LFFIGLAIPP VVAQMSPQTP IFQIQPDSFS LEKQARSLYQ TRNFPEAARF WEQAVVAFAK QGDQLNHAMA LSNLSLTRQQ LREWELAQKA IDESLNILQT LEKTPETQRI LAQTLDIQGK QLREVEKPEK ALETWQQAAD LYREIDRPEA AAQNEFNQIQ VLQDLGLYPR ACKSLLSIIE LNVQNCQALR KLTPEDLKQQ LQVFAQRPAS LLEVQKLRNL GDVLRVLGQP TNSEVLLEAS LEGAKQLEPF PSKNAEIAAI YLSLGNTARV QGKNDNAREA LTFYQEAVKA ATESGTATLI IQAQLNELSL LVKKQSWPQV PDLVSQIEPQ LNNLPPSRAA INARLNFAQS LFCWKEPTLS QEERQLSSPI IEQCSLARDR EKNNQLQPSD VPKWEDIAEI VTTAVKQSQT LGNKRAEAYA LGYQGSVEQQ MEKFSEAQDL TIKALNISSS FQLPDIAYLW QWQLGRLREI QGEEDDAIAA YNTAFATLQS LRGDLVSIDQ EVQFTFRDSV EPVYREYVDL LLRGEDISQD NLKRAREVIE ALQLAELNDF FGNACLEAKP KKIDTVIKET SSPTAFFYAV ILKDRLEVIL ALSGGKELQH YHTNKSQDEV KAIKKTIKDL RTFLSNKTTA LEDVKKQSHK IYDWLIKKAQ KQLETNGIQT LVFVLDSPLR NIPMAVLYNI ETERYLVEDY AIAFTPGLQL LSPQPVKQFR LNGLTGGVSE ERETESINFG RTKPQDFTEI PFVKEELKKI RSVISSSTET LLNEKFLKER LQNELNSANF NTVHLSTHGK FSSNIEDTYL LAWNELLKME DLENLFQIKL ANQSIPIELL VLSACQTAKG DERAILGMAG VAVKAGARST LATLWPVFDE STAEFMFLFY QQLIQNQAQN MTKAEALRQA QLKLWAQKKP GKRWNHPYYW APFILLGNWL
|
| |