Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1543 |
Symbol | |
ID | 4242022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2343203 |
End bp | 2347798 |
Gene Length | 4596 bp |
Protein Length | 1531 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 638106686 |
Product | protein splicing site |
Protein accession | YP_721296 |
Protein GI | 113475235 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG0553] Superfamily II DNA/RNA helicases, SNF2 family |
TIGRFAM ID | [TIGR01443] intein C-terminal splicing region [TIGR01445] intein N-terminal splicing region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.706415 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATTT TACATAGTTT TTGGGTGACA GAACATGAAC AACCTAGTTT TTTATTTATT TGGGGAGAAG CTTGGCACAG AGTTACAGAA GAGGATGCTG GTGAAGTAGA AAAAATTACT AATAATCCTT ATTCAATAAC TTTAGATGAA CTATTAAAGT TATCTGAGGC AAATATATAT TTATCTTTAG AAAAGCACAA AAAATCCACA AATCAAACTT TAGGTATCCC CACAAAGTTA TCAGAATCTA ACCAAAAATT ATACCCAATT CACTCTGCTA TAAGTTTATC AGAAATTCCA GAAAATTTAT ATATTTATCC CTGGAAAATA GAAGGAATTT GTCTAGAACC TGATGAAGCA ATTAAATTTT TACAATCTAT TCCTCTAGGA CAAACTACTG AATCTTTCAT CGGGTCAGAT TTAAAGTTTT GGTCTCATCT TGCTCGATGG AGTCTTGACT TATTAGCAAG ATGTAAGTTT TTACCTTCTA TTGAACAAAG ATCTATTACA CAAGAATTTA TTACAACTTG GCAACCTTTA ATTGATAGTT CTATAGACCA AACAAGATTG AAAAATTTTG CCCAACAAAT GCCTCTTGTT TGTCGTACAT ATAATCTTGA TTGGCAATTA CATCCTTCTT TAGAAAGTTT ACAAATAATT ACTAAAAATA ATAGTTACTC AAATTTACCT CGACTTCAAT CAAGTCAAGA AATTATACAG GATTTCTTGA AAAATACAAT AGATAAACAG ATTAGACAAC TATCAGCTGA AATATCTCTA ACAGAAACTA CCTCATTAAA TTCTTCAATT AGACAATGGT TGAAGTCTTT ATCAGGAAAA TTAAGTTTAA AATTACCTGC TCCTCAAGAA ACAAAAAAAA TTCAAAAAAT ACTTGATAAT TGGAAGTCAC CTCTACAAGA ATATCAAGCT ATAGAAAATA AGTTTGTTGC CTGTTTTTGT CTCCACTCAC CATCAAATAA TTCCCAACAA TGGAAGCTAG AATATTGTCT CCAAGGACTT GATAACCCAG ATTTTTTGGT TGATGCTAAA ACTATTTGGG AAAATCCTGT GGCAAGTTTA AATTATCAAG GTAAAACAAT TAAACTGCCT CAAGAAACTT TGCTTAAAGG TTTGGGTTTA GCATCAAGAA TTTATCCAAT TATTGAGCCT AGTTTACAAG AAGCAACTCC CCAATATTGC TTACTAACTT CACAACAAGC TTATGATTTT ATTAAAAGTG GAAGTGGGCG GTTTATTGAT AGTGGACTGG GGGTAATTTT GCCTCCTAGT TTAGCTAACC GAGAAGGGTG GGCTTCTCGT TTGGGCTTAA GTATTCAGGC TACAGCTCCA AAAATGAAAA AAACTGAAAA ACTTGGGTTG AAAAGTCTTC TCAATTTTAA GTGGCAGTTG TCTATTGGAG GTCATAAATT AACTAAAGCT GAATTTGAAA AATTAGTTTC TCAAGATAGT CCTTTAGTAG AAGTTAATGG TGAGTGGGTG GAGTTACAAG GTCAAGATGT CCGAGCTGCT AAAAATTTCT TTGCCTCTCG TAAAGACCAA ATGAGTTTAT CTTTGGAAGA TGCTTTGCGC CTGGCGACTG GGGATACTCA GACTGTGGAA AAGTTGCCTG TTGTTAATTT TGAAGCTGGG GGTCAATTTC AGGAACTTTT AGATACTTTA ACTAATAATC GTTCTTTAGA GGAAGTTTCT ACTCCTGAGA ATTTTCGGGG AGAACTGCGC AATTATCAAG CACGAGGGGT AAGTTGGTTG AGTTTTCTGG AACGTTGGGG TTTGGGTGCT TGTTTGGCGG ACGATATGGG TTTGGGTAAG TGTGTGTTGC ACGATACAGA AATTTATGTT AATGGTATGG TAATGGAAGC AGAACAAATT TGGCAAGCTT ATGCAGGTGA GGCTGAATTT GATGGGGAAG GGTTTTGGAC AGAGCCTAAT AAGGAGTTAT TGGTAAATTC TTTAGATGAG ACAACTGGTA AAATTGTCTT TGCTCGGATC CGACGGTTAT ATCGCCAGTG GGTACGGGAA AAGTTACGGA AGGTTAGGTT GAAGGATGGT AGTAGTATTA CTATTACTTG TAGGCATAAA TTATTTATTC GCGATAGTTG GAAAAATGAT TTTCAGGTGG GAGATGATGT TTGTGTACCT GCAAAGCTGA TGTGGGATGG GAAACCAGAA GATCCAGATG TGGTTAAGTT TGTTGCTTGG CAAGTAGCTG AAGGATGGGA AAGGGTTAAT TCTGGAATGT TTGGTGTTTC TCAGAAGGGA AAGGATGTTT TGGAAGGTTT ACTTGAAGTT TTTAGCAGAC TTGGAAAACG ATATGATATT AAAATTAATT GCCCTAAAGT TGTTGCTCAT GGTAGCAAGA AAAATTGTTA TGAATTTAGT GCGCATAGTT TAGAGTATCG GAAATTTTTG GAAGAAAAAA GATATGGCTG GGGGAAGCGA TCGCATGAAA AAACTATCCC ACTATTTATA ATGCAGGCTG ACTTAGACAG CGTTAGAGTT TTTCTGAGCA ACTACTTTGA TGCAGAGGGT TGGGTAAACA AAACTGTAAG ATGTGTCGAA ATTTCTACAG CATCATCCCA ACTTATTCAA GAATTATCTA TCCTTCTCAG ACGCTTTGGG GTTTGGATGA AAATTTCACC TCAACAGAAA TGCGCTACTA ATGGTACTGG AGTTTTTTGT ACCTATTATA TTGGTACATT TGGTGGTAAT TCGGCACGCT GTTTTTTACA AGAAATAGGA TTTAATGATT CAGGAAAACA AGAAAATCTT AAGTCAATTT GTGAAAAAAT TGCTGACTCT AATGTTGAAG GAATTCCTGC TTCTGATATA GTTGCTGAGT TAGTAGAAAA AACTCAACTA CCAGTGGGAA GTTTAGGAAT TCAGGATCCA ATTTATATGG ATGGTTGCCA AGATTTTTCT CCTACTAGTT TAGAAAAAGT TATTAACAGT ATTGAGGATA TTATTAGTGG AGCTGGGGAA GAGGAGTATG GTCAACTGAA ATCTTCAAAA TTGAGGAATA AAACTTTAGA GGCTTATTCT TTACTGAATA TTTTAGAGCT AGAAATTTAT AAAACTAGGC TGCAAAAATT ACTCAACCAA GAGGTCTATT ATTGTCAGAT AGAATCTATC GAAGAGATGG AATATGAGGG ATGGGTTTAT GATTTTGAAG TGAGCAAATA TCATAATTTT GTTGCTAATA ATATTATTTG TCATAATACT ATTCAAACTA TAGCTTTTCT ACTGAAACAA CAAGAGCAAA AAGCACTTAA AGGACCTACG TTATTAGTTT GTCCTACTTC TGTATTAGGA AATTGGGAAA GAGAGGTCAA AAAGTTCGGA CCGACTTTAA AAGCAATAGT ACATCATGGG GATAAACGTG CTAAAGGTAA AGGGTTTGCT ACAGCAGTAA AAGATACAAA TTTAGTCATA ACAAGTTATG CTTTATTGCA TCGAGATGAG AAAATATTGG AGACAATAAA ATGGCAGTCA GTGATAGTTG ATGAAGCCCA AAATATTAAA AATCCAGAGG CAAAACAATC TCAAGCAGCT AGAAAGTTAG ATGCATCATT TAGAATTGCT TTAACAGGAA CTCCTGTAGA AAATCGCCTT TCAGAATTAT GGTCAATTTT GGATTTTTTG AATCCTGGAT ATTTAGGTCA AAAGCAATTT TTCCAGCGTC GGTTTGCTAT TCCTATTGAG AAATATGGCG ATACAAGTTC GTTGCAAATT TTGCGTTCTC TTGTTCAACC ATTTATTCTC CGTCGCCTTA AAACTGATAA AGATATTATT CAAGATTTAC CAGAAAAACA GGAAAATACT ATTTTTTGCC CACTGGCAAA TGAACAAGCA TTGCTTTATC AAAATATAGT TGAAAATTCT TTGGCAGAAA TTGATACTGT TGGTGGTATT CAGCGAAAAG GAAAAATTTT AGCTTTGTTA ATAAAACTGA AACAGTTGTG TAATCATCCT GTACTTTTAC AAATCAAAAA AGGTAGTAGG AAAAAGGTAG AAATTACTGA TAAAAATTCA GGGAAATTAC AACGTTTAGG GGCAATGTTG GAAGAAATAA TTTCTGAGGA AGAACGGGCA ATTATTTTTA CACAATTTGC TGAATGGGGA AAGGTTTTAC AACCTTATTT GCAGAAAAGT TTAGGTAGGG AAGTTTCTTT TTTATATGGT TCTACTCAGA GGAGTAAAAG GGAGGAAATG ATTGATCAAT TTCAACTGGA TCCTCAAGGT CCTCCTGTCA TGATTTTGTC GTTAAAAGCT GGAGGTACTG GTTTAAATTT GACTCGGGCT AATCATGTTT TTCATTTTGA TAGATGGTGG AATCCTGCAG TGGAAAATCA GGCTACGGAT CGGGTATTTA GAATTGGTCA AACTCGAAAT GTTCAGGTGC ATAAGTTTGT TTGTACTGGA ACTTTGGAGG AGAAAATACA TGATTTAATT GAAAGTAAAA AGGAGTTGGC TGAGCAGGTT GTAGGTGCGG GTGAAAAATG GTTGACTGAA CTTGATACGG ATCAGTTGAG AAATTTATTA ATACTTGACA GAAATCAAGT GATTAAAGAG GAATAA
|
Protein sequence | MAILHSFWVT EHEQPSFLFI WGEAWHRVTE EDAGEVEKIT NNPYSITLDE LLKLSEANIY LSLEKHKKST NQTLGIPTKL SESNQKLYPI HSAISLSEIP ENLYIYPWKI EGICLEPDEA IKFLQSIPLG QTTESFIGSD LKFWSHLARW SLDLLARCKF LPSIEQRSIT QEFITTWQPL IDSSIDQTRL KNFAQQMPLV CRTYNLDWQL HPSLESLQII TKNNSYSNLP RLQSSQEIIQ DFLKNTIDKQ IRQLSAEISL TETTSLNSSI RQWLKSLSGK LSLKLPAPQE TKKIQKILDN WKSPLQEYQA IENKFVACFC LHSPSNNSQQ WKLEYCLQGL DNPDFLVDAK TIWENPVASL NYQGKTIKLP QETLLKGLGL ASRIYPIIEP SLQEATPQYC LLTSQQAYDF IKSGSGRFID SGLGVILPPS LANREGWASR LGLSIQATAP KMKKTEKLGL KSLLNFKWQL SIGGHKLTKA EFEKLVSQDS PLVEVNGEWV ELQGQDVRAA KNFFASRKDQ MSLSLEDALR LATGDTQTVE KLPVVNFEAG GQFQELLDTL TNNRSLEEVS TPENFRGELR NYQARGVSWL SFLERWGLGA CLADDMGLGK CVLHDTEIYV NGMVMEAEQI WQAYAGEAEF DGEGFWTEPN KELLVNSLDE TTGKIVFARI RRLYRQWVRE KLRKVRLKDG SSITITCRHK LFIRDSWKND FQVGDDVCVP AKLMWDGKPE DPDVVKFVAW QVAEGWERVN SGMFGVSQKG KDVLEGLLEV FSRLGKRYDI KINCPKVVAH GSKKNCYEFS AHSLEYRKFL EEKRYGWGKR SHEKTIPLFI MQADLDSVRV FLSNYFDAEG WVNKTVRCVE ISTASSQLIQ ELSILLRRFG VWMKISPQQK CATNGTGVFC TYYIGTFGGN SARCFLQEIG FNDSGKQENL KSICEKIADS NVEGIPASDI VAELVEKTQL PVGSLGIQDP IYMDGCQDFS PTSLEKVINS IEDIISGAGE EEYGQLKSSK LRNKTLEAYS LLNILELEIY KTRLQKLLNQ EVYYCQIESI EEMEYEGWVY DFEVSKYHNF VANNIICHNT IQTIAFLLKQ QEQKALKGPT LLVCPTSVLG NWEREVKKFG PTLKAIVHHG DKRAKGKGFA TAVKDTNLVI TSYALLHRDE KILETIKWQS VIVDEAQNIK NPEAKQSQAA RKLDASFRIA LTGTPVENRL SELWSILDFL NPGYLGQKQF FQRRFAIPIE KYGDTSSLQI LRSLVQPFIL RRLKTDKDII QDLPEKQENT IFCPLANEQA LLYQNIVENS LAEIDTVGGI QRKGKILALL IKLKQLCNHP VLLQIKKGSR KKVEITDKNS GKLQRLGAML EEIISEEERA IIFTQFAEWG KVLQPYLQKS LGREVSFLYG STQRSKREEM IDQFQLDPQG PPVMILSLKA GGTGLNLTRA NHVFHFDRWW NPAVENQATD RVFRIGQTRN VQVHKFVCTG TLEEKIHDLI ESKKELAEQV VGAGEKWLTE LDTDQLRNLL ILDRNQVIKE E
|
| |