Gene Information |
Locus tag | Tery_3934 |
Symbol | |
ID | 4244017 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 6080614 |
End bp | 6085413 |
Gene Length | 4800 bp |
Protein Length | 1599 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638108856 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_723438 |
Protein GI | 113477377 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
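The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon (the gene sequence in this record ends in TAG). The function names `gene_length_bp` and `protein_length_aa` are hypothetical helpers introduced only for this example.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information table for Tery_3934.
length = gene_length_bp(6080614, 6085413)
print(length, protein_length_aa(length))  # prints "4800 1599"
```

Under these assumptions the computed values agree with the Gene Length (4800 bp) and Protein Length (1599 aa) fields reported above.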
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.146607 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.000100505 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGTCTAATA GTGGTAATAT AGAAAAGTTG CTTCGAGGCT TGAAGATGTC TAAAGGAAGA TTCTCGCCTT TTTTGGCTCG TTATAGTTAT GTGAGTCAGC GCGATCGCTT AATTGAAAAT TTCCGAGCAT CTTTTCCTGG TGTATTGAAA GTATTGGCCT TGGATGAGTC TGTGACCCAA ATTTATACAA CTGTCAGAAA ATATTTAGAA GGCAAAGAGC TGGATGTTTT GATGGTCTGT GGTTGGGAGT CGTTGAGGAA TATTAATGAA TTATTAGTGG CAATGGGATA TGTGCGGGAA GAGTTTAGAA AACATTGCCC TCTACCAATA GTTTTTTGGG TAGATGGGAA CGTTTCCCGG AAATTTATTC GGTTAATTCC AGATTTTTAC AATTGGGTGA GTCTGACTGT ATTTGAATCT GCTAATGATG AATTAATTGA TTTTATTCAG CAAACGAGTG AGAATGTTTA TCAGAAGGTT TTGGAAAGTG GGGTAGGAAT ATTTTTAGAT TATACTGACC TGGGTTTGCC AGAGTCTAGT TATCAAGATT TGTTAGAGGC TCGGCAAGAG TTGGCCAATA GAGGAATTAC TTTAGAAGCA GAACTAGAGG CGAGTCTGGA ATTTGTTTTA GGTAGGTTGA CAGATGATTT TGAGGAAACA GCCCGAGATC ATTATCAACG GAGTCTTGAA CTTTGGCAAC AACTTAAGAA CCCGTTGCGA GTTGCTCATA CTTATTATTA TTTGGGTTTG TGGTGGCAGA GTTATGGGGT GACACATCGA ACAGAAAAAA AGATAGCTGA CAAGAATGCT GGGGATTATT TTCAACAGTC AGTAGAAGGT TTTGAGGCAA TTAAGCGTCT AGACTTAGTT GCTAAATTTA TCAATGGTTG GGGAGAAGTT TTGCAGGTTT TGGAGAGTTG GGAGCAGTTG GAAACTGTTG CGAATAGAGC AATAGAAGTT TTAGATACCT CCCAGCCTTC TTTTAGACTA GCCCGTGCTT ATGATTTTCT TGCCGAATGT GAACTAGCTA AAGGTAACTA TAAACAAGCC AAACAACTTG CTGAAACTGC TATAGAAATT TTCAACAATA CTCAAAGCGC TGCATCTGTT CTTACCTCAG AAAAAGATCA AAAAACTTTA GATCGGGAAA AGTATTATCA TCGGGGTAGA TATTTATTTT CTTTGGCAAA AGCTGAAAAA GGTAGTGGTA AAATTAAATC AGCTATAGAT ACTTTGGAAA AAGCTAAAAA GAGGACGAAG CCTGAATATA ACCCGGATTT TTACATCAAT ATTATTGAGA AATTACGAGA AATATATTAC CAAGAAAAAG AATATCTTAA AGCTTTTAAA TTAAAGCAAG AACGACAAAG AATAGAACAA CTATTTGGGT TTATAGCTTT TGTTGGGGCG AAGACTTTGG GCTCTATAAA ATCTAATATT CATCTCGCTT TACCTTATTT AAAACCAAAC AATAATCAAC AAAAAATAAC TCAGGAAATA GCAGCATCTG GACGAGAATT TGATGTCAAA ATATTACTGG AAAGGTTGAG TCAACCTCAA CATAAGCTTA TAGTTCTTTA TGGAGAATCA GGAGTTGGTA AAAGTTCGAC TTTGCAGGCT GGATTAATTC CTGTTTTAGA GAAAAAATCA ATTGATACTC GTGATGTCGT GGTGGTTTTA CAGCGAGTTT ATGTTAATTG GATTTGCCAA CTAGGAAAAG GTTTAGCTAA ACAACTGGAA ACAACTAAAA ATTTAGCTGT TAATTCAGAG AAACTAACCT CAATTGAAGG AATTTTTGCT 
CAGTTAAAAA AGAATGAACA ATTAAATCTA ATGACGGTTA TTATTTTTGA TCAATTTGAG GAATTTTTCT TTGTTAATAG GGAACTTCAG GATAGACGAG AATTTGCTCA GTTTCTCCAA AATTGTTTAG AAATACCTTT TGTGACAATT ATTTTATCAT TGAGAAAAGA TTATATTCAT TATTTGTTAG ATTTTAGTCG TTGGGCTAAT TTAGAAGTAA TTAATGAGAA TATTTTAGAC AAAAATATCC TCTATTACTT AGGGAATTTA AGACCTTCAC AGGCAAAAAT AGTCATTGAA AAATTAACAG CAAATTCTCA GTTTAAAATA GACTCAGCAC TGACAGAAAA ATTGGTAGAA AACTTAGCAC AGGAGTTGGG AGAAGTTCGT CCTATTGAGT TACAAGTAGT GGGGGCACAA CTCCAGGAAG ATCAGATTAC GACTTTGGCA AAATATCAAG AATTAGGAAA TATTCCTCAA GTAGAATTGG TAGAGAGATC TTTAAAGTCA GTGGTGAAAG ACTGTGGAGA AGAAAATGAA AAATTTGCCT GGGTGGTGTT GTGGTTATTA ACGGATGAAA ATAATACTCG ACCTCTGAAA ACTCAAGCTG AGTTGGTGAA AGAAAGTGAC TTTAATCCGG AGAAGTTAGA GTTAGTATTA AATATTTTTG TAGGTTCGGG ATTAGTATTT TTATTGCCAG AAAAACCAGC AGCTCGTTAT CAGTTAGTAC ATGATTATTT AGTTTGGTTT ATTCGACAAA GAAAAGGAAA TAAAATACAA GAGGAGCTCA AACAGGAACG GGAAAAACGG CAGCAGTTGC AGACGTGGTT AGTTAGGGGT TCTGTTGCTG TTTCCTTGGT GATGGCGATA TTGGTAGGGG CGATATATAT GTCTGTAAGG GAGGAACATA AGGAAAAAAT TATTTCTCAA GCTAATGAGT CTAGAGCGTT ATCTATTTCT GGTCAGCGAT GGGATGGGTT GATGACTGCG ATGAATGTTA GGCAAAAACA AATTGATGCT AAAATTAAAC CAATAGGTAA AGCTAGCAAG ATTACAAATG CCCTCCGAGT CGCAGTTTAC AGCTACGATA AAGATGATGA GTTTCGAGAA ATTAATCGTA CTCAGGCCCA TGAGAATTGG GTAAATGGTA TAGCATTTAG TCCCGATGAA GAAACTATTG CTTCTGGAAG TTATGACAAC ACAATGAAGT TGTGGAACCA TCAGGGAAAT TTATTGCAGA CTCTAAAAGG TCATGAGAAT TGGGTAAATG GTATGGCTTT TAGTCCCGAT GGAGGAACTG TAAAGTTGTG GAACCACCAG GGGAAATTAT TGCAGACTCT AAAAGGTCAT GAGAATTCGG TCTATGGTAT AGCATTTAGT TTTGATGGAG AAACTATTGC AACTGCTGGT GCTGACAAAA CAGTGAAGTT GTGGAACCCT CAGGGAAAAT TATTGCAGAC TATCACCGGC CATGATAACT GGGTCTATGG TATAGCATTT AGTCCTGATG GAGAGACTAT TGCTTCTGCT AGCTGGAAAA CAGTGAAGTT GTGGAACCGT CAGGGGAAAT TATTGCAGAC TCTCACCGGC CATGAGAATT GGGTCTATGG TGTAGCATTT AGTCCTGATG GAAAGACTAT TGCAACTGCT GGTGGTGACA AAACAGTGAA GTTGTGGAAC CGTCAGGGGA AATTATTGCA GACTATCATC GGTCATGAGA ATTGGGTCTA TGGTGTAGCA TTTAGTCCTG ATGGAAAGAC TATTGCAACT GCCAGTGGTG ACAAAACAGT GAAGTTGTGG AACCGTCAGG 
GGAAATTATT GCAGACTCTA AAGGATCATG ATAATTGGGT ATATGGTGTA GCATTTAGTC TTGATGGAAA GACTGTTGCA ACTGCCAGTG GTGACAAAAC AGTGAAGTTG TGGAACCGTC AGGGGAAATT ATTGCAGACT CTAAAAGGTC ATGATAACTG GGTCTATGGT GTAGCATTTA GTCCCGATAA AGAGACTATT GCAACTGCCA GTGGTGACAA AACAGTGAAG TTGTGGAACC GTCAGGGGAA ATTATTGCAG ACTCTCACCG GCCATGAGAA TTCGGTTTAT GGTGTAGCGT TTAGTCCTGA TGGAAAGACT ATTGCAACTG CCAGTGGTGA TCAAACAGTG AAGCTGTGGA CAAATTGGCG GATAGAAGAT TTAACTAAGC ATGGTTGCGA GCGGTTAAAT AATCATTTAG TTGCCCATCC CCAGAAGTTA GAGGAACTCA GAATTTGCCA AACTGACGAA CGCAAGAAAT TAGCTGCTAG TAGTTGGGTA ATAGAGGGAG AAAAGTTGGC AAGGGAAGGG AAGGTTAAGG AGGCAGTTGC CACGTTTAAG AAAGCAGCAA AATGGAATTC TGATATCAAC ATCAACTCAA ATTTTCTAGT TTGGGCAAAA TTATTAGCTG AGGCAGAAAC ACTGATGGAG GAAGGAACGG AATTAGCAAA AGAAGAAAAA ATAGAGGCTG CAGTGGAAAA ATACCAACGA GCAAAAGAAC TTGATCAAGT GGCATTTACC CCTACTTGGC AAAATGTTGA CCCAGAAGCT AAAGCTAAAT ATCAGGCAGT GGATGCTTTG TTGAATAAAG GGGATGAACT TCTGAGGAAG AGAAAGGTTA AAGAAGCGAT CGCCTCCTAC GAAAAAGCAG AAAGGATCGA CTCCACCCAA ATTTCTGCTT TTAACTGGAA TAAACTTTGT CGGGATGGTA GTCTTGATCA AAAAGCTGCT GATGTTATGT TTGCCTGTGA AAAAGCAGTT GCCATGAGCC CGCAAGATGG CAAAATTATT GATAGTCGCG GACTTGCTAG GGCATTAACA GGGAATATTG AGGGAGCGAT CGCTGATTTT CAAGTCTATG TAGAATGGAC GAGAAATGAG GAGAAAAAAG CACAACGACA ACAATGGATA AAAGCACTTC AGGCAGGGGA AAACCCATTT ACTGATAAGC TGTTGGAAGA GTTAAGGTAG
|
Protein sequence | MSNSGNIEKL LRGLKMSKGR FSPFLARYSY VSQRDRLIEN FRASFPGVLK VLALDESVTQ IYTTVRKYLE GKELDVLMVC GWESLRNINE LLVAMGYVRE EFRKHCPLPI VFWVDGNVSR KFIRLIPDFY NWVSLTVFES ANDELIDFIQ QTSENVYQKV LESGVGIFLD YTDLGLPESS YQDLLEARQE LANRGITLEA ELEASLEFVL GRLTDDFEET ARDHYQRSLE LWQQLKNPLR VAHTYYYLGL WWQSYGVTHR TEKKIADKNA GDYFQQSVEG FEAIKRLDLV AKFINGWGEV LQVLESWEQL ETVANRAIEV LDTSQPSFRL ARAYDFLAEC ELAKGNYKQA KQLAETAIEI FNNTQSAASV LTSEKDQKTL DREKYYHRGR YLFSLAKAEK GSGKIKSAID TLEKAKKRTK PEYNPDFYIN IIEKLREIYY QEKEYLKAFK LKQERQRIEQ LFGFIAFVGA KTLGSIKSNI HLALPYLKPN NNQQKITQEI AASGREFDVK ILLERLSQPQ HKLIVLYGES GVGKSSTLQA GLIPVLEKKS IDTRDVVVVL QRVYVNWICQ LGKGLAKQLE TTKNLAVNSE KLTSIEGIFA QLKKNEQLNL MTVIIFDQFE EFFFVNRELQ DRREFAQFLQ NCLEIPFVTI ILSLRKDYIH YLLDFSRWAN LEVINENILD KNILYYLGNL RPSQAKIVIE KLTANSQFKI DSALTEKLVE NLAQELGEVR PIELQVVGAQ LQEDQITTLA KYQELGNIPQ VELVERSLKS VVKDCGEENE KFAWVVLWLL TDENNTRPLK TQAELVKESD FNPEKLELVL NIFVGSGLVF LLPEKPAARY QLVHDYLVWF IRQRKGNKIQ EELKQEREKR QQLQTWLVRG SVAVSLVMAI LVGAIYMSVR EEHKEKIISQ ANESRALSIS GQRWDGLMTA MNVRQKQIDA KIKPIGKASK ITNALRVAVY SYDKDDEFRE INRTQAHENW VNGIAFSPDE ETIASGSYDN TMKLWNHQGN LLQTLKGHEN WVNGMAFSPD GGTVKLWNHQ GKLLQTLKGH ENSVYGIAFS FDGETIATAG ADKTVKLWNP QGKLLQTITG HDNWVYGIAF SPDGETIASA SWKTVKLWNR QGKLLQTLTG HENWVYGVAF SPDGKTIATA GGDKTVKLWN RQGKLLQTII GHENWVYGVA FSPDGKTIAT ASGDKTVKLW NRQGKLLQTL KDHDNWVYGV AFSLDGKTVA TASGDKTVKL WNRQGKLLQT LKGHDNWVYG VAFSPDKETI ATASGDKTVK LWNRQGKLLQ TLTGHENSVY GVAFSPDGKT IATASGDQTV KLWTNWRIED LTKHGCERLN NHLVAHPQKL EELRICQTDE RKKLAASSWV IEGEKLAREG KVKEAVATFK KAAKWNSDIN INSNFLVWAK LLAEAETLME EGTELAKEEK IEAAVEKYQR AKELDQVAFT PTWQNVDPEA KAKYQAVDAL LNKGDELLRK RKVKEAIASY EKAERIDSTQ ISAFNWNKLC RDGSLDQKAA DVMFACEKAV AMSPQDGKII DSRGLARALT GNIEGAIADF QVYVEWTRNE EKKAQRQQWI KALQAGENPF TDKLLEELR