Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2226 |
Symbol | |
ID | 4243260 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3471091 |
End bp | 3472485 |
Gene Length | 1395 bp |
Protein Length | 464 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 638107328 |
Product | WD-40 repeat-containing protein |
Protein accession | YP_721928 |
Protein GI | 113475867 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.797248 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.32606 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGAAC TTAAACTTAT ACTAAACCAA CCAAAGCTTT GGAAATATTC CATAATGATA GTAACAGCTA TTAGTGGACT ATGGTTGTGG CGATCGCTAT CAATATTAAC CGAATTGAAA GCAAAAACTC AAGCCTCCGT CACCCTTTCC GGACATAAAA CACCAATTTA TGCAGTTGCC ATTAGTGCTG ACGGAAAAAC CTTAACAAGT AGCAGTCATG ATGGTAAAAT TAAGGTTTGG AACTTAACAA ACGGTCAACT ATTTCATACC ATCAATGCCC ATGCAGATGC CATTGAGTCC CTTGTTATCA GTCCCGATGG AAAATTTATT ATCAGTGGTA GTTGGGATAA CGACATTAAA CTTTGGAATA TTACAAATGG AAAATTTATC CAGACCCTTA AAAGTCATGC CGACGATGTG AAAGCCATAG CAATGAGTAA GGATGGTCAG ACCCTGGCGA GTGGTAGTTA TAATGGTGTC ATTAAAATAT GGAACCTCAA AACCGGTTCC CTTAAGATGA AAATTAAACA GCCATACCCT ATAATTGCTC TAGCTTTTAG TCCTGATGGA GAGATACTTG CTAGTGGATG TAAGAAAGGA AACATCAAAA CTTGGGAATT AAATACAGGT AAAGAACTCC ACTCCTTTGC AGCACATACC AAGACAATTT GGGCGATCGC CTTTAGTCCT GACGGCAAAA TTCTTGCTAG TGGCAGCCAA GACCAGAAAG TTAAGCTATG GGAGATAGAA AAAGGCCAAC TTCACAGTAC TTTAGAAAAC CATGATCAAG CAGTTTTATC CGTCGACTTT AGTCCTGATA GCAAAATTGT TGCTGGTAGT AGTTATGACA GCAAAATTCA TCTGTGGCAA GTAGAGACAG GAAAATTACT AGAAACATTT ACAGGCCATT CTCAAGCAGT TTGGTCTTTA AAATTTACCC CAGATGGTCA AACCCTTGTG AGTGGTAGTA CTGACAGGAA CATTAAATTG TGGTGTTTAT CTAACTTAAA TACTCAACAG TTACAGAATA CTACTTTTCG ACCTGTCGTC AAGGAAGAAG CAGATAAAAT CTTCATCTCA GAAATTATTG ATACACAAAA ATTAGAAGAG TTAAACCAAA TATTATATCA TCAAATTAAT CAAAGCTGGC AACAGACTCC AACATGGTCT GAGAACTTAG TTTATCAGGC CACAGTTAAT AATAATGGTG TCATCCTCAG TTTAGAACCC ATAAATAGAT CAGCAAGAGA TTATTTTCAA CAAACACCTT TACCAAAATT GCTCAATAGT TCTGATGCTC ATGGGAGTCA TCAAAAGTCT TTTGCACTAT TCAAAACAGT AATGACACCA ACAGGAGTAC TTGAGGTGAG TCCTTGGAGA GGTTGGGAAA ACTAA
|
Protein sequence | MTELKLILNQ PKLWKYSIMI VTAISGLWLW RSLSILTELK AKTQASVTLS GHKTPIYAVA ISADGKTLTS SSHDGKIKVW NLTNGQLFHT INAHADAIES LVISPDGKFI ISGSWDNDIK LWNITNGKFI QTLKSHADDV KAIAMSKDGQ TLASGSYNGV IKIWNLKTGS LKMKIKQPYP IIALAFSPDG EILASGCKKG NIKTWELNTG KELHSFAAHT KTIWAIAFSP DGKILASGSQ DQKVKLWEIE KGQLHSTLEN HDQAVLSVDF SPDSKIVAGS SYDSKIHLWQ VETGKLLETF TGHSQAVWSL KFTPDGQTLV SGSTDRNIKL WCLSNLNTQQ LQNTTFRPVV KEEADKIFIS EIIDTQKLEE LNQILYHQIN QSWQQTPTWS ENLVYQATVN NNGVILSLEP INRSARDYFQ QTPLPKLLNS SDAHGSHQKS FALFKTVMTP TGVLEVSPWR GWEN
|
| |