Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47685 |
Symbol | |
ID | 7202878 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 517702 |
End bp | 519630 |
Gene Length | 1929 bp |
Protein Length | 598 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182083 |
Protein GI | 219123545 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.54676 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGTGAACC CCTATTCGTC TCTACCTTAT TTCAATGAGA GAAGAAAGAC GACGACGGTC CAGATAGTAG ACAGCCCATG CGAACGAACA TCGTGGCGTC GAGGAGGTCT ACTGCTTCTG GTATCGATAC TATTCTCGCC GTTCCATTTC TGCTAGTAGT TTTCTTTGGT ACGAGGTTCG CCGAACACTG CTCTCTTCAC GCGTTCTCTC TCGACGGAAT GCGTGCAGCA CGAGCGCTAA AAGTAGCGGA CACTTATTCA CTTTCCCGGC GGTTTTCTTG CCCACTTCGA CCTTTCTTTG GAACTGCTTT TCCGTCTACT AACAGTAGAG CGGCTTTGCC GACAAGCTCA CCATTACCGT CGTTAACCTG GCTTCGAAGC TCATCATCCG CACATTCTAC AGAAAGCGGT CAAGCACCAC AAGTATCCAT ATCGAAATCC TTCAATCCAG CCGCGCTGAA TGTCGACAAT CTGGCTTCCG ATATGGCACC GGATCACTAC TCACCGTACG AGCAATGGGT ACGACGTCTG TACATGACAA ACCTCTTCCA CCCGGTCAAA TTGGGCCTCG ACAATATTGA GCAGCTCCAT CGCGTGCTGG GATTTCCCAT GGACGACCCC AACGTAACGG TTGTACACAT TGCGGGAACC AACGGCAAAG GAAGTGTCGC GTTGAAAATA GCCAAGACTC TGGAATTGGC CGATCCAACG CGAAAAGTGG GACTCTTTTG TTCCCCTCAC ATTTCCAGTT TTCGGGAGCG CATGCAAGTC AACGGCGAAC TCATAACTGA AAACGAAGTC GTTGAGTTCT TACCGGAAAT CTACCGAATG TGTCAGGAAC ACGATATTCC AGCCACTTTT TTTGAAATAA CAACGGCTCT GGCGTTTCGC TTTTTCCATG CCCGTGGAGC TGATGCAGTT GTACTCGAAA CGGGCTTGGG AGGACGACTC GATGCGACCA ACGTGATAAA AAATCCAGCC ATTTCCATTA TCACTTCCAT TGGTCTAGAG CATACTCGTA TTCTCGGTGA TACAGTCGAG CTAATTGCAA GGGAAAAAGC CGGAATTATT AAGCCCGGAA GGCCGGTCCT TGTGGGGCCG AATGTACCAG ACGCAGTTAC CCGGGAATGC GCCGCCGAAA AACAATCCGG TCCGTACTAC ACCGTTCCGG ATATTCTGGG AGCGGATTTG TTGGATAAGC TGACTACCCA GGAAGTTCTG GACTACGATT TAGAAAATGC ACGAATAGCT CGAGCCGGTT TGAAAATACT GGAAACCGAG TTTCCGAAAA AGATCACAGA TATTTCAGAG GATGTACTGC AACAAGGAAC TTCCACTAGA CCCCCATGTC GATTTGAGCA TGTGGACTGT GGAGATGGGT TGACAGTTAT TCTGGATGTA GCACACAATC CTCCGGCTAT GGACTATATG GTCCGGAAAC TCGCAACGAC GTATCCTGGA ATGAGCTTTC GTGTCGTCGT AGGGATGTCT TCTGACAAAA ATCTGAAGTT GTGTGGGCAG TCGATATTAC AAGCTGTGCA ATGTGACGCG ACAAAGATCC ACCTGGTCGA GGCCGCACAC CCCCGAGCTG CGACGTTGAA GGCAATTCTG GAAGCAACCG GTCTAACAGA TGCCCACTTT GATACGAACG ACTCATCTCC GACCAAACAG ATAAAGGAGG CGATAAAACT GGCCAGAACT AATGGAGAAT TGCTGGTTGT TTGTGGTAGC GTTTTTCTCA TGGCAGAAGC CCGCGAGGCA TTGGGATTCG AAGAGCCGCG AGATTCGGAG TATATTGCTG AGGTTGCTGG TGCAGGTGTC CGCCATGGTC AAGAAAATTT TGGAAACACT CCGGAATCAG CAATTGTTGT GTAGGTGTTC GTCGTTTGGC GTTTTTCTTC AGTAATTATC TTTAGCAAGT ATCTCCTTA
|
Protein sequence | MRTNIVASRR STASGIDTIL AVPFLLVVFF GTRFAEHCSL HAFSLDGMRA ARALKVADTY SLSRRFSCPL RPFFGTAFPS TNSRAALPTS SPLPSLTWLR SSSSAHSTES GQAPQVSISK SFNPAALNVD NLASDMAPDH YSPYEQWVRR LYMTNLFHPV KLGLDNIEQL HRVLGFPMDD PNVTVVHIAG TNGKGSVALK IAKTLELADP TRKVGLFCSP HISSFRERMQ VNGELITENE VVEFLPEIYR MCQEHDIPAT FFEITTALAF RFFHARGADA VVLETGLGGR LDATNVIKNP AISIITSIGL EHTRILGDTV ELIAREKAGI IKPGRPVLVG PNVPDAVTRE CAAEKQSGPY YTVPDILGAD LLDKLTTQEV LDYDLENARI ARAGLKILET EFPKKITDIS EDVLQQGTST RPPCRFEHVD CGDGLTVILD VAHNPPAMDY MVRKLATTYP GMSFRVVVGM SSDKNLKLCG QSILQAVQCD ATKIHLVEAA HPRAATLKAI LEATGLTDAH FDTNDSSPTK QIKEAIKLAR TNGELLVVCG SVFLMAEARE ALGFEEPRDS EYIAEVAGAG VRHGQENFGN TPESAIVV
|
| |