Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44818 |
Symbol | |
ID | 7199542 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 326096 |
End bp | 329251 |
Gene Length | 3156 bp |
Protein Length | 762 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178978 |
Protein GI | 219116366 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0143602 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGTTCTTC TTGCTTGGTG GGTTAACAAT GACCGTATTG GAACGCTGGC GTCGTCGAAA AGATCAGAGG AAAACATCAC TTGGACGTCA GCCGTCGTTG CGTAAAGGCC GCAACGGCTG GCCGCTTTTG AAGCGTAAAG AAGCCAGTTT TAGATTGACG GACTTCCGTG TGCGCAACCC AATTGAACGT CACCTGCAAC AAGAATCAAT GGGGCGTTTC AATGTGGAAG CACTCGTTTT GCGAGCAATC AAAGTGGAGG TGATTGATTT ACTTCGGGAT GCGGGTGGAT CTTGTACCGT GCTGGCCATG GCGAACGCCG TTTCGGGAAG TCCGCGAGTC CAGGAATGCG TGGGGGATGC AGCAGATTCC GAGAACGATG CTTGGGGATT GGCATTTTTG GACCTGATAC CGGGACGAGC CGCGCGAAAG AAAAGCAACG CACGGTTGGC AAAGCGCTAC GAGTTTATCA ATGCCTTGCA CAAATTAGAG GCTTCGCCGA GTTCCAGTGA AGCACGCATT TCAAGCTGGG AAGAGGCTTT GGAAAGACTG CGAACGGTCT TAAATACTCA ATTTGCGGAG GCAGATGGAA GTGATGGCTT GTCGATTCCC GACGATCTAT TAGAAGAGAG TATTGTCGAC AAATACTTGC TCCGCGCGCA AGCTATTAAA CTGCAGCAAA TAGAGGAGCT AGAGCATTCC CAAAACCAGT GGTTACAAAT TTCAAATGCT GCCCTTTCAC AGCGAATATC ACAAGTCGAC GCCAATCAGT TGGATGAAGC GGACCGATTA ATGAGAGCTC GCTTGGACGC CGAGTATGAA AAGCGGGAGA AACGTCTACA AAGTACGTCT CTGGCAGAAT TGGAGGCCAA GCTTATTCAA GAAGAGAGAA ATCGTGAAGC GAGAGAACGA GCATTATCTC TGATGCGACC TTTGGAGGAT GCGGAACTTA AAATTGTACA TGAGGCGATG AGTCATGTCG GGAATCCGAA CGAAATAGTT GCCCAAGCGG GCGTCGACTC GGTTCAAAGG GAATCGTTTC AACGACTTGC ACCGGCACAA TGGCTCAATG ACGAGGTCAT TCATTACTTT TATGTCATGC TAGCCAACCG GGACGAGGAA TTATGCAAGG CAGATCCCAA CCGCAAGCGA TGTCATTTCT TCAAGTCGTT TTTCATCACG AAGCTTTTAG ACGAGGAACA TTCCAATCCA TCACTCCGCG GTAAATACAA CTACAACAAC GTGAAACGAT GGTCCAAAAA GGTTCCAGGT ACGTTATCAG AGAAAAGGGC CTTCTGTTTT GTTGTCTCTA TGTTCTTACA AGGTATCGAC TGCCAGGCAA GGACATCTTC AATCTTGACA AAATCTTCTT TCCCATCAAT GTCAGTCGGA TGCACTGGGT ATGCGCAGTT GTCTTTATGC AACAAAAGAA GGTTCAGTTC TATGACTCCA TGGGGGATGG TGGTATGTAT CATTTAAAGG CAATTTTTCG CTACATCCAA GACGAGCATC AAGCTAAGGA AGGTGCTCCG TTACCGGACG CCGATGCGTG GACACTTGTG CCGTGCTTAT CGGATACACC ACGTCAAAAA AACGGTACGT AGGAAGTATA CGATGCGAGA ACGGTTGGCT GAACGAGAAC TTTGTTTACC ATTGTTTTTG ACGAACCAGG TTACGACTGC GGAGTGTTTG CGTGTATGTT TGCCGACTTT TTATCTAAAG ATTGCCCTTT GGTGTTCGAT CAAAGTATGG TCAACCAATG CAGAAATCGT ATCGCACTCG CTCTCTTGAA CGGCAAAGCT ATCCTGTAAA TGTACGATAT ATTCTTTCAC TAACATTTGA CATAGTATCA GATAGATTAC TGTCTTGCGA TATTCTCGAG AAAGATGGAA TCACTGGGCA GAAATGTTTC AGGCTCCTCT GTCAATTAGT TGGCACTGAC GCAATCAAGG AATTTCCAGT AGTGACAGCG CGTAGAGCCG TTCGTGGGGA TAAAGCGTCG CGCTTCGAGA GGTTTAACGA TCTTACTTAT ATGGTAAATA GCAGACGGAG ATGCCACTAT GAGTGTTTTT ATCAATTTCT ATGACCAGGA CCAATCAAAG GAGCGTTTTT CTTTTGATAT TTCCCATGCG GACATCGAAG TTGCCAAGTG ATTGTTACTG TTGGTAAATT GGGCACGCCG CGTCGCTTTC CATTGGCACA CGGACACGCC CGACAAGATG GCACGGCGAA CCGCACCGTC CGTGAATGCG GTAGTTCACT GTCACCTCTT CTCGCTTATT TCTGGAAAGT TCTCGCTCGG TACGTTCCGT CGTCACAAAA ACATACACGG CCGCTTACGG TTGGTTTCAT TGCAAGGGGA AACAAACCTT GACACATCAC TCCAAAGAAG TTCTTTGACC ACATACACTT TTCATTTGCT TATTAAGATG GTTTCTACTC GACGTTCCCA GCCCGTCAAG GGACCCGCGT ACACTAACTG TGACAAGGAT GAGGTAAGCT TCAGCATCGG GATCCTGCGG TCGTGTTGTT ATGCCGCCTC TGTGCGAGCG TCTCCGACAG CTCCCGATAG TACTTGAATA CAATTCCTGT TGCTCACATA CTATATCCGC GATTCACATT AGCTTGCACG ATCCCAGGGA ACCGAGGTTG ATACGTTCCC CAGCACAACT GTACGAAAGT CTGACCAGTC CCCCGGAACG GCGGACTCGA CGATGCACTG GACAGGCGCC TTCCACGATG ACACCGCGAT TATCGCGAAA CACGAGTTGG GACCGGATGT TGAGGACAGC GTCGTTGTCT CTCCAATGGA AAGCGATAGT AGTACCACCA CCGCCGTTCA CACCAACACG AAGAAGAAAG TCAAGGCCGC GGCGCGCAAG GCATCGGCGG CGACCGTCAA GCACGTCGCC ACCACGACGA TGTGTGCTCC AGCGAGTCCG CCTGGGACCA CCAATCAGAC CAGAGCCGTC ACCCGCAGTA AAGGAGGCAG CGTGGATCTA TTCAAGGGTA TTGAAAGTGT TCCCGTGGTA AAAAAGCCTT CTCCGAAAAA GCGAGGCGAC GAGGGGGGTA ACATGCAGAA AATCAAACTT CTCACGGGGA CCCTGTACCT GTACCGGGGG CGGTATCCAC GAGCCGAATT TGTGCGCACC AAGTGA
|
Protein sequence | MTVLERWRRR KDQRKTSLGR QPSLRKGRNG WPLLKRKEAS FRLTDFRVRN PIERHLQQES MGRFNVEALV LRAIKVEVID LLRDAGGSCT VLAMANAVSG SPRVQECVGD AADSENDAWG LAFLDLIPGR AARKKSNARL AKRYEFINAL HKLEASPSSS EARISSWEEA LERLRTVLNT QFAEADGSDG LSIPDDLLEE SIVDKYLLRA QAIKLQQIEE LEHSQNQWLQ ISNAALSQRI SQVDANQLDE ADRLMRARLD AEYEKREKRL QSTSLAELEA KLIQEERNRE ARERALSLMR PLEDAELKIV HEAMSHVGNP NEIVAQAGVD SVQRESFQRL APAQWLNDEV IHYFYVMLAN RDEELCKADP NRKRCHFFKS FFITKLLDEE HSNPSLRGKY NYNNVKRWSK KVPGKDIFNL DKIFFPINVS RMHWVCAVVF MQQKKVQFYD SMGDGGMYHL KAIFRYIQDE HQAKEGAPLP DADAWTLVPC LSDTPRQKNV IVTVGKLGTP RRFPLAHGHA RQDGTANRTV RECGSSLSPL LAYFWKVLAR SSLTTYTFHL LIKMVSTRRS QPVKGPAYTN CDKDELARSQ GTEVDTFPST TVRKSDQSPG TADSTMHWTG AFHDDTAIIA KHELGPDVED SVVVSPMESD SSTTTAVHTN TKKKVKAAAR KASAATVKHV ATTTMCAPAS PPGTTNQTRA VTRSKGGSVD LFKGIESVPV VKKPSPKKRG DEGGNMQKIK LLTGTLYLYR GRYPRAEFVR TK
|
| |