Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54588 |
Symbol | GWT1 |
ID | 7201572 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 471835 |
End bp | 473315 |
Gene Length | 1481 bp |
Protein Length | 445 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181014 |
Protein GI | 219120556 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGTCGACGC TTCCTCTCTA GTAGGGAATA CAATACCATG AATTTACCGA ACTGCATCGA TAGTAGCACT TTGTTCGTAC TTTGCTGGAC AGGAAGTGAC TACTAGCGGG ATAGTATGTT TGCGAAAAAG CTCGCCAAAG AAGCCTTCGT CATGGGGTTA GAAGGGACAA CGCCTCTCGA GCTGCTGCTA GTTTTGTCTT GCATTCCTAT TGGATTTTGG AGTTTCCAAT TTCTCCCCAG TGCGAATCCA TGGCAGTCGT CTGTCTCCGA AGCAATAACA TTTTGGATTC CCATGATTCT GTGCCAAAGC AAGTTGCTTT ATCCGTATGG TGTACTTTAC CTTGCCTCAG AACTTACCTT TGCCATCATA AACGAGTCCG TCAGACCAAA TAGAGAAGTA CTAAAGAGGA CTGACGATAT GCGGCGAGTC ACCTTGACTG TGTATCGATC TTCGCTCCTT TATCTGACGT TTGTAGCTAT TTTATCCGTG GACTTTCACT TTTTTCCCCG GCGATTTGCC AAGACAGAAG AACGTGGCTA CAGCCTTATG GATATGGGTG CTGCGTCGTT TGTGATTGCT GCTGGTCTAG TATCGACTCG TGCCCGGGGT AAGACAGCGA ATACCCGGAG AGATTTTGCA CGGACCTTGC CGCTGCTCAC ATTGGGAGTC TTGCGATTGA TTGCTCACAA GGAGTTGGAG TATCAGGAGC ATGTCTCCGA GTATGGTGTC CATTGGAACT TCTCCTTTAC GTTAGCCATC TTGTCACCGG TCGGGGCACT GCTACCAGGT CCCACTTGGA CTCTTCCAGT GGCTCTTTTG AGTTTCTACC AATTTGCTTT GTATTCTTTC GGGCTTCAGA CATGGATCGA GGATTCGCCT CGACAATGTC TTGAATTCGA CCACAATATA TGTCACTTTT TCGCAGCCAA TCGGGAAGGA TTGCTTGGAT GTGTTGGATA CAGCGCAATA TATCTATTAA GCGAATGGTT TGGCTCTCAA TATCTCTGGA GAGCGCCCGA CGACAATTAC AGATTGAAGT TTGGATTGTT CAAATTCACG GGAGGGTTGA CCCTATTCTG GTTGATATTG GAAGCGTCCG GCTTGACAGC ATCCCGCCGC TCAACCAATC TTGTGTTTGC CGTCTGGGTC CTCCTAGTAA ACATTTTGAT TCTTACTACC GTGCGATACG TTTGCGTTGG GCATGACAAG GTACCTTTCG TACTAAACAC CGTAAACAAA CATGGGTTAC CTTGCTTTGT CGGGGCGAAT TTAATGACGG GAATTGTAAA TTTGTCATTC GATACAATGC AACAAAATGA CACCACCGCG TTCGTAATAC TGCTAGTCTA CATCTCTGGG GTCGGGACAC TGGCGGTGTC GCTCAACATT CTCATCCCCA AACTTAAAGG CTTCTTCTCA AGGCTACCTC CTGGAAAGGA AAAAATAATG TGATATGTTG ATTAAATGTA TGTTTCGTAT A
|
Protein sequence | MFAKKLAKEA FVMGLEGTTP LELLLVLSCI PIGFWSFQFL PSANPWQSSV SEAITFWIPM ILCQSKLLYP YGVLYLASEL TFAIINESVR PNREVLKRTD DMRRVTLTVY RSSLLYLTFV AILSVDFHFF PRRFAKTEER GYSLMDMGAA SFVIAAGLVS TRARGKTANT RRDFARTLPL LTLGVLRLIA HKELEYQEHV SEYGVHWNFS FTLAILSPVG ALLPGPTWTL PVALLSFYQF ALYSFGLQTW IEDSPRQCLE FDHNICHFFA ANREGLLGCV GYSAIYLLSE WFGSQYLWRA PDDNYRLKFG LFKFTGGLTL FWLILEASGL TASRRSTNLV FAVWVLLVNI LILTTVRYVC VGHDKVPFVL NTVNKHGLPC FVGANLMTGI VNLSFDTMQQ NDTTAFVILL VYISGVGTLA VSLNILIPKL KGFFSRLPPG KEKIM
|
| |