Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44554 |
Symbol | |
ID | 7197799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 889762 |
End bp | 890858 |
Gene Length | 1097 bp |
Protein Length | 351 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178597 |
Protein GI | 219115603 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCACGTCCAA CCCGTCTGAT TTCGGCATTG TCACCAGTAA CATGTCCGGT AACGCATCGG CTGCACAGTC CGATGATGGC GACAAAGATC CCGTTGTTAC AGACAAGAAG GCAGAACCAA AGGTGACGAA AGGGAAAGCT AAGTCGGCGA AGAAAGCTAC AAAAAATAAA AGGTTTCGTT CCAAGAAGCC CAAAGACATG CCGCCTCGTC CGCTGAGTGC TTACAATCTT TTTTTTAAGG AAGAAAGGGT GCGTATGTTG ACGGGAGATG TAGCGAGTGA GGGCGACGAC GCTCCAAAAC GTATTGGGTT CGAAGCTATG GCCAAAACAA TCGGGAAACG GTGGAAGGAA TTGCCCGAGA TTGAACTTGC TCGATACAAA GCCGAAGCGA AAGATCAAAT GGATAACTAC CGCAGGGAGA TGGATAAGTA TCATATTAAT GTCGCAAAAC GCGCTCGTCT GGAAAAGGAA CAAGCGGCAG CGCAAAAAGC CGAAGAGGAA GCCGCTGCGG CAGCGGCTAG AAAGACAATG TTTGAGTCGA ACCCGCAAGG GGATATGCAG CAGATGGTAC CGGGTCTCGC TTCTACAATG GGAGGTTCAA ACAGTGCCGC TATGGGAGGG TCGGGGAATT TCGATATGGA ACAACTTTTG CGCGCTCAAC AAGGTATGCT TAATCCGGCG ATGAACGTTC CAATGATGGG ACTTGGAGCG AACTTTTCGC AATTCTACGG GTCGTCCGGA CTTCCTGCTG GTATAGGGAC TGGTGGACTT GGCATGGGGA TGGGACAGAC CAACTCTCAA TTTTTCCCAG GGTCTATGAT GCAGGGCAAT TCATTCCCGC AGCAAGATAA CAGTCAGCAC ATGATGCAAA ACCAGAACCC CATGGCTTTC CAGCTTGAAC AACATCTACA ACAGCAGCAA TTACAGATGC TTCAACAGCA GCAGCTACTC ATGCAAATGT CTGGAGCTCA AGGTCAGCAA CCACCTTATG GAAGCGGAGG AGGGAGTGGG GATATTTCTG GCTTTCCGGC AGGGCCTGAT TCGTCTTTTG CTTACAGCGA CCAAAATACC TTCCATGGCA ACCCTGGAGG ATCTTAG
|
Protein sequence | MSGNASAAQS DDGDKDPVVT DKKAEPKVTK GKAKSAKKAT KNKRFRSKKP KDMPPRPLSA YNLFFKEERV RMLTGDVASE GDDAPKRIGF EAMAKTIGKR WKELPEIELA RYKAEAKDQM DNYRREMDKY HINVAKRARL EKEQAAAQKA EEEAAAAAAR KTMFESNPQG DMQQMVPGLA STMGGSNSAA MGGSGNFDME QLLRAQQGML NPAMNVPMMG LGANFSQFYG SSGLPAGIGT GGLGMGMGQT NSQFFPGSMM QGNSFPQQDN SQHMMQNQNP MAFQLEQHLQ QQQLQMLQQQ QLLMQMSGAQ GQQPPYGSGG GSGDISGFPA GPDSSFAYSD QNTFHGNPGG S
|
| |