Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45666 |
Symbol | |
ID | 7200450 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 865200 |
End bp | 866895 |
Gene Length | 1696 bp |
Protein Length | 551 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179734 |
Protein GI | 219117896 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCAAGAGCGG CCGAAAACGC CCTTGACCTT TGATTGGAAA ATGAATGCGT CTTCCAATCA AACACTCGCA CCGAAAAGTG TTGTGACTAA CACTGAGTCT AATCAAGCGA TGCTGATCCA GCCTGCAAAT ACCGGAGGAT CAACGACTTC GAACGAGAAA GGCCCGCTTG TTGCCGATCT CGATATTGCT ATAGGACGCA ACGCTATGCC TGATTTTCTT GGACAGGATT TTCAGGACCG CAATGGACAG CACCATCCGA TTGCGGAGTT CCTTTATCAA CTGACTAAAA TGCTGACGGA TGAAAACTCA GAAATTATCG AATGGTCGGA CGGGAGGATC AAGGTTCACT ATCCTGAACG ACTGGAAGGT GAGGTCTTGC AGAAGTACTT TCGCCACTCC AAATTTGCAT CTTTTCAGCG CCAGCTGAAC TACTTTGGCT TCCGCAAAAT CGCGGGGAAA GGGAAGATGT CCCCATGCTC CTACGTCAAT GAAGCAGCGA CGTCCGATAT CCGTAGTCTA CTTCTTATTA AACGAAAAAC CAATGGTTCC GCTGCTCGAA AAGCTGCGAT GCAACAACAT TCCCTTGGGT TGCCCGCCTA CGACCCGAAC TCTACAGCTT CAAGCGGAAT GAACTTTCCC GGTCTCTCAA TGGCCACTGG TAGTTCACAT CAGCAGATGT CCCTTCTCAA TGGACAGCAG TCCATTGCTA ACGCCATGGC TTTGCTCTCG GAAAACGCCC TTCGAGCTGG CCTTGGTCAA TATCAGATGT TGCAACAACA GCAACAGCAG TCAGGAGATA GCTTGGGGAA TCACCAGCTT AATCTTTTTG CCCTTCAGCA GCAACAACAA CAACAACAAC AAAGATCCCA GCATCAATTT TTAAGTCTCC AACAGGCACA GCAAGAGCTT CAGCAGCAGC AACAAACATC GAGTTCAAAC CAGCGCCTTC AAGAGCAAAA TCAACTTGGG AGCAACCCCC AGTTTGGACA GCATGCTCTC CAACATTTAT CGATAGCTCA AGGTCAGCAA CACAATGGGT CACAGGTAAC CAGTAACAAC GCTTCGCAGA ACATCCCCTC GCTTGAGCAG CTACGTGCCC AACTGGCAGC GAGCTTAGCT AGCCAGCATG GGTCCTTTAA TGCCAGTCTG CTGAACTCTG GTCTGAATAT TGCGAACAAC CCTGTAGTTG CGGGAGTCAA ACTAAATGCA TCGCATCAAA CTACTAGTGC GAACCCTGCT TCAGCAGCTC TCAAAGCTTT AACCGCTCAG ACTCCTGCTA TGCCTGCGAC CGCAGCAATG GATGCGGCAT CAGCCACTGC TCAGACCAAC AGCAATCTGT TTGATTCTGC AGCCAATCTG AAATCCTTAC TGGGTGAACA CAGCTCTAAC CAGGATGCTC GTCCTAGTGT ACCAACGAAC TCAACCCCAC ACGTTTCCAG TGCCTTGTTG AACCGACTGC CCTCATCAAG TACCATCTTT CCAGAGCTGA GCACGGCCAG TTTTGGGAAC CTTCTGGCCA GCTCGAACCG ATTGAACTCA TTGCTGAGCT TGAACAGCTT TTTGGGATCT AGAGAACCAT CGTTGGCAGA TTTTGCGGCT GCCAACAATA TGAGCGCTCA TCAACTGGCG GCCAATGCCG CAAACGGCAT GTCTCATTTT GCTTCAGATG CTTCCAAATT TCGAAGCAAC CATTAG
|
Protein sequence | MNASSNQTLA PKSVVTNTES NQAMLIQPAN TGGSTTSNEK GPLVADLDIA IGRNAMPDFL GQDFQDRNGQ HHPIAEFLYQ LTKMLTDENS EIIEWSDGRI KVHYPERLEG EVLQKYFRHS KFASFQRQLN YFGFRKIAGK GKMSPCSYVN EAATSDIRSL LLIKRKTNGS AARKAAMQQH SLGLPAYDPN STASSGMNFP GLSMATGSSH QQMSLLNGQQ SIANAMALLS ENALRAGLGQ YQMLQQQQQQ SGDSLGNHQL NLFALQQQQQ QQQQRSQHQF LSLQQAQQEL QQQQQTSSSN QRLQEQNQLG SNPQFGQHAL QHLSIAQGQQ HNGSQVTSNN ASQNIPSLEQ LRAQLAASLA SQHGSFNASL LNSGLNIANN PVVAGVKLNA SHQTTSANPA SAALKALTAQ TPAMPATAAM DAASATAQTN SNLFDSAANL KSLLGEHSSN QDARPSVPTN STPHVSSALL NRLPSSSTIF PELSTASFGN LLASSNRLNS LLSLNSFLGS REPSLADFAA ANNMSAHQLA ANAANGMSHF ASDASKFRSN H
|
| |