Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45917 |
Symbol | |
ID | 7201132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 667340 |
End bp | 668979 |
Gene Length | 1640 bp |
Protein Length | 499 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180290 |
Protein GI | 219119047 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.147124 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTACCTTTCC ACCTCGTCCA TCTCCACTTC TCCAACTCCA GAGATTCGTC GCAGTTCTAG TGCCTGCTAA TATTGCGTCT ATCCGAAATA GTCCATCTCG TTCGCTAATC ACATTACTTT CCCAGTAAAC CTAAAAAAAT GTGGAGCTTT CGTAGCCAAA ACGAGCTTTC CCACACGAAC CGGAAGCGAC CAAGAGAGCA CCGTCGATGG ATTGCTGGCA TGTGGATAAT ACTGTCAAGC TTAGCATCGT CTAGCATGGC GTTCGCTCCT TTCGATAGCC CCTATTCGAA AGCAAGTCGA AGAATGTCTA CATGGCAAGC ACAACGGATT CTAGTTCAGG AAACCTACCA CGAGGACAAT TACGGTGAGG GGCCAAAGAG CAAATGGTTT CAGTGGTTCA TTGGTGGGAA ATCACGGGGA ACAAATAAAA TCATTATGCG CGAAGCGGAA GCGCTGGGAG GCTTGCCCCG CAGCAATCGA TACGCGAGTC GCGATTGGTT TCACAGTGCG ATAACTCTAC CGAATTCGGC CATTTTGCGA GACATTCGCA ACCCAGTGAT CGCAATCACC TCGTGGGCAT CTTTCTTGTC TATCCTGCAT CGGAAACTCG TGAAAGTCAA TCCACTTGCA GCCGAACACA TGTATTTCCC CACCGCCCCC CATTCGCTCA TGATGAGCGC TTTAGGTCTG TTGCTTGTTT TTCGCACAAA TAGCGCCTAC CAACGGTTTA CTGAAGGGCG CACGATCTGG GAGCAGATTA TCAATTCTAG CCGTGATTTG TTTCGACTTA TGATGCTTTA CGAGAAGGAA ATTGGGGTCG AAAAACGTCG CCGTGTACAG ACCTTGTGTG CAGCCTTTCC TTATTTGCTC CGCCATCGGA TTCGGCCCGA TCTAGTCATG AAAACGGTAG ACGACGAGGC CTACAAACGC GAACCCGAAA ATACAATATT ACTATATCAA GACTCCGGCC CAACGGATGA TGACGCCGAA GCAGCGGCAG TCGCCAACGC CGAAGAAGAC ATCGGCCGCA GTCGTCGTAA AACCCGCCCA TTGTTCTGGG TTGACAAGCG CACGTTGCCA TGGCGACTGC TTCCACCCGG GGCGTTGGAG AAGTGTGCTC GAGCCCAGAA TCGTCCACTA TGGGTGTGCG ATCGTATGGC TATGGAGCTA AGGGCCGTAC CTGATCAAGT TGGATTCACA AATCGCGAAC GACTCGCGCT GATCTCACAC GTGGACAAAT TGTCTCGATG CATTGGAGGT TCTGAGCGTA TTCATCAGAC AGTCGTACCT TTGAACTATG CACGCCACAC GCTTCGTGCA TTGACTGTAT GGCTCTTCTC ACTGCCTTTC GTGGTAGTGA AAGATCTAAA ACTCTTGACT GGTCCAGTGT TGTTCCTCGT TTCTTGGATG TTGTTTGGTG TATTTGAAAT CGGATCGGCC ATTGAGGATC CTTTCCAAGG AACGCTGCGG CTGTCCATTC TTTGCGACAC TATTCGGCGC GACGTGGTTG GTGACGAGTA CGTCCGGAGC ACGGCTTTTT TTCTGGAAGA GGAAGATGCC GCGACTCAAG GGCCAAACGG AGCAAGTGCA AACAAGATCC TGGACTCGGT ATCGTCACCT GGGGAAGCCT TGCCTTGAAT
|
Protein sequence | MWSFRSQNEL SHTNRKRPRE HRRWIAGMWI ILSSLASSSM AFAPFDSPYS KASRRMSTWQ AQRILVQETY HEDNYGEGPK SKWFQWFIGG KSRGTNKIIM REAEALGGLP RSNRYASRDW FHSAITLPNS AILRDIRNPV IAITSWASFL SILHRKLVKV NPLAAEHMYF PTAPHSLMMS ALGLLLVFRT NSAYQRFTEG RTIWEQIINS SRDLFRLMML YEKEIGVEKR RRVQTLCAAF PYLLRHRIRP DLVMKTVDDE AYKREPENTI LLYQDSGPTD DDAEAAAVAN AEEDIGRSRR KTRPLFWVDK RTLPWRLLPP GALEKCARAQ NRPLWVCDRM AMELRAVPDQ VGFTNRERLA LISHVDKLSR CIGGSERIHQ TVVPLNYARH TLRALTVWLF SLPFVVVKDL KLLTGPVLFL VSWMLFGVFE IGSAIEDPFQ GTLRLSILCD TIRRDVVGDE YVRSTAFFLE EEDAATQGPN GASANKILDS VSSPGEALP
|
| |