Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43079 |
Symbol | |
ID | 7196715 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 1974364 |
End bp | 1975717 |
Gene Length | 1354 bp |
Protein Length | 376 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176885 |
Protein GI | 219110267 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGGTG AACTATTCGC CCTCTTTGAT TGGCTGTGTT AGTCCCTTTC TTCCGGGATC TGAAATCGGA CCTTCGGCTG GAACAAAATC CTTTTGTTGT CCATCGCGTG AACATTCTCT GTCAACTCGT GTTCGCTGCC ATCCTACAGA AGCGCGTGTT GCTTGCCAAT GTAATGGCAA GGTTTCGGAC ACAGAAGCGA GATTTTCGTC AGATGGGCGC GCGGGGTGAG AATGGCGAGT ACTCAGTACG CGGAAGGTAT TCACAGTCAG GGCCGATGGG TTTTTCGCTG GGCCTTACGA TTATCCTTGC CTGGATAGCA CTGGGTTGGC CTTCGGTTGC AGCCAAGTCT GCTTCCAACA TCTCTCCGAT ACGAAACTTT GAGCCTTCCT TGGCAGCATC ATCAGCTCAA GGAACCACAA TCTCTGTTTC GTATCCACGT GATGGTTCCA GACAACAAAC AGGCATAGTG GTTGTCTTCC GGTCTCCGTA CAAAGGGGGA TCCGTGCGCA AAAAATCCGC GAAAACCGAT TGGACGTTGA TCGACGGCAT TCGAGTACGA CCGACATCAT CGGATACGGC AACCAATTCC GAGTCTTCCC ATCGATGGAC TCTCCTGTCC GACGCTCTCT GTTGCATGAC GGGCCTCGCT TCGGACGTGG ATTATCTGTC CCGCTGCATC CAAAAACAGG TCGACACGCA TCGGGTTGTC TACGAAGGAA CACGAGTATT TTCGGCTCTA CAATTGGTAC GTGCACTGTG TGAGTTGTTA CAGGAGGCCA CGAAAGGGAG CAGCGGCGGC CGTCCCTACG GTGTACAAGC CTTGATTGTG GGTACGTCTC CAACGCGTGA CGCGCTGCAA ATGTATACGG TGGATCCGAG TGGAGGCTTT CGGCACTGGG GAACCGGAAC CGCCATTGGT CGCGGTGCGG CGTTGGTTCG TAAGCACGTG TACCGACAAA ACATCACAAA TCAAACTCCT CCGACCTGTG CGAAAGAAGC TTTGGAAGTC GCCCTCCGAG CTTCGTTTTC CGCGAACAAG GAGCTTTTCG ATGCCAACGC CGATGACCCG TACCAAGCAT TGTTGGTCTG GACTGACGAG CAGGGTCGAT TTCGTGTGGG ATCGATTGAC GAAAGCGTAA TTGACGAATT TCGTGAACGT ATTTCGATAG ACAACTCGGG ATCGAGTGCA TAAAAACGGA ACCCGGTCGC GGAGACTGTT TCCTTAGTGT CGAAGCTACT TGGGGACTAC AGGTTCCGAA ACGCTAGCCG TTATGGCTGG TGTAGACCCC CGTCGCAATC CTTATTTTAT GTGTAGAGAG CAGTGTTTAA ATACATCGAC AACCAATTTT CGTT
|
Protein sequence | MNVPFFRDLK SDLRLEQNPF VVHRVNILCQ LVFAAILQKR VLLANVMARF RTQKRDFRQM GARGENGEYS VRGRYSQSGP MGFSLGLTII LAWIALGWPS VAAKSASNIS PIRNFEPSLA ASSAQGTTIS VSYPRDGSRQ QTGIVVVFRS PYKGGSVRKK SAKTDWTLID GIRVRPTSSD TATNSESSHR WTLLSDALCC MTGLASDVDY LSRCIQKQVD THRVVYEGTR VFSALQLEAT KGSSGGRPYG VQALIVGTSP TRDALQMYTV DPSGGFRHWG TGTAIGRGAA LVRKHVYRQN ITNQTPPTCA KEALEVALRA SFSANKELFD ANADDPYQAL LVWTDEQGRF RVGSIDESVI DEFRERISID NSGSSA
|
| |