Gene Information |
Locus tag | PHATRDRAFT_42820 |
Symbol | |
ID | 7196481 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1224325 |
End bp | 1225859 |
Gene Length | 1535 bp |
Protein Length | 419 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177244 |
Protein GI | 219110985 |
COG category | |
COG ID | |
TIGRFAM ID | |
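The coordinates and lengths in the record above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names `gene_length` and `coding_bp` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_bp(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumed)."""
    return (protein_aa + 1) * 3


# Values copied from the Gene Information fields above.
length = gene_length(1224325, 1225859)
cds = coding_bp(419)

print(length)        # gene span in bp
print(cds)           # bp accounted for by the coding sequence
print(length - cds)  # remaining bp (introns and/or untranslated regions)
```

Under these assumptions the span matches the listed gene length of 1535 bp, and the coding sequence accounts for fewer bases than the span, which is consistent with the gene being marked as spliced.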
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.749474 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | GCTAAGGTGT GTGACGATTA GCACAATATG GAGCACGATA CGGGAGGTTC TTTTCTTTCC CCAGCGGAGC GAACGATGCT TCTAAAAGAA ATCGAAGTTT TCGAGAAGCT GGAAGCATCA CTCAAATCAA CGAAACGCGG GAATCTTCGC AAGCCACGTT CTGTACACGC AGGGTCAAAT AAGCCCAGGA CGAGAGATTC GGGTTTTCGA GAAGCTCCGC GCAGCACTGG TTGTCTCGTT TGTGGTATCG ATAGGGATCA CACGAATATT CTTTTGTGTG AAGGCTGCAA TGGTGAATAT CATACATACT GTCTTTTGCC GCCCTTGAAA TCAATTCCGC AAGATGACTG GTTTTGTGGT GAGCTCGACG TACTAAATAA TGTTCATTCA GCTCATCGTT TTGTCCATCA AGAAAGCTCA CTCGAATCTC ATTGATTTCT ACAGACAACT GTCTTCCAGA TGACGGAGAT GGGCTCGAGC AGCTCGTAAG CTCATTGCCG CCGAATTTTA CTACCAGATT CGCTGAGATA TGCTGGGCTC AAGGAGGCAA TGGCTACGGC TGGTGGCCTT GCTGCATCTA CGACCCAAGA TTAACTGTTG GAGATGCTCG AGTCCTCGCT CGCAAAAACT TGGGCAAGAA GCACTTGGTA TATTTCTTTC AATGCGAGGA GGCCCCTTTT GCGGTGCTTC CTACGACAAA AATTCAAGGC TGGACGGAAG GCTTGGTTGA TAGCTTTTAT ATGGGAAAGG CGGCGAAGGC TGCAGGGAAA TGCCGTTATA TCCAATTTCG AAAGGCTTTC CAAGCCGCAA TAATTGAAGA GAGCAAACCT CTGACACAAA GATTAGAATG GAACCAGCTT GGATTTCCGC CCCAAGCTTC GTTAGCAGCA AGACCGGCAT CGCCGCAGAA GACTCCGGTG AACGCGAAAA AGCGTCCAAC GGACTGTCTC ATTGAAGGTA GCAGAACTAA GCGTGCAAAA AGTAGTGTCA GTCTCGACTT GGCACGAAAT GATATAATGG AGAAGCGGGA GCCTGCACGT TACGTCGAAA CTTCGGAAAG CGGTACTGAA ATGTTTTGTA AAGTTAAACG AAAGTTATTG GGGGCCGTAG ATAAAGTGGA GATTGGATTT GTGTTGTTAC CGTGTCGATT CACTTCGACC TTCGCGGACG TGCGGAGAGC CATTTCTATT GATCTTGACG AAGAATTGCC TTCAAATTGG GATTGGAGAT TTTACGTGCC ACCCCTTGGA CCGCTGAGCA TAAAACAAGA ATCCAGATTT GGTGCAATGC TATCTTTCTT GCGGAAAGCA GCACCCCATT CGGATATCGG AGAAGGGAGC CTACAAAAAC CCGCACAAGT CGTGCTGGTC GATGCGCCTC AAAACTAATT TGCAACAAAC ATCGATTGTC GGCTTGAAGT AAATTGCTGA ACCGAAAGCT CCCTTACTCA CGCTAATTTT CGAGTGGAGA AACACTAAAG GCGACCGTCA TGGAGTGGAC TGTTCGTAAG GTTAAAAAGT GCCATTTGAC TTTTA
Protein sequence | MEHDTGGSFL SPAERTMLLK EIEVFEKLEA SLKSTKRGNL RKPRSVHAGS NKPRTRDSGF REAPRSTGCL VCGIDRDHTN ILLCEGCNGE YHTYCLLPPL KSIPQDDWFC DDGDGLEQLV SSLPPNFTTR FAEICWAQGG NGYGWWPCCI YDPRLTVGDA RVLARKNLGK KHLVYFFQCE EAPFAVLPTT KIQGWTEGLV DSFYMGKAAK AAGKCRYIQF RKAFQAAIIE ESKPLTQRLE WNQLGFPPQA SLAARPASPQ KTPVNAKKRP TDCLIEGSRT KRAKSSVSLD LARNDIMEKR EPARYVETSE SGTEMFCKVK RKLLGAVDKV EIGFVLLPCR FTSTFADVRR AISIDLDEEL PSNWDWRFYV PPLGPLSIKQ ESRFGAMLSF LRKAAPHSDI GEGSLQKPAQ VVLVDAPQN