Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36377 |
Symbol | |
ID | 7201534 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 308217 |
End bp | 309477 |
Gene Length | 1261 bp |
Protein Length | 413 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180799 |
Protein GI | 219120106 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGATTG CGAGCTTCGT CTATTGTATT TCCACCGCGG CTGCCCTATC TACGATACCA GTAGTGACCA AAAAACCATC CAGATTCCTA CTTCTTGGCG GTGCCGGTCG TATAGGTACG GCCGCTGCGT CACACCTTTT ATTACGTGAC CCATCTTCCC AAATCATTCT CGTTGGACGC TCCAATGATG GAAGCCGAGC CGTTGAGGAA GTTCGGATGG ATCATCCAAA CGCGACCGTG TCGTATGAGC AAGTAGCCGA TATCTGGGAA GGTGAAGGTC CATCGGTAGA CAGGCTAAAG AGTTTGATGC GAGAATCGGA CTGCATCATA CATACCGCAG GTCCCTACCT ACACCGAAAA CCGACCCCAA TGAAACTAGC AATTGAATCG CGATGCCAAG TTTATGTGGA CGTATCTGAT CCACTTCCGT ACTTGGAAAC TGCCTGTCTC ATGAATCACA CGAGCGTCAC TACAACCTGT CTCCTTTCGG CTGGTGCCTT TCCCGGTATG TCCAATGTCA TGGCTATGGA GGCAGCATCG TACTTGGGCG GAGAGAGTGT ACACGATGTC CGATTTCAAT ACTTTACGGC AGGGCTAGGG GGATCGGGTC CACTCAATCT ATACATTACG AATCTTGGTT TCGGAGAACC AATGGTGCAG TACGACGGCG GACAGCTGCG CTTTTTTACG GCCCTTTCGG GCTCGTTACT AGGAAAGGTC AATTTCTTCT TGAACAATGC TTCTCGGTCC ATTGGCACTA GCGGATTCGG TAATGAACAG GCTCGCCAAC GCGTTGGTTC CCAACCCGTG TTCGCCTGGC CTTTTCCCGA AGCCGCAACG GTTGCGACAG AGCTACGTGC CCGTGGTGGT TCTACAGCCG CAATGGGCAC CGCTCCCGGT ATATGGAACA CAGTGTTGGC GATCCTTGTC AAGCTCATCC CTCGACCATG GTGGAGAAAC GAAACATTTT CGAAGTTTCT CGCCGACTTT TCCGAGCCCA TGGTCTGGGC AACGGACAAA ATCCTTCGAG CCAGCGACCC GGCCGGAGTC GGCGAGACAC ACGCCATGCG AGTCGACGTA AGCGGACGCA GAGGTCCCCA TATTTCTATT GTACAAGCGC ACGATTCGTT CCGTCAGTGT GTCGGGCAAT CTTGTGCCGA GTTCGCTTTG GACTGCTTGC GGTACCCCGC CGTGGGAGTA GAGCTACCCG AGCGTCGATA TCGTGACCCC ATTGCTCGTG CTCGCATAAT TGGTAGACTG A
|
Protein sequence | MRIASFVYCI STAAALSTIP VVTKKPSRFL LLGGAGRIGT AAASHLLLRD PSSQIILVGR SNDGSRAVEE VRMDHPNATV SYEQVADIWE GEGPSVDRLK SLMRESDCII HTAGPYLHRK PTPMKLAIES RCQVYVDVSD PLPYLETACL MNHTSVTTTC LLSAGAFPGM SNVMAMEAAS YLGGESVHDV RFQYFTAGLG GSGPLNLYIT NLGFGEPMVQ YDGGQLRFFT ALSGSLLGKV NFFLNNASRS IGTSGFGNEQ ARQRVGSQPV FAWPFPEAAT VATELRARGG STAAMGTAPG IWNTVLAILV KLIPRPWWRN ETFSKFLADF SEPMVWATDK ILRASDPAGV GETHAMRVDV SGRRGPHISI VQAHDSFRQC VGQSCAEFAL DCLRYPAVGV ELPERRYRDP IAH
|
| |