Gene Information |
Locus tag | PHATRDRAFT_43435 |
Symbol | |
ID | 7197154 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 451921 |
End bp | 453660 |
Gene Length | 1740 bp |
Protein Length | 456 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177620 |
Protein GI | 219111737 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGCTA CAGGCGGGGT CTTTGTTTAC AGTCAAAGTC AGCAAATCCT ACTTGTTCAA ATATTTAAAT ATTGCAGGGC TTCTGACCTC ACAAACGCGA TCGGCGAGCG ACACTCCTGA TGCCATCAAA ATTTATCCTA GAACCAAGTA CGGCTCCCGT CGTTGATTAG CCCAAGAGAT CAAAAACAAA GTACGTCTAG CTTTGTTTGG TTTGGCAATA CAGCACAAAT GTTGACATCA TATCGAAGAA GGATATTTCT TTTCGTCGTA TTTTGCTTCG CGATCCAAGC TCATGGATCG TTGGAGCATA ACTCCGCCAA TGATTTTCTT GCCCAAGGCA ACGACGCTCT CTGACAAGGA GAAAACAACA AGGCAATATC CATATATCGA AACGGCGTTT CGTTTCTCAG TGGGGATGCC GCATCGCTTA CGACGGTAGT CAGCTTGTAT ACGAATCTTG GAACCTCGCT CTCCGCGGAA GGCTTGGACG ATGAGGCTGC CGAAAACTAC GAGCAGGCTC TCGCCCACTA TAGGGAGAAA ATTGACAGGC TTGAATATGT AGAAGCGCAA CAAGAAGTAA CCACAATTGC TGCACAAGCT GCTTTTTTTC TCGGCATGGT CTATCAGGAC TTGCATCGAC CCAACGATGC CGCCGAGGCG TACAGTCTAG CACATTCATT GGATCCTCGG CACTGGTCGG CGGCGGCTAA TCTCGGGGCG TTGCTCCACG ATAACATGGC CAATCACGCA AAAGCGTTGC AAGCCTACAA TCTGGCGTAC GATATTCTAA CTAACCGGGA GGAAGAACCA ACTGACCCGC CCCTGGAGCC TCGTTTCATT CTCAGTCAAC TCCAGTACCG CATCGGTCTC TGCATTACGC ACGATAGTAC TCAGAAGTGC GCAAACGTGG ACAACCCGGG AACTCCGATC GATTGCCAAG AGTTTGCCGC ACACGCGTTT GCGTTGGCAG TCGAGTACGA CGAGGACAAC GAATCGGCCA AGCATATGCT GGCGACCGTG ACCGCCGACG CAACCATGAA ACGGGCCTCC AATACCTATA TTAAATCATT GTTTGACGAT TACGCGTCCA ACTTTGAGCA TTCTCTGGTG CAGGACTTGG GCTACACGGG ATACGAGCGG TTGCGACGAG GGTTTGATCA AGCCTTAGAA CAAGATGGCA AATCGGGACT AGTAATGTTC GCTACCGTGG TAGATGCGGG TTGTGGGACG GGCTTGGTCG GTGAGCAGTT TCGGAACGTA AGTCAACACT TGACCGGGGT CGATTTAAGC CAAGCCATTC TAGACGAAGC AGTAAAGGCG CGTCCCAACC TTTACGACAA AGTGATTGTC GGCGACGTTA CGACCGTATT CCGCGAACGC CAGCCAATTT CTCTGATTAT TGCCGCCGAC TCGTACACTT ATTTTGGAGA TTTGGAGCCC CTGTTTGAGG CTATGCAAGT AGGTTTGGAA ACCGGGGGTT ACGCGGCTTT CACATTGGAA GATGTTGACG AAGCTACGGA AGCGGCTCTG GAAGCGACGA AACCTGATTG GCGATGGCAA TTGACGGCTT CGGGTCGTTT TGCTCATCGC AAGGGATATG TACAACTTAC TGCCAAAAAA TACGGCCTGA AACTGATACA CTACGAGCAA TTGGTGAACT TTCGGTACGA GCGCGGTGTC GGTGTCCGTG GGCACCTTTT CGTACTGCGT CAAAGTGCGG ATCATAGAGA AGAGTTATAG
Protein sequence | MPATGGVFVY SQSQQILLPK RSKTNLYTNL GTSLSAEGLD DEAAENYEQA LAHYREKIDR LEYVEAQQEV TTIAAQAAFF LGMVYQDLHR PNDAAEAYSL AHSLDPRHWS AAANLGALLH DNMANHAKAL QAYNLAYDIL TNREEEPTDP PLEPRFILSQ LQYRIGLCIT HDSTQKCANV DNPGTPIDCQ EFAAHAFALA VEYDEDNESA KHMLATVTAD ATMKRASNTY IKSLFDDYAS NFEHSLVQDL GYTGYERLRR GFDQALEQDG KSGLVMFATV VDAGCGTGLV GEQFRNVSQH LTGVDLSQAI LDEAVKARPN LYDKVIVGDV TTVFRERQPI SLIIAADSYT YFGDLEPLFE AMQVGLETGG YAAFTLEDVD EATEAALEAT KPDWRWQLTA SGRFAHRKGY VQLTAKKYGL KLIHYEQLVN FRYERGVGVR GHLFVLRQSA DHREEL