Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44959 |
Symbol | |
ID | 7199486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 796899 |
End bp | 798629 |
Gene Length | 1731 bp |
Protein Length | 500 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178845 |
Protein GI | 219116100 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00935633 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGTAGTCGAC CACAACCGAG AATCAGCACA ATCCTCCATT CCAAAGAAGC AAACGGACCG ACACAACTCG AGTCGTTCCG TCCCATCACT GCGGTCGCCT ACCTCTAGTA ATTGATTGAC TATGAGGACA ATTCCTTTTC AGTTCTGGAC GCGAATGCTG CTGGCGACGG CGTTGTGCCA GCATCCCTCC GTGTCGTCGC TGGTGCACGC GGCAGCGGAC GACACTGGCA CGGTGGGTCT ATCAGCGGGT AAACTGCGAT CCAACGCGGA AGAAGCCATG GCGGTGGGGG ACTACACCAC CGCTGTGCAG TATTTGCAGG AGGCGATCAC GTTGGAACCG GAAAGTGCGG TGAATCACTA CAAACTGTAC CGGATTCGAC ACCGAAAACG TCACTACCTC GAAGCCCTCA GGGATATTTC CCAAGCCGTT GAGTTGGAAT CGTCGGCTTC TTATCGCAAG CTCAAGGCCA AACTCTTGGT AACGTTGGGA CAATGTGATC GGGCCGTGGC GGAACTGGAT TTGTTGGCAC CCAACGATCA GGACAATGCT CAGTATGAAA CAGCCAAGAT GTGCCACGAA ACAATACAAC TGGCGGAGTA CCATTTTCTC AATCAAGAGT ACGAGCTTGC TGCAGAATAT TTTCAGCAGG CCATGTCGTT TGTTGAGATT GCATCAGATC TTGTTTGGCC CAAAGCCAAG TCCCTGTTCG AAACGGGGGA CTACTACGGC GTTATCTCCG ATACCGGTAT GTTGTTGAAA CAGCACCCGC ATCACGTCGA AGCGTACTGT TTGAGAGGGT CTGCATATCA TCGCCTGGGC GAACACGATC AAGCGGTGCT ACATTTTCGG GAAGGACTCA AGTTGGATCC CGAGCAGGCG GACTGTAAAA AGGGACACAA GAGTGTCAAA GCGCTCGAAA AGAAGAAAGC GAAGGGCGAC GAAGCCTACG CTGCCGGTGA TTTTGAAAGC GCATCGGGGC ACTACGAAAG GGCTATGATG CTGGATCCGA CTCACCATGC CTTCAATCGT CCCGTCCAAC TCCAACTCGT ACAAACATAT TCCAAACTAG GCCAACACAA AAAGGCCATG GACACAGCAC AGAAGTATGT GGAAGAGCTA GAGTCACTAG AGGGACTCTG GGCTCTGGCC AACGCCCAAC AAGCTGCAGA CAGCTACGAA GATGCCGTGC GTACATTTCA GAGGGCAGTC GAGGTTGCCC CAGATGGTAG CGAGCAGGAA CGGGAAGCGA ATCAAAAATT GAAGAACGCT CAAGTTGCGT TGAAGCAAAG TAAAGAGAAG AACTACTATA AAATATTGGG TGTGTCCCGA TCAGCCACAG CAAAGGAAAT TAAATCAGCC TATCGCAAAC TCGCACTCAA GTACCACCCG GATAAAGTTT CGGATGAAGA AAAGGAAGGT GCCGATTCCA AGTTTGCCGA CATCGGCGAG GCCTACGAAG TCTTGTCGGA TCAAGAATTA CGCACCAAGT ATGATCGGGG CGAGCAGGTT TTTGAAAATC AAGGAGGCGG TCCGCGGCAT CAAAATCCGT TTCAGTTCTA TCAACAGCAG TTCCAACAAG GTGGCGGTGG TGGTGGACCA CGAGTGCACT ACCGCTTCAA CTAGATGGCC ATTTCTCGCA AAAATAGCGA GCGATGACAG TAGCTAATCC TCAAATCGAA CCACTATAGC TTAATCCAGA GCTCCTCTTG TAGGCGCTTC TTGTTAGACA G
|
Protein sequence | MRTIPFQFWT RMLLATALCQ HPSVSSLVHA AADDTGTVGL SAGKLRSNAE EAMAVGDYTT AVQYLQEAIT LEPESAVNHY KLYRIRHRKR HYLEALRDIS QAVELESSAS YRKLKAKLLV TLGQCDRAVA ELDLLAPNDQ DNAQYETAKM CHETIQLAEY HFLNQEYELA AEYFQQAMSF VEIASDLVWP KAKSLFETGD YYGVISDTGM LLKQHPHHVE AYCLRGSAYH RLGEHDQAVL HFREGLKLDP EQADCKKGHK SVKALEKKKA KGDEAYAAGD FESASGHYER AMMLDPTHHA FNRPVQLQLV QTYSKLGQHK KAMDTAQKYV EELESLEGLW ALANAQQAAD SYEDAVRTFQ RAVEVAPDGS EQEREANQKL KNAQVALKQS KEKNYYKILG VSRSATAKEI KSAYRKLALK YHPDKVSDEE KEGADSKFAD IGEAYEVLSD QELRTKYDRG EQVFENQGGG PRHQNPFQFY QQQFQQGGGG GGPRVHYRFN
|
| |