Gene Information |
Locus tag | PHATRDRAFT_45022 |
Symbol | |
ID | 7199531 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 1029212 |
End bp | 1031163 |
Gene Length | 1952 bp |
Protein Length | 513 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178898 |
Protein GI | 219116206 |
COG category | |
COG ID | |
TIGRFAM ID | |
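The Gene Length value above can be checked against the Start bp and End bp fields. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for this kind of record, not stated in the record itself) and that Protein Length excludes the stop codon. The function names `feature_length` and `cds_length_bp` are hypothetical helpers introduced only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def cds_length_bp(protein_length_aa: int) -> int:
    """Coding length in bp, assuming one stop codon not counted in the protein length."""
    return (protein_length_aa + 1) * 3


# Values copied from the Gene Information fields of this record.
gene_length = feature_length(1029212, 1031163)
coding_length = cds_length_bp(513)

print(gene_length)    # 1952, matching the Gene Length field
print(coding_length)  # 1542, shorter than the gene length
```

Under these assumptions the coordinates reproduce the listed 1952 bp, and the coding portion implied by the 513 aa protein fits within that span.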
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000948434 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTTTGTTTCC GGACAATCGA AACGATCGTA TCCACTTGGA ACCAACACAC ACACACAAAG TAGGTCCCGT GAGTCCTTCC TTCGAACAAC AAACAAACAA ACAAACAGCT ACGTGAACGC AACTTCTAGA GGCAATCTTG TCTTCGAGAA GCAGAATTTC TCCCGAGAAC AAACCCGGCA CTTTATTCCA TCATGTCGGG TGAAAGCTAT CCTTCGGAAT CTCCGGAAAT TCCGACATCG CGTCCAGCAG ATTCCACCCA CAAAGCGAGA AAATCCGACG GCCGTAAGGC TGATGCGGAC CGGAATCCTG TTCATTCCAA GCAAAGCGAT CCCGCACCGC ACGAAACGGA CATGCTCGCG GACCCCGCAA CGACCCTCTC CCCCTTCCCC ACGGATCAAA CCTCGTCTCC GGCTTGGCAC TACCAGCATC ATCCGCCACC ACCACACGCC TACTACTATC CGCCCTCGCC GTACGGCTAC ACCGGGGTAC CGCCGTACCC GGGAGGACCT CCACCGCCCT ACGGACCATC CTATCCACCC CCCTTCTTTC CAACCGCACC GGGAGCCTAT TACCCACCTC CCCCGGGATA CGCACCTCCC GGCTCCCCGT CACCGTCCCG CAGTGATGGT CACATGAACG AGCACGGGCA TGCGTCTTCG CCGGGACGCG GCGAAGCCGG AGGCTACGGG ATGCCCTACC CCCCACCGCC CTACGATCCC TACCAACCTC CGCCCTACGG CTCGGCCTAC CCGATCGGGG CGCCGCCCCT TAGTCCACGC CATCCCGTCT ATTCCAATGG ACTACCGCCA CACAGTCCGG ACGCCAACGG GGTTTCCTCG AAGCATCCGC ATTCGTCGAC GAATCCCCCG CCCTTGGAAG ACCTCGACAC GGCCTCCGAT TCCGGCAAAA CATCGGCCGT CGAACCCGCG GTGTCAGCAA ATGAACTCCG GAAACTCAAG ACCTACATTC GGCCACCGGC ACCGTCCAAT CCCGAAGTGG TCGCCCGTCG CCAACACAAG AACTCTCAGA GTCGTCGCCG CGCCGCTGTT CTGAGGGACC GGGTTGCCGC CGTCGCCGCC ATGGATGCTA CCAAACGTAC CGAAGAAGAT CAACACATGT TGCACCTACA CGAGACGCGT CGGGAACGGA AAAATAATAG GTCACGAGAA CGCGCTCTCG AACGCAAGGA AGAGATGGAT CGTATCCTTG GCAAGAAAAT ACGACAACGT ACACGACTCG AAGTGCAGTT TTTGAATAAC ACAATGTCCA AGAAGCAGCG AAAGAATGAA GGCGATCGTT TACGTCGGGC TCGACTCAAG GCGCTCGGAT TGGACGCACG TAACGGGGCC GCCAAAAAGC CCGGTGTTCC GGCGCGTGGA CCACTCCCTC CGCATCTCTT GGATCCACAA CGGCAGCATT TACCCTTGGA TCCGCAACAA CAGCATCATC ATCACAACCG ACCGCCACCG TCGCCGCGTT TCATGCCTCC CGCACTAGCG TATCACCAAC ACGCTTCACC GCAGCCGCAA CCGTACTACA CACACCGGTT CACCATTCCC GCACAGCACC CAAGTGTTGG AGCTCCCTGG CACCCGACCG CACCCACAAC GTACGCGCAC CAGGAATTTG CCAAACACGC CGAAGGCAAC AGCGTTGCCA AATCCGAGGC GGTTGTGGAC GCCAATACCG TAACGCCAGC GTCGCCAAAC GAAGTCGACG GTGTTTCCGT TTGAATAGAC GGTGAGAAGC CGACGACGAT CTGCATAGTG AGGAATGCAA CTACGGCCGT AGGAAGGAAT 
CAGGAATGGT ATATTTCGTA GTTTTATGTA AGAAAAATGG CTCGCGGTGG AGCCTGGGCA ATTGCACATG TGAAAGTCTA GTCATGGACC TGTGGGAGAG CTGTGTTGAT GGGGTGGGAA AGGAGTCTCG GAAAGAGAAG ATGACCAAGG CT
Protein sequence | MSGESYPSES PEIPTSRPAD STHKARKSDG RKADADRNPV HSKQSDPAPH ETDMLADPAT TLSPFPTDQT SSPAWHYQHH PPPPHAYYYP PSPYGYTGVP PYPGGPPPPY GPSYPPPFFP TAPGAYYPPP PGYAPPGSPS PSRSDGHMNE HGHASSPGRG EAGGYGMPYP PPPYDPYQPP PYGSAYPIGA PPLSPRHPVY SNGLPPHSPD ANGVSSKHPH SSTNPPPLED LDTASDSGKT SAVEPAVSAN ELRKLKTYIR PPAPSNPEVV ARRQHKNSQS RRRAAVLRDR VAAVAAMDAT KRTEEDQHML HLHETRRERK NNRSRERALE RKEEMDRILG KKIRQRTRLE VQFLNNTMSK KQRKNEGDRL RRARLKALGL DARNGAAKKP GVPARGPLPP HLLDPQRQHL PLDPQQQHHH HNRPPPSPRF MPPALAYHQH ASPQPQPYYT HRFTIPAQHP SVGAPWHPTA PTTYAHQEFA KHAEGNSVAK SEAVVDANTV TPASPNEVDG VSV