Gene Information |
Locus tag | PHATRDRAFT_49130 |
Symbol | |
ID | 7195508 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 2167 |
End bp | 3957 |
Gene Length | 1791 bp |
Protein Length | 472 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183790 |
Protein GI | 219127121 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTTCCGGTAA ATTATGAGTA TGTTTCTTGG AGAAAACGAA GGAAACCCAT GTATGCTACA GAAGGGCGCG ATGTCGACCA ATTCGAATTT TCTCAGCTCC TCTTTTCATG TCCGGTTCCT TCACGCTTTA TCGAACAGAA TAACTTCAAC GTGATTAATC CACTGTTTTG GATCGACTTG GTACCAATAC GAGCGCCTGC CCGACGCGGT CCCATGCTAA TGACCAAAGA TCAGGTAGGC CCGCAAGAAT TTCAGCACCT GGATCGTTTC AGTATCTTGG AACACTACGG AAACGCGCAC ATTCTCCCTG CCATCGAAGA TTCGGGTAGA ATCGTTAACC TTCCTGTTTG TCCGACAACA CCGAAGCCAC AGCATCATCC TTCATACCAA TTGGTGGCGT GCACTTGGAC ATCTGCTTCT TATAATCGAC GGGGGGACAC AACGACTGTG GAAGACTCTG CTGCCCGCCT GGAAGAATGG ATTGTCTTTC ATCGGACTGT CGGATTTGAT CACATTTATA TTTACGACAA TACACAGGTT CCACAGAACT CTTCGGAATC CGTTCTATTC AAGATAGCAT CACAGTTTCC GAGCTTTGTC ACATATCATT CATGGCCGGC AAAATCGTGC AGCAACAATC GACCAAATCA CAAAAATCCT GGCGAGCGTT CCTCCCAATA TGCAGCCGAG GCTTCATGCC GTGAACGTTA CGGTCCGACT GCATCCTGGA TGGCATTTAT TGATACAGAT GAGTATTTGG CACCGATGGG GAACAAAACT TGGTTACCTC TGCTGGATAA AATGGACGCA AAGGACATCA AAGTTCTAAA ACTGAAGAGC AGTCGCGGCC GTCCCCGAGA AAGTTTGATG CAACTCTTGG ATGATCCTAA CGAATGCAAT AGCCAATCAC GGCTGAGCTC TTTTTCAAAG AACGAGTGCT TGATCCCGAG GAAGAATGAA ACTTTTCTAC GGGTATACAA TTGTGACTTC ATCCGGCCTC CACGCCCCGT TCGCTTTGCC CGCGCGATGA AGCAAATTTA CAAGCCCAAC TTTGTTCTTA GCCACTTTGT ACATTACTCT ACGATTACAG CAAGCATGTC ACGATACTAC AAGGATTTTA AGGCTAGAGA CCTATACACA CGTGAGCTCA ATGAAGGGGA CTGGGGTGAT ATTTTTCTTG ACGAGCGAAC GGAGGGAACA CTTATCCACG CAAAGTCGGT GCTTCCACAC GAGACAATGG CGCGAAAGGA CTCATGTCAG ATTGCGTCAA AGCGACCTTG TGTGATTGGT CATGTCTGTC CCAATACGAC GCCTTTTGTA GACGCTATTC ATCAAAAGAA CGTATTCCGA GATGCTGATG GAAATTTTTG CAACTGCTGG TAAGTGTTGT CGAGTCCAGA TGCTTGCCGC TAGGGCAGTG TTTAACTTTT ACCTTCTCGT CGTCGCTAGG ATCAACGAGC ATGTGGAGAA AACTTTGATT CCAAATCTTG AAAAGGCTCT CCGGGAGCAC AAAAGAAATT CGTTCATGGC CGACTAACAC TAAGGCTTAT TGCTGTAGTG GATGCAGCAA CTCAAGGACA CAACGTGATT CCGTTGGAGA ATGTGCCGGC TAGTCTCACG TGCCGAAGCG CGAGATATGT AAAGAGACTA TTGACTTTCC TGTCGGTAAG AAGTTGCATT CATGCAAGCG GACCGAAAGA TACGGACCTC GAATGTGAAC GAAAGGTTTG TCACTGTCGT TTCGAGCGTC TCTCTACTAG ATAGAGAATT GGTAGGAAAA AGCTAGTGTC T
|
Protein sequence | MYATEGRDVD QFEFSQLLFS CPVPSRFIEQ NNFNVINPLF WIDLVPIRAP ARRGPMLMTK DQVGPQEFQH LDRFSILEHY GNAHILPAIE DSGRIVNLPV CPTTPKPQHH PSYQLVACTW TSASYNRRGD TTTVEDSAAR LEEWIVFHRT VGFDHIYIYD NTQVPQNSSE SVLFKIASQF PSFVTYHSWP AKSCSNNRPN HKNPGERSSQ YAAEASCRER YGPTASWMAF IDTDEYLAPM GNKTWLPLLD KMDAKDIKVL KLKSSRGRPR ESLMQLLDDP NECNSQSRLS SFSKNECLIP RKNETFLRVY NCDFIRPPRP VRFARAMKQI YKPNFVLSHF VHYSTITASM SRYYKDFKAR DLYTRELNEG DWGDIFLDER TEGTLIHAKS VLPHETMARK DSCQIASKRP CVIGHVCPNT TPFVDAIHQK NVFRDADGNF CNCWINEHVE KTLIPNLEKA LREHKRNSFM AD
|
| |
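The coordinate fields in this record agree with one another if positions are read as 1-based and inclusive: 3957 - 2167 + 1 = 1791 bp, matching the listed gene length. The minimal Python sketch below checks that relationship and compares the gene span with the minimum coding length implied by the 472 aa protein. The function names are hypothetical, and both the inclusive-coordinate convention and the single stop codon are assumptions rather than statements made by the record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, stop_codons: int = 1) -> int:
    """Bases needed to encode the protein, plus an assumed number of stop codons."""
    return 3 * (protein_aa + stop_codons)


# Values copied from the record above.
gene_bp = span_length(2167, 3957)   # Start bp / End bp
cds_bp = coding_length(472)         # Protein Length in aa

# Bases inside the gene span that are not accounted for by translation.
untranslated_bp = gene_bp - cds_bp

print(gene_bp, cds_bp, untranslated_bp)
```

A positive difference between the span and the coding length is what one would expect for a gene flagged as spliced, although this arithmetic alone cannot say how those bases divide between introns and untranslated flanking sequence.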