Gene Information |
Locus tag | PHATRDRAFT_49763 |
Symbol | |
ID | 7198346 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 201211 |
End bp | 203325 |
Gene Length | 2115 bp |
Protein Length | 594 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184505 |
Protein GI | 219128617 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAGCACGA TTCCCACATA TATTCCATCG GAGAGGAATT TTGAATTGAC TCGATGCCCC AGAAACGCTG TCTTTCTTGG CTTGGTTTGA ATCCCCGGAA AACATTCATG GAACCGCTTG CGTAACGAAA ACCTTCTTGT CATTTGTGCT GGCATTCCTG ACGCTGATCA TCGGCCCCAC AATGCAGACT GGTCTTTCAG CTATAACTCC TTTCCGGTGT GATCATGGAG CGTTGGTGCG ACCATTCCAC TTACGGTTTG GAGGAGGACA ATGGAAATCG ATCCAGTACA CTAGAAGGTG GTGAATCGGA ACGACCCAGC ATATCAGAGG GTGCAGAATC GGAACATTTT CCGTTCTTTC ACGATACCTT GACTTTTGCA AGAATCACGG AGGGGATGGG ACATATATTA CCTTACGCTT CCCAATCTGG ACAGGTAGCA GGAGAAGAAC AAAGCGCGTA TTCTCTATCC CCATACGTCC CGTATCCTCC ACGTAATTAC GTACGGAGAG ATTGCGAGGA AGACGACGTC GGGAGAAGAC CTAGGGTTCG ACGATGCCTG GATTTAAACA AAGGAGTGAA CCTCACTAGT CCCAAGCGCT TCAGTATTTC GCAAGACCCA TTTCTATCAA AAAATCATCC ATCGTCTCTT TCGGACGGCG AAGAGACAAG TCAGCATTCG AGAAAAGCAG CAGGCATTGC TTTGGAAGAG GACGCGGTAG GAGATATCTT TTCGAGCCGC AGCATTCGTG CTCAATACGG GTCTGATCCG ATGCAAATGA AAGTTCGATC CCCCCGATAT CTTCCAGAGC CGAAATCTCC ACTGGTACAC TCACAGCATT CAACTCCGTC ATCACATCGT CATCCTGGAA GTCACCTCTA CCAAAGCCCA GGTTTTTCAG GGAACTTTGC ATACTCTTCC TATCAAATGG CCTCTCCCCA CAGCTATTTC CAGAGACGAT ATCCCATGGT AGCTTCGCAT CCTTTCAGTG ACAACACAGG CACGCCAGGT ATTTTCATGT CCCAATTGCC ATGTCCTGTT TACTCGCCAC CGCAATACCC TCCACAACCA TTTATACCCC AACAATCTGT TCACAGCGAC ACTCCTGTAG ATGCCAATAG GATGGATAAT TCTGGTTGCA CGGCCACGGG ATCATCTCCA AATAGGACTC TTCCTTTCCC AAACCCGAGC TTGAGCCTAG CTATTCCACC TTCCCCTATA CGATTGCGCC AGAAACCGCC CGGTAAATTG AATCCGGTGC GTCGATCCGC CCGTTCAGAA TCCAAAATCC GGGAAGTGCG GACCCGGACG AGCTCTGGCG AATCGACGTC GCAAGCTGCG GGCTTGTGCG CAGCGGAAGT GGCGACGGCA GGTAGTGTGC GGGCGAAGGC AGCTATTGTG ACATGGTACG ATCGACTGGA CGATTTACGT CGGTTCCGGA AAGAGTTTGG CGATTGCAAC GTGCCTCAAA AATATGAACC AAATCGAGCT CTGGGTATTT GGTAAGTCTT TCGCAATAAG GACAGATGCA TGGTTGTTTT CCTTGTGCTT TCAGCTACGG CCTTTAACAT CTCTATTGAT GCTGCGTACG TTTCTGTGTG TTTACAGGGT CAACAAGCAG CGGATGGAAA AGAAGAAGCT CGACAGGGGC GAACGATCAT CCATGACTAC GGAACGACTG CAGGCGCTGC AGAGCGTCGG GTTTCAGTGG GCCAAGCTTA AGGGCGATGT TTCTTGGAAC CAGAAGTACA CAGAATTGCT GGAATACAGA TCCGTGTTCG GTGACTGTAA CGTGCCTACC 
AAGTACCGCA CCAATCCGGC GCTGGGACGC TGGGTTTCGA CTCAGCGATC GCAATTCAAA GAATTCCAGG CCGGTCTGGT AACGCATATA ACTGATCAGC GAATTTCCCA CCTGGAGAAG ATAGGCTTTC GGTGGAGCAT GATGGAGGAG GAGGAAGAGA ACAACTGCAC AAATGAAAAT TCGCTGAGGG ATGGTAGCGA AGCAGATGCC ATCTTTTCAA GGTCAATGCG AGTGGAGAAA GTAAAGCGTT GGCAATACGA CAAATCCAGA ACAAGTACCC GCCACTCTTC CATCAATCGG GTTACTAGTG TGTGA
|
Protein sequence | MERWCDHSTY GLEEDNGNRS STLEGGESER PSISEGAESE HFPFFHDTLT FARITEGMGH ILPYASQSGQ VAGEEQSAYS LSPYVPYPPR NYVRRDCEED DVGRRPRVRR CLDLNKGVNL TSPKRFSISQ DPFLSKNHPS SLSDGEETSQ HSRKAAGIAL EEDAVGDIFS SRSIRAQYGS DPMQMKVRSP RYLPEPKSPL VHSQHSTPSS HRHPGSHLYQ SPGFSGNFAY SSYQMASPHS YFQRRYPMVA SHPFSDNTGT PGIFMSQLPC PVYSPPQYPP QPFIPQQSVH SDTPVDANRM DNSGCTATGS SPNRTLPFPN PSLSLAIPPS PIRLRQKPPG KLNPVRRSAR SESKIREVRT RTSSGESTSQ AAGLCAAEVA TAGSVRAKAA IVTWYDRLDD LRRFRKEFGD CNVPQKYEPN RALGIWVNKQ RMEKKKLDRG ERSSMTTERL QALQSVGFQW AKLKGDVSWN QKYTELLEYR SVFGDCNVPT KYRTNPALGR WVSTQRSQFK EFQAGLVTHI TDQRISHLEK IGFRWSMMEE EEENNCTNEN SLRDGSEADA IFSRSMRVEK VKRWQYDKSR TSTRHSSINR VTSV
|
| |
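The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the stop codon is counted as part of the coding sequence. The helper names (`gene_length`, `coding_length`, `gc_percent`) are hypothetical and chosen only for illustration.

```python
# Values copied from the record above.
START_BP = 201211
END_BP = 203325
PROTEIN_LENGTH_AA = 594


def gene_length(start: int, end: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein (3 per residue).

    Assumption: the stop codon is counted when include_stop is True.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = bases.count("G") + bases.count("C")
    return 100.0 * gc / len(bases)


span = gene_length(START_BP, END_BP)        # matches the "Gene Length" field
cds = coding_length(PROTEIN_LENGTH_AA)      # coding bases incl. stop codon
non_coding = span - cds                     # bases not accounted for by the CDS

print(f"gene span:  {span} bp")
print(f"coding:     {cds} bp")
print(f"non-coding: {non_coding} bp")
```

Under these assumptions the span works out to 2115 bp, in agreement with the Gene Length field, and the 594-residue protein accounts for 1785 bp. The remaining 330 bp would be non-coding sequence within the gene span (for example introns or untranslated regions), which is consistent with the record marking the gene as spliced. `gc_percent` can be applied to the Gene sequence field to compare against the reported GC content.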