Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39891 |
Symbol | |
ID | 7195694 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 288361 |
End bp | 289581 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183963 |
Protein GI | 219127481 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.013374 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCCGAA AGCACCATGA CAGAAGACAA TTTTCATTCA CCAAAGCGTT TTGTTTGACG CTTGCGGCAG TCGCCACTTT TCTTTTGCAG CAGCATCAAA TATCGAAGCT AGATACAGTT CGAGCGCCCA ACATTTCGTC GCTCCACCTA AAAGATGACC TCATTCTTCC AACACACCCG TACAGAGCGG ACCCCCTGCG AACAGGCATT CTTCCATCTG TATCAGTTAA ATCAGAGGTA CCTGCCACTC GGGTGAAGAA GGAACGAGAT GTACTATCCA TGCGCAACGG CAGCACGAAC AACGCTGGTC GTCTTCTACC AAGTTTCCAA GCCGGGGGTG TGATGGTATT CTTGCACTAT CCCAAGACGG GAGGCACCAC TTTGAACGGT TTAAAGGATC TTCCCAAAGT CCAATGGCTC CGTATCAATG ACTTTACATC GTGGAACCAA TATTGGCCAC TCATCCAAGC GCACTTGTCG CTGCCAGCCG AGAATCGCAG CACTTTATTC CTTGAATTTC ACAATTACTG TCCGAAACTG GATAATTTCT TCCCGATGAT GAACGAACTG AGGCAAGCAG CTGACGAAAA AGGAGTGTCG ACCTTCGTCT TTACACTGAT CCGCGATCCC ATCGACTTTG CGCTCTCATT TTTTCACTTC TTCTACACGC AACCGTGCTC CGTTTTCAAA CGATGCACTC CGCAAGAAGA GCGGTCCCAC GTCTGGATCA GCGCCACGGA AAAACATTTG CGCAAACTCT CACCAACAAA CTATCAGTGC CTCCTTTTAG CTTACGATAT GCGCAACTTG TGGCATTTAA AGAACGAATC GGCCACCTCG AATCGAACCT GGAACGGCCA AGTGGATCGT CGCCGCGTAG TGAACACTGA AGAATGCTTC CGTACCTTCC CTCTCTTGTC AGTCTTCGAC TGGATTGGAA CAACAGATCT GCTGAGCGAA GAGACGCTGC CCTTGCTGAC GCATATGCTA ACGGGAAACG CAACTCTCGG AACCCTTTTG CCCAAAATCA ACGCCGCTTC CCCGTCTAAA CTCTCGAGAA CGTCTCTAAC CAGTGAAACA TTACAGTACC TTCGAAATAT CAACAGCTTG GATCTTAGGC TCTACGAATA CGTCCAGAAA AATTTTCAAT TCGAATCACA GTGGGATAAC CTTCCAGCTA GTATTCAGAG GATTCGCGGT GATAGTCTTC AGGTCTCTTG A
|
Protein sequence | MLRKHHDRRQ FSFTKAFCLT LAAVATFLLQ QHQISKLDTV RAPNISSLHL KDDLILPTHP YRADPLRTGI LPSVSVKSEV PATRVKKERD VLSMRNGSTN NAGRLLPSFQ AGGVMVFLHY PKTGGTTLNG LKDLPKVQWL RINDFTSWNQ YWPLIQAHLS LPAENRSTLF LEFHNYCPKL DNFFPMMNEL RQAADEKGVS TFVFTLIRDP IDFALSFFHF FYTQPCSVFK RCTPQEERSH VWISATEKHL RKLSPTNYQC LLLAYDMRNL WHLKNESATS NRTWNGQVDR RRVVNTEECF RTFPLLSVFD WIGTTDLLSE ETLPLLTHML TGNATLGTLL PKINAASPSK LSRTSLTSET LQYLRNINSL DLRLYEYVQK NFQFESQWDN LPASIQRIRG DSLQVS
|
| |