Gene Information |
Locus tag | PHATRDRAFT_44684 |
Symbol | |
ID | 7197920 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 1254122 |
End bp | 1255395 |
Gene Length | 1274 bp |
Protein Length | 358 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178671 |
Protein GI | 219115751 |
COG category | |
COG ID | |
TIGRFAM ID | |
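The reported gene length matches the Start bp and End bp values if both are read as 1-based, inclusive coordinates. That reading is an assumption; the record does not state its coordinate convention. A minimal Python sketch of the arithmetic, with `gene_length` as a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1254122, 1255395)
print(length)  # 1274, matching the Gene Length field
```

Under a 0-based, half-open convention the `+ 1` would be dropped, which would give 1273 and no longer agree with the Gene Length field.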
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0864059 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGAAGTCTCA CCTGGCGGAC TTCACCGAAA ATGGTAATCG AGGGAAACGC AACTGAGACT CGGGACTTTC CCATGCAGGG AAACATGCAG ACCGGAAAGG TGGTCCATAC ATACCGGGAC TATTCGCAAG AAGTCGAGAC TAAGTTTGAG GCCAGCAGGC ACATGTCTGA TACCATCACC GCTGACACGG CATCGAAAGG CTGCCGGGCC GAGGAGAACT TCCCAATAAA GCTTCATTAC ATGCTCTTGG AACTTCAACG AGACGGGCTC GATCACATTG TTTCGTGGCA ACCTCACGGA CGGTGCTTTG TTGTACACAA GCAAAAGGAG TTTGTGGAGC ATATTCTACC TTTGTAAGTG ACGAATGTAT CAACGCTAGC AAAGTGGACA CTGAAAGGGA ATTATTTTTA ACGAAAGTAT CGCGTCGGAA TATTGAGCAT CTCACACGTC TTCGTCTTGT TTCTGACAGC TGGTTCAGGC AAAGTAAATT CCCTTCTTTT CAGCGACAAC TGAACCTTTA CGGTTTCAAG CGATTAACAG CGGGTAAGAG CGGTTTCCAT TTTCCTAGCA GCCTCAGTTG TCCTTTGGAT GAAGAACTCA CTTTTGCATC ACCAATCCCA ATCTCCTTCG CAGGGCGTGA CAAAGGCGGA TACTACCATG AACTTTTTCT TCGAGGTAAG CGCTTTTTAG CGCATCGCAT CCAACGAATC AAGATCAAAG GAACAGGGGC TCGCAAACCA AGTTCACCGG AAACGGAACC GAACTTTTAC AGGACCATAT ATCTCCCCCT AGAACCCATT TCGGGAAGGC AAGCGAAGAC AAACTCCGTA TCCACTTTGT CATCCCAGAT GCCAAATATT TTGAATCGGT TCAATTCCGC TCTACCGGAT CCGAACGGTC GAATTTCGTT ACATCAGCAA CTCCTTGCGA CGTATCCGCA TTCGCCACAA TCGTTTCAGC CTTCCCCGGC GTCACTATTA CCGGCTGAGT TACTTATCAT GAAATTGCAG CGAGAGCGAC TTCAGCAAGA TATCATCATG GCCTCTCAGT GTTTTGACGG TCGTCTTGCT GGCCTTGGTC TAGATGGAAG CGGCGCGATT GCGCATCCCA ATAATTCCGC TCTGGCGTTG GCTGCGGCAT TTGTCCGGAG TCAAAGCAAC CCGTTCGCGT CCACGACAAG GTAGTGACAT GAAGGTGCCG ATTGTCGGGG GCTCTACATC ATAAAGTGAG TTCAGTAGCA GCTTCCTTTA CATAATTTTG ACATTAGTTT TTTG
|
Protein sequence | MVIEGNATET RDFPMQGNMQ TGKVVHTYRD YSQEVETKFE ASRHMSDTIT ADTASKGCRA EENFPIKLHY MLLELQRDGL DHIVSWQPHG RCFVVHKQKE FVEHILPFKV DTERELFLTK VSRRNIEHLT RLRLVSDSWF RQSKFPSFQR QLNLYGFKRL TAGKSGFHFP SSLSCPLDEE LTFASPIPIS FAGRDKGGYY HELFLRGARK PSSPETEPNF YRTIYLPLEP ISGRQAKTNS VSTLSSQMPN ILNRFNSALP DPNGRISLHQ QLLATYPHSP QSFQPSPASL LPAELLIMKL QRERLQQDII MASQCFDGRL AGLGLDGSGA IAHPNNSALA LAAAFVRSQS NPFASTTR
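The sequences above are printed in space-separated blocks of 10 characters, so the spaces need to be stripped before checking the Gene Length, Protein Length, or GC content fields. A short Python sketch of one way to do that follows. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and treating GC content as the fraction of G and C bases over the full gene sequence is an assumption about how the record computes it.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (assumed definition)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence from this record.
first_blocks = "AGAAGTCTCA CCTGGCGGAC TTCACCGAAA"
print(len(clean_sequence(first_blocks)))  # 30
```

Passing the full Gene sequence field to `clean_sequence` and taking `len` of the result gives a value to compare against Gene Length. The same helper applied to the Protein sequence field gives a value to compare against Protein Length.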