Gene Information |
Locus tag | PHATRDRAFT_42437 |
Symbol | |
ID | 7196642 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 49681 |
End bp | 51216 |
Gene Length | 1536 bp |
Protein Length | 428 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177006 |
Protein GI | 219110509 |
COG category | |
COG ID | |
TIGRFAM ID | |
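The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon (the listed gene sequence begins with ATG and ends with TAG). The function names `gene_length` and `coding_nucleotides` are hypothetical helpers introduced here for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_length_aa + 1) * 3


# Values copied from the record above.
span = gene_length(49681, 51216)      # genomic span of the gene
coding = coding_nucleotides(428)      # coding bases, including the stop codon
non_coding = span - coding            # bases in the span not accounted for by coding

print(span, coding, non_coding)
```

Under these assumptions, the span matches the reported Gene Length of 1536 bp, and the difference between the span and the coding bases is consistent with the gene being marked as spliced.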
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.138953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGATGGGTC TTGTTCCCAA ACCTTCCGCC AAAGATAGAC ATTGCAAAGG GTTCCAGTGT GCCGCTCTTC TTCTTTACAC TTTGCGCACT TGTAGGATCG GCCAATTTGT ACCTACGCGA AACATGGATG ACACTACGAT GCATCGAGCG GCCAGCCAGG ATTCGACAGA CATACGCTCT CCGCGGTTAA AGGAAACGTT TCCCTCAGTC GCCAACGAGG GAAGCCCTTC ACTAGCCGAA CATGCATCAA CCACTAAGCC GCTGACATTT CCTAAACCAA CCCATAGTCG CGCGGATAAT TGGATGATTG TCTCGTACCG ACGGACACGA TTCAGCAAGT TTTGCAAGGT TGATGTGCAT CCCGAAGACT ATTCCAAGCG CAATAATTTG CCCAGGTGTC GCCCAAGTCT CGTTGACCCC TTACCAAGTA CTCCGCTTTC GGATTGCGAT ACGTGTGATG CTGACAGCAA TTTGCATCGA CAGAAGGACT CTTTACCGGA CCGCGTCGAT GTGATCGTAG AGGTAAGAAG CCTCTCGCCC ATGCGTCGTG TTGGACTTTG TTCAAGAAGT CCATTCTCAT GCCAGATATA AATTCTTTGG GTTTTAATTC ATCCTAGGCG ACTGCACTTT GTGATCAAGA TTTGCATATA CGGCGCAATG CTTCCGAAAT ACATAAATGG TGGCCCTGCG CGTTCGGTAG CAGCTTTGTC GGACGCGCCC AGGCCTGCGG CCCCGAAGCG AAAGAAGCCG GGATCAAACC AGGATCGCGC GTGGCTGTAA TAGCCAAAAG TGGACCTATT GCCCGCTACG TTCCCGCTCG CGCACAGGAT TTAGTGACTG TACCGAAAGA ATTGGATGCT GCCGACATTG CCTGCCTAAT TGCTACATAC TTACCAGCCT TCCAAGCTTT GCATCACGGA AGGATCCGCC CCTACCGGTA TTCTCGAACA TGTTTCAAAG GAAGGAGGAT TTTGGTCACT GGTGGAGCTT CACCGGAAGG ACTGGCAGTC GTTCGATTGG CTCAATTGGC GGGAGCCAAG GACATTTTTG TAACGGCCCC GAGAGCGCAC TTTGATGTTA TCAAAGCGCA GCGTGCAATA CCGGTCGACG ACAATGCAGA GGCATGGTTG GATCAGCTCG AAGGGCGTAT CGACATTGCC ATCGATTTGA ATTTTCCAAG AAATTTTGTC TTTGTACGTC AGTCTTTGGC ACGGAAGGGG AGATTGGTAT GTCGCCCAAT ACATCATTGT AGCAGCGCTC CTGAGCAACA CTGCATGACA ATGACTAAAA ATCTATTTGG TCGCTTCCGT TTGTGTATGA TGAAACGTGC AACAATCTTT GATTTTACGG AAAACCTGGA ATTGCATCGG AATGAAACAG GAACAGGACT TTGGGTTTCT CCTACGTATG CTAGCGCTTC GAAAGATTCG CCCACACATT GATTGCTTCA TCCGACTGGG TGATGTCCCT GAAACCCTCC TAGACCTACG TGCCAAGCTA TCAACTGGCA CCATCATTTG TGAACCTTGG AAATAG
Protein sequence | MMGLVPKPSA KDRHCKGFQC AALLLYTLRT CRIGQFVPTR NMDDTTMHRA ASQDSTDIRS PRLKETFPSV ANEGSPSLAE HASTTKPLTF PKPTHSRADN WMIVSYRRTR FSKFCKVDVH PEDYSKRNNL PRCRPSLVDP LPSTPLSDCD TCDADSNLHR QKDSLPDRVD VIVEATALCD QDLHIRRNAS EIHKWWPCAF GSSFVGRAQA CGPEAKEAGI KPGSRVAVIA KSGPIARYVP ARAQDLVTVP KELDAADIAC LIATYLPAFQ ALHHGRIRPY RYSRTCFKGR RILVTGGASP EGLAVVRLAQ LAGAKDIFVT APRAHFDVIK AQRAIPVDDN AEAWLDQLEG RIDIAIDLNF PRNFVFVRQS LARKGRLEQD FGFLLRMLAL RKIRPHIDCF IRLGDVPETL LDLRAKLSTG TIICEPWK