Gene Information |
Locus tag | PHATRDRAFT_41016 |
Symbol | |
ID | 7198930 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 143131 |
End bp | 144533 |
Gene Length | 1403 bp |
Protein Length | 460 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184975 |
Protein GI | 219129606 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGTCCCCG TCGCCGAACT CGGTCGCCAG TACCAATCCG GAGACCGCGT CTGGCTGCGG GGACGACTAC AGTCCATCCG GGGCAAGGGA AAATCGTGGT TTCTCGTACT GCGACAAAAC TCATTCGATA CCGTACAGGC ATGTTACTTC AAAAATGTCG ACGATGCGGA AGCCTCGCAA AAAATGATAC GTTACTTGAA AACGCTCACG GCCGAAAGTG TCGTTGATCT CGAAGGGACC TTGGTCGACG CGGACGTCAA ATCGTGTTCC GTCAAAAACG TCGAACTCAA CATTCACCGC ATCCACACCG TTTCCAAAGC CGACGCCATC TTGCCATTTG AGGTAGAGGA TGCCGCCCGT AGTGAGCAAG AAGTCGAGGC CTCGCAGAAC ACCGAACGTC CCTTTCCCCG TTTGGGGCAG GAACTCCGTC TCGATCACCG TTGGATGGAT TTGCGCGCGC CGGCCAACAA CGCCATTATG CGCATACAGT CCGCCGTGTG TCAACTTTTC GTGAAAGTCT CTACAGTCAG GGCTTTTGCG AAATACACAC ACCCAAGCTA ATTGCCGGCG AAAGTGAAAG CGGCGCCGGC GTCTTTACCA CGGACTATTT CGGAACCACG GCCTGTTTGG CCCAGTCACC ACAGCTCTAC AAACAAATGG CCATTGCGTC CGATCTACCA CGCGTCTTTG AAATTGGACC CGTCTTTCGC GCCGAAAATT CCAATACCCG CCGTCATCTC TGCGAGTTTA CCGGACTCGA TCTGGAAATG GCCATTGACG ACCACTACTT GGAAACCTTG GAGGTTGTTC ACGAACTCTT TAAACATATT TTTACCGGCC TCGAATCGCG TTGGGCGAAG GAATTGAACA TTATTCGGGA ACAGTACGAT TCCGAACCCG TCGCTTTTAC GCCAGATCCG TGCGTGTTAC ACTGGCCCGA AGCCCTGGAA ATCCTTCAAA ACGAAGGATT CGATATTGCT GACGGTATGC AGGATATGAA CGGTGCCATG GAACTCGCGT TAGGTAGGGT GGTCAAGGAA AAGTACGGCA CTGACTTTTT CATGCTGGAT AAGTACCCGT CCTCCATTCG GCCTTTCTAT ACCATGCCCG ACCCTGAAGA TTCCAGATAC TCGAATTCGT ACGATATTTT TATTCGGGGA CAAGAAATAT GCTCCGGAGC CCAGCGGTGT CACGATCCGG ATCTGGTCGA GAAAATTTTG CAAGAGAAAG GCATTGAAGT CGGTGACGGT CTCAAATCCT ACATTGAGTC CTTTCGTCAC GGGGTCAGTC CCCACGCGGG TGCTGGGATC GGTCTGGAGC GCGTCGTCTT TTTGTACCTC GGCCTCGACA ATGTTCGTAA AGCCTCCATG TTTCCGCGCG ATCCCAACCG ATGCACACCC TAA
Protein sequence | MVPVAELGRQ YQSGDRVWLR GRLQSIRGKG KSWFLVLRQN SFDTVQACYF KNVDDAEASQ KMIRYLKTLT AESVVDLEGT LVDADVKSCS VKNVELNIHR IHTVSKADAI LPFEVEDAAR SEQEVEASQN TERPFPRLGQ ELRLDHRWMD LRAPANNAIM RIQSAVCQLF GFCEIHTPKL IAGESESGAG VFTTDYFGTT ACLAQSPQLY KQMAIASDLP RVFEIGPVFR AENSNTRRHL CEFTGLDLEM AIDDHYLETL EVVHELFKHI FTGLESRWAK ELNIIREQYD SEPVAFTPDP CVLHWPEALE ILQNEGFDIA DGMQDMNGAM ELALGRVVKE KYGTDFFMLD KYPSSIRPFY TMPDPEDSRY SNSYDIFIRG QEICSGAQRC HDPDLVEKIL QEKGIEVGDG LKSYIESFRH GVSPHAGAGI GLERVVFLYL GLDNVRKASM FPRDPNRCTP