Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21430 |
Symbol | |
ID | 7202186 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 780220 |
End bp | 782120 |
Gene Length | 1901 bp |
Protein Length | 541 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181442 |
Protein GI | 219122207 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.368988 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TAGATTCTTC TTCCACTTTT GTCTAGATTT CGTACAGTTT CGATCCTTTG TCTCCATTTT CTGCTTCTCT CTCTCTCTCT CTTCAAGAAA CACAACCACA CGGCCTCATC ATGTCGATGG TCTACGGTAC GTTGCGTGTG TGCGATGGGA TGCGATGCAA TGCAGTATTC AACAACGACA AGCACACGAA TCGAGACAGT GTCACACACA CAAACAAACA CAAACCCTAC ACTCGCGCAA ATACTCAACT CACCACCGCT GTGTTCTTCC ACTTCTTCTG TTTCCGACTA GATGAATACG GCCGGCCGTT CATTATTCTA AAAGAGCAAC AGGCCAAGGC CCGTGTGAAA GGCACGGAAG CGACCAAGTC GAACATTCTC GCTGCCCGTA GCATCTCCAA CATGCTTCGC ACCTCGCTCG GTCCCAAAGG TCTGGACAAG ATGCTCGTCA GTCCCGACGG CGACGTTACC ATTACCAACG ATGGAGCCAC CATTCTAGAG CAACTGCACG TTGATCACCA GGTTGCCAAG CTCATGGTGG AGCTCTCGCA GTCCCAAGAC GACGAAATTG GTGACGGCAC GACGGGAGTC GTCGTTCTGG CCGGAGCGCT CCTGGAACAA GCCGAGGTGC TTTTGAAAAA AGGCATTCAT CCCATTCGCG TGGCGGAAGG ACTGGAAAAG GCCGCCGATG TGGCCATGCA GACGCTCGCC GAAATCGCCG AGCCCATGGA CATTGCCGTC AACAACCACG CCGCCCTCGT TGCGACCGCG ATGACCACAC TCAGCAGCAA GATCCTGCAC CAACACAAGC ACAAAATGGC CGACATTGCC GTCCGCGCCG TCCTGCAAGT TGCCGATCTC GAACGGCGGG ACGTCAACTT TGAGCATATA CGTGTAGAAG GGAAGACGGG AGGGAGTCTG GAAGACGCCG AACTCGTCAA CGGTATCGTT ATTGATAAGG AAATCGCGCA TCCGCAAATG CCCAAGATTA TCGAAGACGC CAAACTCTGC ATCTTGACTT GTCCGTTCGA ACCACCCAAA CCAAAGACGA AACACAAGCT CGAGATTGAC AGCAAGGAAG CCTACGAACA GCTCTACCAA CAGGAACAAG AATACTTTCG GGACATGGTC AAAAAAGTGA AAGACAGCGG CGCCAACCTC GTCATTTGTC AATGGGGCTT TGACGACGAA GCAAATCATC TACTGCTACA GAACGACCTG GCGGCCGTGC GTTGGGTCGG CGGGGTCGAA ATTGAGCATA TTGCCATGGC TACGGGTGGA CGTATCGTGC CGCGCTTTGA AGAAATATCG GCGGAAAAAC TCGGACACGC CGGTCGCGTG AAGGAAATTA CCTTTGGTAC CTCCGACGAA CGCATGCTGG TCATTGAAAA TCCCGTCAAT ACCACGGCGG TCACCGTTTT AGTCCGCGGC GGCAGCAAAA TGATCGTCGA AGAGGCCAAA CGCTCGTTGC ACGATGCCAT GTGTGTGGTG CGCAATCTCA TTCGGGACAA TCGGGTCGTC TACGGCGGCG GCTCAGCCGA AATCGCCTGT TCCTTGGCCG TCAGCCGATT CGCCGACACT GTAACCGGTG TGGACCAGTA CGCGATTCGG GCCTTTGCCG ACGCTTTGGA CGACATCCCG CTGGCCTTGG CGGAGAACGC TGGCCTCTCG CCGATTGAAG AAGTGGCGGC CGCCAAGTCG AGGCAAGTCA AGGAAAAGAA TCCCGTAATT GGGCTCGGTA TGGACGTGAT GAACGAAGCG GATGGCTACC ATTCGGCCGA TATGCGGGAA CTGGGTGTCT TTGAAACGTT GATTGGCAAG CAACAACAAA TTCAGTTGGC AACTCAAGTG GTCAAGATGA TTCTCAAGAT TGACGACGTG ATTTCCATGG GACCACAGTA G
|
Protein sequence | MSMVYDEYGR PFIILKEQQA KARVKGTEAT KSNILAARSI SNMLRTSLGP KGLDKMLVSP DGDVTITNDG ATILEQLHVD HQVAKLMVEL SQSQDDEIGD GTTGVVVLAG ALLEQAEVLL KKGIHPIRVA EGLEKAADVA MQTLAEIAEP MDIAVNNHAA LVATAMTTLS SKILHQHKHK MADIAVRAVL QVADLERRDV NFEHIRVEGK TGGSLEDAEL VNGIVIDKEI AHPQMPKIIE DAKLCILTCP FEPPKPKTKH KLEIDSKEAY EQLYQQEQEY FRDMVKKVKD SGANLVICQW GFDDEANHLL LQNDLAAVRW VGGVEIEHIA MATGGRIVPR FEEISAEKLG HAGRVKEITF GTSDERMLVI ENPVNTTAVT VLVRGGSKMI VEEAKRSLHD AMCVVRNLIR DNRVVYGGGS AEIACSLAVS RFADTVTGVD QYAIRAFADA LDDIPLALAE NAGLSPIEEV AAAKSRQVKE KNPVIGLGMD VMNEADGYHS ADMRELGVFE TLIGKQQQIQ LATQVVKMIL KIDDVISMGP Q
|
| |