Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44014 |
Symbol | |
ID | 7204213 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 701389 |
End bp | 703384 |
Gene Length | 1996 bp |
Protein Length | 533 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186400 |
Protein GI | 219113633 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGTTG AGATTTTGTC ACATTACCGG CTCCAGTTTG CCCATCGAAA GCAAGTCGTC CCGAACAAAA AAAAACGTAG GTGGCTCTAC ATTGGACGAG GCCGATTGGG AGAACGCGTC CTTGTCACTG TTCCGCCACG TACTTCCTCG TTGGAAGCGC AACGGCAGCA AACGTAGTGA AGCATTTTCC GAAGAGCAGG CAGTTATCGA TCCAATTGAA GGGCAACGGG CAGAAGACAA TAGAGAGGCT GAAAACGCTT CGTTCTTGAA AAATTTGACA TTTCGAGCTT CGCAAGCATC TAAACGCGAA GAAAACAGCA ACTGGGACAA CATTGTACCA GTTGCTCCAC TTTTAAAAAC AGGATTTTCG AGTCATCTCC TTCATGGGTC CACTTCGTCT CAACCCACCG ATGACGCCAT TGACCAATTA GGAAGCAGCA AAGCGCGATC CGTCTCCTAG CCGGCCCGGC CCATATTCTT ACGTACAAAA GACTGATAGG CAATCAACGA TAATGTCACA GCAACTGGTT GTGGGCGACC TAACGGATCA GCGCGGTCGA GTCACCATGC CACCTACCGG TATGTATCCG CAGAACCCGC CACCCAAATC ACCCAGCAAG AAAACAAACG AAGTCGTCCG CGTGGTAAGG ATTTTCAGTG TTCATGTAAG TCGGATTTCG ATTTAGAGTC GGATTTTGAA AACGTCCCTA ATTTAGAGTC GGATTTCGAT TTAGAGTCGG ATTTCAAAAT CCGACTCTAA ATAGAGTCGT TTTCGAAATC CGACTCTAAC ATACATGATA CATGAACTGG AAGAACACGA CAAGGCAAAG TTGGCGGTCA GCTTCATGGA CCTTCATGCG AAACGCATTT GTGCTACGGT GTACAGAACA AGACCCTTTC TTTTCTTGAA ACAATGCGGT ATCGACCAAG CTTGTACACT TTCTTCTTAG GCACTGCGCT TCAAATTCTA TCAGCATCAG CTTTTAACGC GCCTTTGACA ATTCAGACAT CAGTTTCAAT TCGATTGACT GTGAAGCGGG GTATTTTCTC ACGAGCAGTC CCTAAAATAA AGGCAGGCAA TCAAGATGAG TTGATGAAAG AGGAATCTGT AAGATTGCAA GTTTGGAAAT CTCGAAGAAA TCAAATTCGA CAAACACTCA AGTCGGCCGA TCAAGTGCGT ATTTTCCGTC TCCAACAAGG GTGGGTGCCC GAGTTGGGCG ACGACGGTAA ACCCTTGAAA TCTGACGGAA AGGTGGCTTT GACACTCACG GCCTTTGTAA TTGCCGCCGG CGCCATCGCG TTGCGCATAG GTGGTCGTGC CGCTCTTGTA TCAACTCTGG GGTTGGATTT CGTGACTGAA AATCCAGAAC TTAAGGCAAA TCTAGACATT GTTCTCAACA CTGCCGACAA CATGGACCCA GTCACCAAGC TTTTACTGTT TACGGCGAGC TGGACTGCGG TCAAAGTTTT GTGTTTTGAT GCTGCGGGCG TGGCGTTGGC ATTGTCATCA GGAATTCTTT TTGGAGGGGT GTTACAAGGT GCTGTTGTGT CAGCCGCAGC GGCAACCTTC GGCTCTACCG TGGCGTTTGG GCTGGCAAAA CTGGACACTC CTTTACGCAA GAAAGGCTTG GGATTGCTTG ACGAATACCC AAGTTTACGT GGAATTGAAA AAGTCGTAGC TAAAGAAGGT TTTAAAGCGA TTTTGACTTT GAGATTGGCA CCATTGCTTC CAATTCCTAT CGGTGCGTAC AACTACATCT ACTCCATCAC AAACGTCCCT TTGCTCGACT TTTGCGGAGG AATCTTCATC GGAAGTCTCA AGCCTTACTT GCTTGATAGC TACCTTGGGT ATTTTGGGAA GTCGCTTGTT GACGGCACGG CAGATCAAAA TGGATGGCAG GATACCTTGC TATTGGCAGC TCTTGGTTTT AGCGTTCTCA TCGGTGTCTT TGCATCACAA CTTGCAAGTG AAACGTGGGA TTCAGTCTTG GAAGAA
|
Protein sequence | MSVEILSHYR LQFAHRKQVV PNKKKPFSEE QAVIDPIEGQ RAEDNREAEN ASFLKNLTFR ASQASKREEN SNWDNIVPVA PLLKTGFSSH LLHGSTSSQP TDDAIDQLGS SKARSQLVVG DLTDQRGRVT MPPTGMYPQN PPPKSPSKKT NEVVRVVRIF SVHNKTLSFL ETMRYRPSLY TFFLGTALQI LSASAFNAPL TIQTSVSIRL TVKRGIFSRA VPKIKAGNQD ELMKEESVRL QVWKSRRNQI RQTLKSADQV RIFRLQQGWV PELGDDGKPL KSDGKVALTL TAFVIAAGAI ALRIGGRAAL VSTLGLDFVT ENPELKANLD IVLNTADNMD PVTKLLLFTA SWTAVKVLCF DAAGVALALS SGILFGGVLQ GAVVSAAAAT FGSTVAFGLA KLDTPLRKKG LGLLDEYPSL RGIEKVVAKE GFKAILTLRL APLLPIPIGA YNYIYSITNV PLLDFCGGIF IGSLKPYLLD SYLGYFGKSL VDGTADQNGW QDTLLLAALG FSVLIGVFAS QLASETWDSV LEE
|
| |