Gene Information |
Locus tag | PHATRDRAFT_54087 |
Symbol | |
ID | 7197424 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 358153 |
End bp | 359982 |
Gene Length | 1830 bp |
Protein Length | 549 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177920 |
Protein GI | 219112337 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
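The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, which is consistent with the reported Gene Length. The function names `span_length` and `coding_nucleotides` are hypothetical and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed interval [start_bp, end_bp] (assumed 1-based, inclusive)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein, excluding any stop codon."""
    return 3 * protein_length_aa


# Values copied from the Gene Information fields above.
gene_length = span_length(358153, 359982)
coding = coding_nucleotides(549)

print(gene_length)           # 1830, matching the Gene Length field
print(coding)                # 1647
print(gene_length - coding)  # 183 bp of the span do not encode residues
```

The genomic span being longer than three times the protein length is what one would expect for a gene flagged as spliced, although the record itself does not say how the remaining bases are divided between introns and other non-coding sequence.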
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.027692 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ACATCACAAC CATGAGCGCA ACGACTGTGA GCGCGGATGG AAGAAAGTAT TGCCCCCGTA TCCATTTTGC CCGTTGCACA CTAGTATCAA AAACGAGATG ATTGTCTACG AACAGGACCT ATTTGGTCTC AAAAATTTGA CCCGCGTGCA CGGATCAGCT GTTTACCGAG TAGTCTTACC AGCTTCAGTC TCGTCCGCGA TCCTCCTGAT TTTTCACAGT TTACAAATCG ACAATGACCA GACCCAAAAG GCACGAGAAG ATCTCATCGT GCAGCATCCT TACGTCATTG GGGCCTTTGT TGGATTCTTT TCTTTTCTGT TGACCTTTCG ATTGAATTTT GCCTACCAGC GGTACTGGGA AGGGGCTACC GCTATACACC AGATGTTGAG CAAATGGACA GATTTCGGTA TGAACGTGGC GGCTTTTCAC TACCAGAGCG ATGTATATGA TGATATACAG CCACCGAGCT TTGGATCGCA TCCAAATTTG CGAGCAAAAG ACATTGTGGG ACGTGAACGC GATTTCGAAA CAACGTACGA GGAGTGCGTT GAGAAAGTAG AGCTCTGCTT GGAAGAAATG CAGCAAGAGA GTATCGCAAA CACATCCCCG GCGTGGTGGA GAGGGGGATT ACGACGTCGC AGACACAAAG ACCACGCCGA TTCGACTCGG CAACCGTCCA CAACAACACC CCACAGACTT AAATCGATCA ATGCCAACAA AGAGACTAGC AGCAATTCCG CCCCCAACCG CCTAGATTCA TACATTCCCA TCCCCCAGCG CTTCCAGGAA CAATTTGGAA TAAACTTGAG GAGACAGTCA ACGCCTATCA CCACGGATAG TCGAGCCAAC AGCCGTCGTA GTAGTCTCGT CAAACAATCC AAAGCATCGC AAAATCTAGT TTCTCGTAAG GCACGCGTTC CTTTACCCAG CCTGTTTTTA CAGGAAACTG CACATTTGCT TTCGCTTTTG GCCGGTGTCG CTTTCTCGAC GCTCCGTAAC GATATGGAGC AGGCCGAGAA TCCTTTAAAC ATGTACTATC CCGGAAAACC CTGGCCTGCG ATGGATCCGG ATTGTCTCGA TAAAGATACC AAGCACCTAT ACGGCGAAGA TCAGATCCTA TGGCAATCCT TGTACTTTAT GCTTGGTTTA GATCGAAGTG AGCGACACAG GACGCTGTAC AACGCCGCCA GACCGTTTGG TGTCCTTGGT GGTGTCAGTG ATCAAGAAAT TGCCATGCTT CAGCAAGCAC ATGGGCCGTA CGCCAAGGTA GCGCTGTGTA CAATGTGGAC GCAAGAATTT ATTAGCCGAG AATACATGGT AGGGTCGGCC GGAAAGGTGG CACCGCCGAT TGTGTCGAGA TTGTACCAAT TTCTATCGGA TGGTGTGGTG TAAGTCAACA CGGCGTCGTT TGATGGTCAG CAGAGCCAAG TTTCTCATCA TTGTATGCCT TCGGTATTTT TGTCAAGGAC ACAGTGGCTA CAACCAGGCC CGGAAAGTTG CCTATATCCC TTATCCTTTT GTAAATGCTC AAATGACAGC ATTTTTCAGC CTCAGTATCA TATTCATTTT CCCCATGTTG ATGTATTCCT ACGTCAATGT TCTTTGGTTT GCTTGCGTAC TAAATTTTAC TGTCGTGACC TGTTTTCTGG GTATGCATGA GGTTGCTCGG GAATTGGAGA ATCCTATGCA AAACGCGCCG AATGATTTAC CCTTGGTCAC CTATCAGGCC CAGTTCAACG AAGCGCTGGT AACCATGTAT GCTGGCTTTC ACCCGGATTC GTGGTGGGAG ATTCGCAGCC 
AAGAGTCCAA GGATGATAAC ACGCAGTTCG
|
Protein sequence | MIVYEQDLFG LKNLTRVHGS AVYRVVLPAS VSSAILLIFH SLQIDNDQTQ KAREDLIVQH PYVIGAFVGF FSFLLTFRLN FAYQRYWEGA TAIHQMLSKW TDFGMNVAAF HYQSDVYDDI QPPSFGSHPN LRAKDIVGRE RDFETTYEEC VEKVELCLEE MQQESIANTS PAWWRGGLRR RRHKDHADST RQPSTTTPHR LKSINANKET SSNSAPNRLD SYIPIPQRFQ EQFGINLRRQ STPITTDSRA NSRRSSLVKQ SKASQNLVSR KARVPLPSLF LQETAHLLSL LAGVAFSTLR NDMEQAENPL NMYYPGKPWP AMDPDCLDKD TKHLYGEDQI LWQSLYFMLG LDRSERHRTL YNAARPFGVL GGVSDQEIAM LQQAHGPYAK VALCTMWTQE FISREYMVGS AGKVAPPIVS RLYQFLSDGV VGYNQARKVA YIPYPFVNAQ MTAFFSLSII FIFPMLMYSY VNVLWFACVL NFTVVTCFLG MHEVARELEN PMQNAPNDLP LVTYQAQFNE ALVTMYAGFH PDSWWEIRSQ ESKDDNTQF
|
| |
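The sequences above are displayed in space-separated blocks of ten characters. Before computing anything from them, such as the GC content reported in the Gene Information fields, the blocks need to be joined into one continuous string. The following is a minimal sketch under that assumption. The helper names `normalize_sequence` and `gc_fraction` are hypothetical, and the example input is a toy string, not this record's sequence.

```python
def normalize_sequence(raw: str) -> str:
    """Join a block-formatted sequence by removing all whitespace and upper-casing."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 for an empty string)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Toy input formatted like the blocks above (not taken from the record).
toy = normalize_sequence("ACGTACGTGG\nacgt")
print(toy)               # ACGTACGTGGACGT
print(len(toy))          # 14
print(gc_fraction(toy))  # 8 of 14 bases are G or C
```

Applying the same two steps to the Gene sequence field would give a value to compare against the reported GC content, bearing in mind that the database may round the figure or compute it over a slightly different span.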