Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44149 |
Symbol | |
ID | 7203854 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1122943 |
End bp | 1125265 |
Gene Length | 2323 bp |
Protein Length | 459 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186474 |
Protein GI | 219113781 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.104122 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCCAT TCCGAATGCT TAAGGATGCT CCCCGGGAAC TAGAAGATGC GTCTTTCGGA GTCCAAATTA TTTATTATGA GGACTTTTAC TACGCGCTCG TTTTCCTTAC AGCAGTCTAC CTGTCCGGAC GTGTCGCCAG CCTTTTACGG ATGCCGGCCT TGACCGGGGA AATTGTTGCT GGCATTCTCT TGGGACCGCC ACTGGCTAAC TTTGCTCCCA TCCCTCAAGC GTGGGTGCTG CTCGGAGAAA TCGGGTAAGT GGTGGAAGGG ACAGATAGTT TGATCTTGCA AGTATCCATT ACCATACAGA CAGGCTAAGG CTCCGGATGG CAATAAGACT TATTTTGACA GTGCTCGAAG CTGCCATGGA CATTGACGTC AAGAATTTGA AAGTTATTGG AATACGCGGA TTTACAATCG CTGTAGTTGG ATCGATTTTA CCAATTTCAA TCGGAATTAG CTTGGCCTAC GGCATGGGGT TTGAAGGTAA GGCAGCAATT GGGGCTGGCG CAACGTTCGG GCCGACATCA ATGGGCATCG CGCTGAACGT TCTCAAGAAC GGTGGAATTT CATCAACACC CCTTGGTCAG CTGATTGTTT CGGCCGCAAT TGTAGACGAT ATGCTCGCAT TGGTGATTTT GAGTCAGCTC GAGGCCTTGA CCGGGGAGAT CACAGTGGTT GGTGTCGTTG TTCCGATTCT TTCGGCCTTC CTTTTTCTCT TGCTCGGGGG TTATATTGCG CTCTTTGTAT TACCGAATAT CAGAGAGAAA TATTTGATTA GCAAAATCCC ATCCGAACAT CGCGATGAGG CTCAACTCAT CATACTCTTC GGGATCCTTC TAGCAATGGT ACCAGCGACG TTTTACGCCC AAGCATCGTA TTTAATGGGA GGATTCGTCA CTGGTCTTGC CTTTTGTAGC GCAGAGGGAA CCCATCAGCT TTTCATTAGT CAGTTCAAGC GAGTAACAAC GTGGTGCATG CGAATGTTCT TCGCGGCAAG CATTGGCTTC CAAGTCCCCG TCAGGGATTT CGGAGACGTC CAGGTACTTT GGCGCGGAAT GGTATTTTTC TCGTCGCTGT TCGGAAAACT TGCGGTTGGG TTTCTCGTCC CGAATATGCA TGAAAGCCGC AACTTCACCG GCCCCCATCT TCGCGACTGC TTGATTACGG GTTTCAGCAT GGCGTCGGAA GGAGAGTTCG CCTTCGTTAT CGCGGTGTTT GCCGTCGATA ACGGTCTAAC CGATCGTGGA TTGTACGCCA GTGTCGTATT GGCAGTCCTC CTGAGTGAGA TTATTCCTCC GTTCCTTTTG CGATACACGA TTGCCAAGTA TGAAGTTTCA AAAACAGAAC CAACGACTCG ACATCGTAGC GATGTCGAAT CAGAGAACCC ATTGGAAACT TTGTCGAATT CTGAGTACAA TCATAGTCAG GGACAATAAG TCCTACAGCA GCCGTGACTG CTACATCGGT AATGCATACA TCCAGCAAAA CTTCGAGTCG ACAGCGATTT TGGTTTGGGA ATGAATGATG TTCCAGATGG GGTCCTTCAT CGAACAATTA TAATCTTGTA CTTTTGTGTG AATAGAACTA TGTGTCCGTG TTACTAGCCT TTTAAGCTGC AGTTCCAAAC AAAAAGTTCG CTTCAAACCA TGCGGAATCA AAATGGCGAG CGGAACTTCA AAATAGGAGG AAACTTCGAT AATATGGCGC CTTCCAAAAC TTGTCGTTCT TGTGACAAAC ATCTTTGCCG TATCGCAGTG TCTTTTTCGT CTCGACAAAA TTAGCCTCGG TGTTAGCTCA ATCTAGTATC GGATCTTCCT CGTCATCTCC CTTTGACTTT GATGAAAGTC CCAGCCACTG CGACATTTTC TCAAACTCCG CAAGAGTCGC CACCGTAGTT GTGCCTGCGA TACCACCGAC GGCAAGAACA AGTCCTCGCA ACCTTGCGGC CATTTTGGAT GCCGCTTCGA ACGGATTTAT CCCCTTATTG AAAAAACGGG TAGCATTTGC GAGAACACTT TGATCACACA ATTTTTCCTG CAACTCGGTA ATTCGAAAAT AGGAAGCCTC CACAAAAACC TCACTCAAAA CTTTGTGGTC CTCCGGAAGC TGCACATAAT TTTGGGTATT CTTGCGCAGA CTCACGGTCG TTCTGCCGTA GGACAGCAGT TCGACGCGGT TCCTCAGGTA CTGCAGAATA AAGCCGAAAT GTGAAGGATC GCGGTCAATG AACACTGCTC CGTTGCGTAC AATTTCCTGG TTGGCTTCCG CCCTTGCCAC GTGATCGGCC AACACCGGAT TTTCGGCCAC GGTCGACCGC AAC
|
Protein sequence | MMPFRMLKDA PRELEDASFG VQIIYYEDFY YALVFLTAVY LSGRVASLLR MPALTGEIVA GILLGPPLAN FAPIPQAWVL LGEIGLRLRM AIRLILTVLE AAMDIDVKNL KVIGIRGFTI AVVGSILPIS IGISLAYGMG FEGKAAIGAG ATFGPTSMGI ALNVLKNGGI SSTPLGQLIV SAAIVDDMLA LVILSQLEAL TGEITVVGVV VPILSAFLFL LLGGYIALFV LPNIREKYLI SKIPSEHRDE AQLIILFGIL LAMVPATFYA QASYLMGGFV TGLAFCSAEG THQLFISQFK RVTTWCMRMF FAASIGFQVP VRDFGDVQVL WRGMVFFSSL FGKLAVGFLV PNMHESRNFT GPHLRDCLIT GFSMASEGEF AFVIAVFAVD NGLTDRGLYA SVVLAVLLSE IIPPFLLRYT IAKYEVSKTE PTTRHRSDVE SENPLETLSN SEYNHSQGQ
|
| |