Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49505 |
Symbol | |
ID | 7195729 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | - |
Start bp | 446826 |
End bp | 448590 |
Gene Length | 1765 bp |
Protein Length | 456 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184253 |
Protein GI | 219128086 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0369192 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AATCAAGGCT AACTGTAAGT CGTATTGTGA AACTTCGATT CTCATTGCTG ATTACATTAC AGTAAACCAC AGCAGAGCGT GGAGCTGTTT GCGAAGCGTT CATATTTTGA GTGACGTACG CATTCTTGAT GCAAAGAGAA GCGATAGTTC GACAGCGCCG CAGCAACTTC AGCGGTGTTG TCAAAGACAC GATGGCTCCT GAAAAATAAA AGAGTATGCA ATTCAAGACA ATCATCGAAC CATTTCGGAT CAAGACGGTG GAATCCATCC GTCGGACCAC ACGCGAAGAA AGATTGAAGG CTATTGAAGA GGCTGGATTT AATTTGTTCC TTTTGGATGC TGACAATGTA CTAGTTGACT TGTTGACCGA CTCGGGTACG GGCGCAATGT CTGCCAAACA GTGGGGTGGG ATGATGCAAG GTGACGAATC TTATGCCGGT TCAAAATCGT TCTATAAATT TAAATCAGCT GTACAAGGCA TCACTGGTTA CAAGCACGTC ATACCAACGC ATCAGGGTCG AGCAGCAGAG AGAATCCTCT TTGGTTCTGT CTTGAAGGGT GGTGATATCG TGCCAAACAA CACGCACTTT GACACAACCA GAGCTAACGT CGAACACCAC AAGGCCGTGG CTTTGGACAT TCCCATATCG GAAGCACGAT CCCCTTCTGT CGTCCTTCCC TTTAAAGGAA ATATAGATCT TGAACTGTTG GAAGAGCAAC TATTGCGCAA CAAGCAAAAA ATACCGATCG TGTTTGTCAC CATTACGAAT AATTCGGGAG GTGGACAGCC TGTCTCAATG GCAAACATCC GTGGGGCAAG TCACATTTGT AAAAAGTTCA GTGTTCCCTT TTTTCTCGAT GCTTGCCGTT TCGCGGAGAA TGCCTGGTTT ATCAAAATGC GCGAGGAGGG CTATGCCGGC AAGTCTCCGC TAGAAATTGC ACAAGAACTA TTCAGCCTTG CCGATGGTTG CACCATGTCA GCTAAGAAGG ACGGCTTGGC AAATATCGGA GGCTTTTTGG CACTGAACGA CGATTGTCTA GCCCAGGCGT GCAAGAATGA ACTGGTAAGT TACCTGCAAC AAAGGAGAGC CCTTCGTACA ATTGTTCCAT TTATTGTCTT ACAAAGTGTT CCCCGCTGTA GATTCGCACT GAAGGGTTCC CGACGTATGG AGGGCTGGCC GGGTACGACC TTGAGGCTAT TGCTGTCGGT ATACAAGAAG TACTTGAGGA GGATTATTTA GCGTATCGAA TACAATCCGT TGCGTATTTT GGCAAGCAAC TGACCGATGC CGGGATACCG ATTGTCCAGC CACCCGGAGG CCACGCGGTC TACATCGATG CCACAGCTAT GCTTCCCCAC ATTCCCGTCT CCGAATTTCC GGCATGGGCT CTTTCTCTTG CACTTTATGT TGAGGGCGGT ATTCGCTCTG TCGAGATTGG GTCCGTTATG TTTGGACAGG AGACACCGGC TTCGATGGAG CTGGTTCGAC TGGCATTTCC ACGTCGGGTC TACACACAGT CGCATGTGGA CTATGTGTCC GAGGTCCTTC GCTACATCAA TGAGCATAAG AGCAACATCC ATGGCGTTCG TATTGTTGAG CAGCCAGCGG TTTTGCGTCA CTTTAGTGCC AGATTTGAGC CTATTGGAGG TTCTCTCCAA TAGCTGCAGG CATCCAACCT AGGCGTATGT CGATTCCAGA CAATCGCCAC AGTAAACTTT GTAGCAAACT AGATAACTGC GCATATAAAC ATTGCTAGAC TTTGC
|
Protein sequence | MQFKTIIEPF RIKTVESIRR TTREERLKAI EEAGFNLFLL DADNVLVDLL TDSGTGAMSA KQWGGMMQGD ESYAGSKSFY KFKSAVQGIT GYKHVIPTHQ GRAAERILFG SVLKGGDIVP NNTHFDTTRA NVEHHKAVAL DIPISEARSP SVVLPFKGNI DLELLEEQLL RNKQKIPIVF VTITNNSGGG QPVSMANIRG ASHICKKFSV PFFLDACRFA ENAWFIKMRE EGYAGKSPLE IAQELFSLAD GCTMSAKKDG LANIGGFLAL NDDCLAQACK NELIRTEGFP TYGGLAGYDL EAIAVGIQEV LEEDYLAYRI QSVAYFGKQL TDAGIPIVQP PGGHAVYIDA TAMLPHIPVS EFPAWALSLA LYVEGGIRSV EIGSVMFGQE TPASMELVRL AFPRRVYTQS HVDYVSEVLR YINEHKSNIH GVRIVEQPAV LRHFSARFEP IGGSLQ
|
| |