Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_35240 |
Symbol | |
ID | 7200721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 419409 |
End bp | 421463 |
Gene Length | 2055 bp |
Protein Length | 684 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179641 |
Protein GI | 219117702 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGGAAG AAGCCAAGAA ACTCTGGAAG GACGAAATTG CGTCCAACAG CGACGTGAAA GCCGTCGTAT TCTCGTCCGC GAAACCCGAC ATGTTCATTG CCGGCGCCGA TATCTTTGAC ATTAAAGCCG TCGAGAATAA GCAGGACCTG ATTCCCTTTA TCGCCGACGG TGTGAAATTC TTTCAAGACA TGCGAGGCAA GGGTGTCCCT CTCGTGGCCG CCATCGACGG ACCCGCCCTC GGTGGTGGTC TGGAATGGGC GCTCTGGTGC GACTACCGTA TTTGCACGGA CAGTTCCAAG ACCAAAATGG GACTTCCCGA AGTAAAGCTC GGGCTTTTGC CCGGTTTCGG CGGCACGCAG AACTTGCATC CCGTCGTGGG CTTGCAAAAC GCCATGGATA TGATGTTGAC GGGGAAGGAT ATACGTCCGC ACCAGGCCAA AAAAATGGGC TTGGTGGACC TGGTTGTGGC GCAGGCCTCC TTGGAACGTG TCGCGATTGA TTCGGCGGCT GCTCTGGCCA ACGGATCGCT CAAAGCCAAG CGCAAATCCA AATCTATGTT TAACAAGATC CTCGAAGACA ACTCGATCGG GCGCAACGTC ATTTGGAACC AAATTGACAA AATGGTGCAA AAGAACACCA ACGGCAAGTA CCCGGCACCC TACGCCATTA TCGATTGCGT CAAATTCGGT TTAGACAATC CCTCACAAAA GTACCAGCAT GAACGTGAGG AATTCGCTAA ACTCGCCGCG ACGCCGGAAT CGGAAGCACT TATTGGTATT TTCGACGGCA TGACGCAGAT GAAAAAGCAC TCGTTCGGTG CTGATGCCGC CATTCCGGTC AAGACCGTGG CTGTCATGGG TGCGGGGTTG ATGGGAGCCG GGATTGCCCA AGTAACGGCA GAAAAGGGGA TCAAAGTCCT ACTCAAAGAC CGCAACGACG AAGCAGTCGG TCGGGGTCAA TCCTACATGA CGGAGAATTG GAGCAAAAAG CTCAAACGCA AGCGCATGAC ACAGTATCAG TACAACCTGA ATACTTCCAA CGTCACTGCA CTCACCGATG ATAGTCCGAC TTGGCAACGT CATTTCGGAA ATGCTGACAT GGTGATTGAA GCCGTGTTCG AAGATTTGGA TCTCAAACGC AAGATTGTCG CCAACGTCGA ATCCGTCACC AAGGATCACT GTATTTTTGC CACCAATACA TCCGCCATTC CTATCGCCGA TATTGCGCAG GGCGCTTCGC GTCCGGAAAA CATCATCGGT ATGCACTACT TCTCACCGGT ACCATCCATG CCCTTACTCG AGATTATTCC ACATACCGGT ACGAGTGATA CCGCTACGGC GACCGCTTTC GAAATTGGTT CAAAGCAAGG CAAAACCTGC ATTGTGGTCA AGGATGTCCC CGGCTTTTAC GTGAATCGCT GTTTGGGTCC GTATTTGGTC GAAGTGTCGG CGCTCGTTCG GGACGGTGTA CCGCTCGAGG CACTCGATAA GTCGCTCAAG AATTTCGGCA TGCCGGTGGG CCCCATAACA CTAGCCGACG AAGTCGGTAT TGACGTGAGT TCGCACGTGG CCAAGTTCTT GTCCAATGCC GACCTGGGAG TACGCATGGA GGGTGGTGAC GTCTCTCTGA TGGAGCAAAT GATTGGCAAA GGATGGCTGG GCAAAAAGTC TGGTCAAGGG TTCTACACCT ACAAAGGCAA GAAAAAGACC ATCAACGAGG AAGTACAAAA GTACGTCAAG GACTTTGCCA CACGCGACTT GAAATTAGAC GAGAAGGAAA TCCAAGATCG CATCGTGAGT CGCTTTGTGA ACGAAGCGGC CAAATGCTTG GAAGACGAGA TCATTGAAAA TCCTGTCGTT GGTGACATTG GTCTGGTGTT CGGTACAGGC TTTGCCCCCT TCCGGGGTGG TCCGTTCCGG TACCTAGATC AGGTCGGCGT CGCATCGTAC GTAGATCGCA TGAACACGTT CACCGACAAG TACGGTCCGC AATTCGAACC GTGTCAACTA CTGAAGGATT ATGCCGCAAC GGACAAAAAG TTTCACAAAC GGTAG
|
Protein sequence | MAEEAKKLWK DEIASNSDVK AVVFSSAKPD MFIAGADIFD IKAVENKQDL IPFIADGVKF FQDMRGKGVP LVAAIDGPAL GGGLEWALWC DYRICTDSSK TKMGLPEVKL GLLPGFGGTQ NLHPVVGLQN AMDMMLTGKD IRPHQAKKMG LVDLVVAQAS LERVAIDSAA ALANGSLKAK RKSKSMFNKI LEDNSIGRNV IWNQIDKMVQ KNTNGKYPAP YAIIDCVKFG LDNPSQKYQH EREEFAKLAA TPESEALIGI FDGMTQMKKH SFGADAAIPV KTVAVMGAGL MGAGIAQVTA EKGIKVLLKD RNDEAVGRGQ SYMTENWSKK LKRKRMTQYQ YNLNTSNVTA LTDDSPTWQR HFGNADMVIE AVFEDLDLKR KIVANVESVT KDHCIFATNT SAIPIADIAQ GASRPENIIG MHYFSPVPSM PLLEIIPHTG TSDTATATAF EIGSKQGKTC IVVKDVPGFY VNRCLGPYLV EVSALVRDGV PLEALDKSLK NFGMPVGPIT LADEVGIDVS SHVAKFLSNA DLGVRMEGGD VSLMEQMIGK GWLGKKSGQG FYTYKGKKKT INEEVQKYVK DFATRDLKLD EKEIQDRIVS RFVNEAAKCL EDEIIENPVV GDIGLVFGTG FAPFRGGPFR YLDQVGVASY VDRMNTFTDK YGPQFEPCQL LKDYAATDKK FHKR
|
| |