Gene Information |
Locus tag | PHATRDRAFT_40785 |
Symbol | |
ID | 7198642 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 392477 |
End bp | 393941 |
Gene Length | 1465 bp |
Protein Length | 393 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184796 |
Protein GI | 219129227 |
COG category | |
COG ID | |
TIGRFAM ID | |
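The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, an assumption that matches the reported gene length of 1465 bp. The helper names `gene_length` and `coding_length` are hypothetical and are not part of any database API. The second helper assumes the record spans exactly from the start codon to the stop codon, in which case the bases not accounted for by the 393 aa protein plus one stop codon would be intronic, in line with the "Is gene spliced: Yes" field.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases for a protein, assuming one stop codon is included."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
span = gene_length(392477, 393941)
coding = coding_length(393)

print(span)           # 1465
print(coding)         # 1182
print(span - coding)  # 283 bases not translated (assumed intronic)
```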
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.432323 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTTGC GGACTCTTCC GGTCCTTTCG TCGTCGCGAC TCGTCAGCAT GCTCATGCTT TTGAACGTTG TCACAATATT ACATTATTTC CAGGCACAAA CGCACCTTTA CGATGCGGAG GCCTCCCGGT ACTTTACCGA GTCCTCCGGG ACGCTGCAAA CCCACAGTGC TCCTTCGGAC GGCATCACCA ACGCCAGACT GACAGTAAGA CCACGGCACG GACGTTTCCG GGATGATACC ACCGCGACTG CTCCTCTGTA TGGTAGTACT GTTTCCAGGG ACTCTTCGGG CAACGCTTCT TCTTCTTCAT CGTTGTCCGC GACCTGGACA TTCTGGCTCT GTGAAGAATG CCTACAGGTT GTAAATTACG CCCCGCGCGT CAAACTCACC AAGCCCGTCC AGACACGCTC GGATGAACGG AAATTCCTGA GTTACTTTTA CCTGAAAGAA GCGGGCAAGC ATCCCTTCCA AGGTGCCCTG GATGCCCAAG GCCGGTCTGG TTTCCACTAC GACGTCACCA GTTTACGACG GAGCCCGCCC TCCTTCGTAG ACAGCTTTCC CAATCTCACG GCCGAGTGTC TCCGGCGCGA TGACGAATAC TACGCACTCC AAAGACTTCG GATTCATTCG CCGTCACCAG AACAATCAAC TACCGCCACT CGACGACTAT CGCAATCGTC TACTCCGGCG AGAATACTCT GTGTCGTCTA CAGCAGCGAG CCCTTTCACC ACAAGCTGCA GGCCGCTCGA CAGACCTGGG CTCCCAAGTG TGACGGCTTC TTCGCTGCGT CCAACGTAAC CGATCCCACC TTTGATGCGG TCAACATTGT CCACAATGGT CCGGAACAGT ACAACAATAT GTGGCAGAAG GTACGCTCCA TTTGGGCTAC GCTGTACGAG TTGTACTATG AGGATTTTGA CTGGTTCCAT CTTGGCGGTG ACGATATGTG GCTTCTAGTC GAGAATTTGC GTATGTATTT GGAAAGTGAC GAGATTCAAG CCGCTGCCAA CGGAGGCTTT TCCGACACGT TACCACTAGG GGTGCAATCG GGCAACAATA ACAGTACTCG GATACAACCA GACCAGGTGC CCTTGTATCT GGGGAGCCGT CTTGCCTTTC GGAAGAATAT ACGAACCTTG TACAACACGG GCGGACCGGG ATACACTCTC AACAAGGCGG CGTTGAAGCT CCTCGTGACG GAAGGATTGC CCGTCATGCA CAGTCAGCTA CGAACCTCTG CCGAGGATTT GCGAGTAGCC GAGGTTTTCC GACGATTCCG CGTCTTGCCG TACCCGACAC ACGATCGCGA CGGAGGTGAG CGGTATCACC ACTTTACTCC GGGTTTGCAT CAGCTATCGG CCATGCCCGA ACAATATAAA TGGTTCGACA AGTGGGCTTC ACCAATGGGA TGGAAAGGAG GGTGGAATCA TTCTTCTGTG TACAGCGTCG CGTTCCATGG TATAA
|
Protein sequence | MGLRTLPVLS SSRLVSMLML LNVVTILHYF QAQTHLYDAE ASRYFTESSG TLQTHSAPSD GITNARLTVV NYAPRVKLTK PVQTRSDERK FLSYFYLKEA GKHPFQGALD AQGRSGFHYD VTSLRRSPPS FVDSFPNLTA ECLRRDDEYY ALQRLRIHSP SPEQSTTATR RLSQSSTPAR ILCVVYSSEP FHHKLQAARQ TWAPKCDGFF AASNVTDPTF DAVNIVHNGP EQYNNMWQKV RSIWATLYEL YYEDFDWFHL GGDDMWLLVE NLRMYLESDE IQAAANGGFS DTLPLGVQSG NNNSTRIQPD QVPLYLGSRL AFRKNIRTLY NTGGPGYTLN KAALKLLVTE GLPVMHSQLR TSAEDLRVAE VFRRFRVLPY PTHDRDGASR SMV