Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48046 |
Symbol | |
ID | 7203385 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | - |
Start bp | 1897 |
End bp | 3989 |
Gene Length | 2093 bp |
Protein Length | 678 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182595 |
Protein GI | 219124615 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0330655 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATG CTGTGTTCTT CTCCGTCGTG GCTCTCGCCA AAGCGGGCAT TGAATGTTGT CGGAACGCTC AGATCTGCAA GGATGAGGCA GCCCGGATCG GTAAGCGTCT GACAATAGTG GTCGCGCGAG CACACGAATG GGGAGCTGTT TGTGAAAGTA CGCGCCTTGC TCATTTTCAT GAAGTTGTGG AGAATGTCTT CCTGCGCTTG CAAGCAACCA CATCGTCTGT AAACAGGCGC TCCTTGTGGA ACAGAAAATT CAAGATTGCG TTACAACCCC AGACTATACT CTGTGAAATC CTTAAAGCAG AGAGCCAGCT GAATACTGCC ATCAATGATC TTCAGATGGA GCAGTCCAAT GCCATCTTTT CGCACCTCCT TGACGTCTCC AAAGGAGTTG CGGATTTGCT AGATCAGTTT GGCGCTCTCG CTCTGAGCAA GTCGGATCCT TCTGCAACAA TCCAGCGGCA AGTTGAAAAG GCCTTGGCGG AAACTCAAGT TCGAGCTCCC CAAGTTGCCG TTGCCAACCC CGATGACGTT AGCCAAGATG GTGGACGAGG GCAAGATCCT ACCTTAGCTA CAAGCGGTAT GAAGGTTTCG CACCAGTATA AGACAACCTT TTGTCCATCC AAGGATGATG TACTCGCTAT CTCGCTCCAG CCATCCTTGC TGAAGTTTTG TGATGACCAC GAGAGCCTCC TCGGAGGTGG AGGTTTTGCG GAAGTCTTTC GCGGAACATA CAACCTCCAA CCTGTCGCCA TCAAACGCCT CAAGGTATAT CGGGGAGATG TAAATTCTCT CTCGAAACTT CAGATCGCCC GCGATGTGGA ACAACTAGCC GCGGAAGCCC TTCTGACGCA CAAATGCGGT ACACATTCAA ACATCATTCA TGTTGTTGGA TGCCTTAGCA TACTAAGCGA AGTCGAGAGA CCCTTGCTTG TCATGGAACT TATGCATACG ACTCTCTTTG ATGTTATTCA CGATCGTGTT TTAGCAGATG CTCTGGCATT TTCCCGTCGT CTCTATTTGT TGAAAGGTAT TGCAGGCGCG TTAGAGTTTC TTCATTTGCA AGGCATTGTC CATCACGACA TCAAGTCTCT CAACATTTTG TTGAACAAAA AATTGACGGT TGCCAAGCTG GCAGACTTTG GGAGTCAAAA GTGAAAGGCC TGAACACCAC AAAACTCCGT CTTGGAACAA TCCTAGCAGC TACCAGCCAT CAAGGCAACC AGATTGCAGG TACAGCCGCC TACCAAGCGC CCGAAATCCT ATCAGAAGAA GTCAAAGACA CATCGCGCGT TTGTGAGATG TATTCATTCG GGGTGACAGT ATGGGAGTGT GTGACGAGCA AAATTCCACA TGGAGGGAAA AAGGAATCAT CTATAGCGCT TTTGGCTGCA ACTAAGAAGT ACCTCCCCAT GCTTGCGGCG CCCTCCAGCC CCCCAAAGGA TCTTTCAGAG ACAGAATCAG CTTCCTGGAA AGCGCTGAAT ATGGTTGCAG CATCGTGTCT CTCTCGCGAC CGCTCAGTGA GACCCACTGC TTCGATAGTT GTTGCGCTCT GGCACAAAGT CAAGTCTCCG GAAGTGGAAG TACCTTATTC TTTTTTTCAA GACTCGTCTT TTACCAAAAT GGGCGATATT ACCGAAAGTC GAGTAACGGC CGGTCCGACG ACTCAGGGAA CAGCAAAAGA CGACCTTGTC ATGGATGTTC CGGACGACAG AGAAAAGTCT AAATGCGGAG CAGCTGCGAT GTCAAAAAAT CATCGCCGCA AATTATTGGT TGTTGCACTC GTTGCTGTCT TGCTTACGCT TACAGTCACA GTGGTCGTTG TGCTGGTATC CAAATCGTCG CCAGATCTTG CATCCCCAGC ATCAGGAGTT GATGTACCAG CTGATGCGCC AGTCTCTACG TCCCCGCCAT CCCTACCGCC GACTGCAGTT CCGGTCTCCG TCCTCCCGCC GACTGCAGTT CCGGTCTCCG TCCTCCCGCC AACCGGTACG CCAATGACCG TCCAATCTCC ATCTGGGGCG TGCCTCAACG CCACAAGCAT AAATAGCCCA AAATGGGTTG TTTTTAACGC TTTTAAACTT TAA
|
Protein sequence | MTDAVFFSVV ALAKAGIECC RNAQICKDEA ARIGKRLTIV VARAHEWGAV CESTRLAHFH EVVENVFLRL QATTSSVNRR SLWNRKFKIA LQPQTILCEI LKAESQLNTA INDLQMEQSN AIFSHLLDVS KGVADLLDQF GALALSKSDP SATIQRQVEK ALAETQVRAP QVAVANPDDV SQDGGRGQDP TLATSGMKVS HQYKTTFCPS KDDVLAISLQ PSLLKFCDDH ESLLGGGGFA EVFRGTYNLQ PVAIKRLKVY RGDVNSLSKL QIARDVEQLA AEALLTHKCG THSNIIHVVG CLSILSEVER PLLVMELMHT TLFDVIHDRV LADALAFSRR LYLLKGIAGA LEFLHLQGIV HHDIKSLNIL LNKKLTVAKL ADFGTTSHQG NQIAGTAAYQ APEILSEEVK DTSRVCEMYS FGVTVWECVT SKIPHGGKKE SSIALLAATK KYLPMLAAPS SPPKDLSETE SASWKALNMV AASCLSRDRS VRPTASIVVA LWHKVKSPEV EVPYSFFQDS SFTKMGDITE SRVTAGPTTQ GTAKDDLVMD VPDDREKSKC GAAAMSKNHR RKLLVVALVA VLLTLTVTVV VVLVSKSSPD LASPASGVDV PADAPVSTSP PSLPPTAVPV SVLPPTAVPV SVLPPTGTPM TVQSPSGACL NATSINSPKW VVFNAFKL
|
| |