Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38113 |
Symbol | |
ID | 7202957 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 105171 |
End bp | 106517 |
Gene Length | 1347 bp |
Protein Length | 448 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182158 |
Protein GI | 219123701 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCAAC GTGGTAGTCC TTCCCCACGA GTCGCCGAGG CCATTGGGCG CAGCCCTCAT CACGAAGAGC GCCAAGAAGA AAAGGATCGT CCTCGTTTCC GCTTTGGGCT AGCGGTACAT TCGCGACGAA AACTGACCCA GCTCGGGTCG CTTGCCGCGT ACGTCGTTTT GGTAGCGCTA GGTTCGCTTT GTCTGACGTC TCGTCTCCAC CTTCACACCT CGATGCCGGT GTCGAGAACA TCTCCCACGT TGGATCTTAC CATGGACGAT ACGAGTGCCA CCAGTACTCT TCCCGAGGAG TCACCCGTGT CTCTGTGGCG TACACTGCTC CGAACAAATA CCAAAGCCGT TGCTGCAAAG GAAAAGGCAT CCGTATCCGT TCAACTCGCC CAGATGTCCA GCGAACCCTT GCCTCCAGGT GTCGTGGATT TGCCCCAAAC CATCAAAAGT ACCGAGATTC CTCCCACCAA CAAGACATTG GTGGTAGTCC TGGGCGACTT GCGCTGCGGA GAAAACGCGT GGCAATCTCT CTACCGAAAC GTGCTCGACG AAAACATGGC CGATCTCGCG GTCTTTTCCC AAGTACCCGT ACAAGAAACC TACCGACACG CCAACGTCAG TATTTGGGAA CGAGCCCGCC ACGTCGAGAT TGTGCCGCGG TACCACGATT GGGCTGATGC GATCGACCAT TTGGCCGGAG AAAAGCTCTG GCGGGACGAG GTCCTGCAGC GCTACACTCC GCACACGCAT CCACTCATGC TCGGGGGAAT CCGCGGCTTT CAGTCGAGCG CGGCTATTGT CTTTTGGTTC CGCTGGTACT TGGCGCAACG CATCCGCACG TTGCAGTGGG CAACGCAGTA TGATCGCTTC ATCATTACCC GGACCGATCA GTACTACTCG TGTCCGTTAC GTATGAATAC GCTCCAGCCG GAGCGTATTT GGGTGCCTAC GGGGCAGAAC TATCGCGGAA TAACCGATCG ATTCTATATT GCCCCGGCCG ATGTAATACT GGAAACTTTG GAAGTGTTTC CACTCTTTTT GCGACAACCA GACATCTTTG CCAACTTTAC GAGCAGGCTT ATGAATCCGG AAACCTTCCT GCGAACCATG TGGACGAAAG CCGGACTGTT GTCCCGGGTG TACCGATTTC GTCGAATCAT GTTTACCTGC ATGACTCCGA TTGACACCAC AAAATGGGGA GTCATGCGAG AGAAGGTTCA GGAAGGAGTC CATCTCAAGT ACCCTGGGGA GTACACTGAC TTGGCTGATA TTTGTGAACG AGTTAACCAT CCGGAACGGT ACCCGTTTCC TTACCGGAAA GGTTTAGAGC GAGAAGATCT TCAATAG
|
Protein sequence | MSQRGSPSPR VAEAIGRSPH HEERQEEKDR PRFRFGLAVH SRRKLTQLGS LAAYVVLVAL GSLCLTSRLH LHTSMPVSRT SPTLDLTMDD TSATSTLPEE SPVSLWRTLL RTNTKAVAAK EKASVSVQLA QMSSEPLPPG VVDLPQTIKS TEIPPTNKTL VVVLGDLRCG ENAWQSLYRN VLDENMADLA VFSQVPVQET YRHANVSIWE RARHVEIVPR YHDWADAIDH LAGEKLWRDE VLQRYTPHTH PLMLGGIRGF QSSAAIVFWF RWYLAQRIRT LQWATQYDRF IITRTDQYYS CPLRMNTLQP ERIWVPTGQN YRGITDRFYI APADVILETL EVFPLFLRQP DIFANFTSRL MNPETFLRTM WTKAGLLSRV YRFRRIMFTC MTPIDTTKWG VMREKVQEGV HLKYPGEYTD LADICERVNH PERYPFPYRK GLEREDLQ
|
| |