Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49243 |
Symbol | |
ID | 7195704 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 350162 |
End bp | 352081 |
Gene Length | 1920 bp |
Protein Length | 576 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183975 |
Protein GI | 219127506 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.312572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTATCGAT CGTGAAGGGA CATACTCACA GTCAAGGAAT ACGTATTTCG AGCATTCAGT ATCACAATAA TTACAACAGT ACAATCTAGA CGCGTGCGCC TTGCCGTTCT TACTAATACG CACGACCACT GCCAGGGTTT TTCGTTCGCC GTCCCTCCTC CCGAACGATT TCCCTACCAC ACTTCCAGGA TGCCCGCCGC CGGCTCCGAG CGCACCATGG CGACACCACT GCCGCTTCCG GAAGCGGTAG TACCCGCCGT ATCGCTGTCG TCCCGTATTG CGCGTGGAGT CGGATCGATC TGGACTACTC TGTCCGGTCA CAAACGACCC GCAGCAGACC GCGACGATAT TGCGGAAGGA GGGCCGCAAC GGCAAGTCCG CGCACGCGTA TCCGCCTCAC CCAAAGCGGA GAACGGTACG TTTGGATCCG TGCCCGACGA CTCTGTGACC AACCCGTTGT TGTCGTTGCC TTTGTGGGAT CCTGTGGTGC CTCCAGGCTT CTGTGGAATT TGTCTCGATA CCGTCCTGAT GCCTACTAGT AAGGTTGGTT CCCTTCCCAC TTGCGGTCAC TGCTTTCACG TACATTGCTA CGCCAAGTGG AAGGCCTTTT GTCCTTCTGC CGTCGCCTTT TGGCGGTGTC CCTTGTGCGA TCAAGACGCG TCCGACTTTT ACAAACTCCA ACTCTTTCCC CAGCTCGAGC AAGCCGACGA CGACAATAAC AACAACCACG AACAGTCGCA GGACCAAGTC TCTCGACTCC CCACGCCACT GGAACGGGAC GAATTCCACG ACGCCGTCTC CGGTGTAACT CTCAGTGGCA CACTCCGCAA GCTCGTCAAC CACGTATCCT CCAGTACAGA CCCGCACGTC GTCACGCGCG CCTTGGACGA ACTTTCGACC TTGGCCTTTC GGTACAACCA CGCCCGCGTT CCCATTTGCG TGACGGGAGG GATTGCCGGG ATTGTGCGCG TCTTGACCAC CTGGTCGCAC GATACCAAGA CCACCACGAC TGCTGCACTC CGACACGCTT GTCAGCTACT CGGGAATCTA CCCCTCAACG AGTCGGCCAC GGAACGGATC GAAATCGCCA TATGCAATGC CGGAGGCATG GACGCCATAT TGTCCATCAT GCAACAACAC CCCGACGACG TGGATTTACA AGAACAAGCG GTCTTTTCCC TAGGCAAATT CATCTGCGAA GATGAACTGC TCGAGACCAA TTACGTGGCC GATCAAATAC CGGCCATTTT TCGAACCATG CAACGACACA GTGATGCGGC GTTGGTGCAA ACCTACGGGT GCTACGCCTT GGCTAGTTTA ACCTTTCGAG AAGACGTTCA GGAAGTGGTA CAAATGATCA CCGCGCAACA CGGCATACCT ATTGTTTTAC AAGCACTGAC ACAGCACGTC GACGATCCCG ATGTGGTGGA TGCCGCCTTG AACCTGTTGG CCAATCTCAC CGAACACGAT ACCTCGCAGA CGCGCGGTAT CCTCGAATCG AAATCACTCA AGGTTCTGGG CCGAGCCATG CAGAATTTTA CGGAACTTGT TGATGTACAG ATGCACGGCA TTGTTATTCT CCAACGAATA TTACGATTGC CGGACCGTAT CAGCGAAGCC GCTCTGATGG AATTGGGGCA AGCCGGGGTT GTGGTTGCCT TGACCAAATC CATTGACTTG TATGCGGATA AAGTCGATTT GCAAATCGGG GCGATTATCA TTCTCGCACG GTTGGCGCCC CTGGTCGATC TGCAACCAAT CATGCGCGAT GCCGGTACAC CGGGGGTCCT GCGGGTCACG ATGGATATAT ATAACGATGA AGAAACACTC ATGACGCACG CTTATCAGGT GTTGTCCTGT TGTTCCGAAT CACCGGTGCC CGCGGTTGCC GCGGCGGCAG CGGTCGATGT GGAGGCGTGA
|
Protein sequence | MPAAGSERTM ATPLPLPEAV VPAVSLSSRI ARGVGSIWTT LSGHKRPAAD RDDIAEGGPQ RQVRARVSAS PKAENGTFGS VPDDSVTNPL LSLPLWDPVV PPGFCGICLD TVLMPTSKVG SLPTCGHCFH VHCYAKWKAF CPSAVAFWRC PLCDQDASDF YKLQLFPQLE QADDDNNNNH EQSQDQVSRL PTPLERDEFH DAVSGVTLSG TLRKLVNHVS SSTDPHVVTR ALDELSTLAF RYNHARVPIC VTGGIAGIVR VLTTWSHDTK TTTTAALRHA CQLLGNLPLN ESATERIEIA ICNAGGMDAI LSIMQQHPDD VDLQEQAVFS LGKFICEDEL LETNYVADQI PAIFRTMQRH SDAALVQTYG CYALASLTFR EDVQEVVQMI TAQHGIPIVL QALTQHVDDP DVVDAALNLL ANLTEHDTSQ TRGILESKSL KVLGRAMQNF TELVDVQMHG IVILQRILRL PDRISEAALM ELGQAGVVVA LTKSIDLYAD KVDLQIGAII ILARLAPLVD LQPIMRDAGT PGVLRVTMDI YNDEETLMTH AYQVLSCCSE SPVPAVAAAA AVDVEA
|
| |