Gene Information |
Locus tag | PHATRDRAFT_43223 |
Symbol | |
ID | 7196948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2374422 |
End bp | 2376221 |
Gene Length | 1800 bp |
Protein Length | 595 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176967 |
Protein GI | 219110431 |
COG category | |
COG ID | |
TIGRFAM ID | |
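The length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (which is consistent with the stated 1800 bp) and that the coding region includes one stop codon. The helper names `span_length` and `coding_length` are illustrative and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2374422
END_BP = 2376221
PROTEIN_LENGTH_AA = 595


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive ends."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon (assumption)."""
    return 3 * (protein_aa + 1)


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)

# Prints: 1800 1788 12
print(gene_bp, cds_bp, gene_bp - cds_bp)
```

Under these assumptions the gene span is 12 bp longer than the coding region implied by the protein length. This appears to match the listed gene sequence, which shows 12 bases before the first ATG.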
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACCTTTCCG CCATGAGAAT GACCGTTCTT TCCCGACGGC TATTCAAGCG ACTGCGTCCC GACCGAATAC TCCGTATCTA CCCAAAGCAT GTTCCCCTAC AGTGGTGGCG GTGTTGTTGT TGGTGGGGCT GCTACTTGCT GCTGATGACT CGCTCGGCAG TGACCATCTC GGCGAGTTCG TTGTCGACCG ACGAACGGTA CCTGTACTAC GCCCAAGACG TGGACACGTC GCCCGTCATC GAGTACATGC CCGACGCCGA CGCCGCCGAT CCCAACCATC CCGACTTTCT CTATCGTCCA CAAGCTCACG GACCTCGAAT TGTGGAATTC TACGCACACT GGTGTCCCCA CTGTCGGCGC TTTCGGGATC ACTACGTACA ATTCGCGGGA CAACTCGTCG CCATGGCCAA AGACCAAAAG GTCGAACCCC CGCTGCGCGT CTACGCCATT TCCTGCGTAG CGCACAAAGC CATCTGCCGC GATCAAGGCG TCAAAGGATA CCCCAGTCTC AAAGTGTTCC CCGCCTATTC TCTCAACGCA ACGGCGGAAC CCTCCTACTT TCGACTCCAC CCGTTTAGCG TTCTCGGCAG TATGGGCATC GACTTTGACG TCGACAACCA CGCCCAGTTT GCCGTAGCCG ATACGTCAAT GACGGCCGCC TCGGCGGCCG TGACTAGTCA CACGCATTCG TCCTTTCTGC GTCGGAACTG GTTCGGGAGT CTCACCAGTA CCACGGACGG TACCCTCCGT GAACGCCAAA CCCACGTTGC CGATCTGGAC AGTACCCGTC GGACCAAGCA AAACGTTTTT GACGATGCCT ACCGATCCTT CGATTTCGCC ATCCGTACCG CCGTCTTCAT GACCAACGGT CCTTTGGAAC ACAACGCGAC CAGAGACGCC TTGCACGACT GGTTGTCCCT GCTACAAAAG GCCACGCCAC CAACCTGGTC GACGTTACAG AAACCCGTAC GGGCACTCCT GGGCAACTTT GACGAAATCG TGCGCGGCGA AGACCACTTG CTCGCAGTGT ACGAAAAGGT AGCCTCGCCG CCCGCATCGC ATCAGTGGAG TGACGACTGC TCGCACGGCC AAAAAGGTGC CGGCTACACG TGCGGCCTCT GGCAACTCTT CCACATTGTC ACCGTGGGTG CCACGGAATG GAATCTTATG CTCTTGGAAG AGAATTCACC CAATCTTCTC GACCTGACCG ACACCGCTGA CACGTTCCGG AACTACGTCC AGCATTTCTT CGGTTGCGAA GTTTGTCGGC TCAATTTCGT CTCGGCCTAC GACGCCTGCG CGCACGATCG GTGCCACCGC TTGGACCCTA CCGACCAGTC CCGGACCGCC TGGATCCAAC TACCCCTCTG GTTGTTCGAA ACGCACAATG CCGTCAATGC TAGACTCCTT CGCGAACAGG CCGAACGGGA AGGATGGAAC GTTACCCTCG CCGACCAGCG CGCCCGGGAG TTTCCCTCGC GCCACGCCTG TCCCGTGTGT TGGAAAGCTG ACGGGAGCTG GGACGAAGAT ATGGTGTACC AGTTCCTGCG ACTCGAATAC TGGCCCGAAG ACTCGGTGGC GGTAGACCTT CGGGAGCAGT TGGCCCAGCG CATTCGGGTA CAACAGGAAG GCTGGGATTC CCAGCGTGAC CCGAACGATC CGGACGACGA TCGGAACGTC CCCGTCCCAC CCGTGGCCTT ACAGCTGGTT CCGTTGATGG TTGTGGTAGG ACTAGTGGCC GCCTGGTACA CGAAACGTAA CGAGCGGCTG AGGACGGGTC GGCACAAACG GATCGCCTGA
|
Protein sequence | MRMTVLSRRL FKRLRPDRIL RIYPKHVPLQ WWRCCCWWGC YLLLMTRSAV TISASSLSTD ERYLYYAQDV DTSPVIEYMP DADAADPNHP DFLYRPQAHG PRIVEFYAHW CPHCRRFRDH YVQFAGQLVA MAKDQKVEPP LRVYAISCVA HKAICRDQGV KGYPSLKVFP AYSLNATAEP SYFRLHPFSV LGSMGIDFDV DNHAQFAVAD TSMTAASAAV TSHTHSSFLR RNWFGSLTST TDGTLRERQT HVADLDSTRR TKQNVFDDAY RSFDFAIRTA VFMTNGPLEH NATRDALHDW LSLLQKATPP TWSTLQKPVR ALLGNFDEIV RGEDHLLAVY EKVASPPASH QWSDDCSHGQ KGAGYTCGLW QLFHIVTVGA TEWNLMLLEE NSPNLLDLTD TADTFRNYVQ HFFGCEVCRL NFVSAYDACA HDRCHRLDPT DQSRTAWIQL PLWLFETHNA VNARLLREQA EREGWNVTLA DQRAREFPSR HACPVCWKAD GSWDEDMVYQ FLRLEYWPED SVAVDLREQL AQRIRVQQEG WDSQRDPNDP DDDRNVPVPP VALQLVPLMV VVGLVAAWYT KRNERLRTGR HKRIA
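The sequences above are printed in space-separated groups of ten characters. The sketch below shows one way to strip that grouping, compute a GC fraction, and translate the coding region. It assumes the standard genetic code (the Translation table field in this record is blank) and a 12-base offset to the first ATG, as read from the start of the listed gene sequence. The function names are illustrative only.

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
# Assumption: the record leaves "Translation table" blank.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character groups and upper-case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str, offset: int = 0) -> str:
    """Translate from `offset`, stopping at the first stop codon."""
    s = clean(seq)[offset:]
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")  # "X" for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First groups of the gene sequence listed above.
head = "CACCTTTCCG CCATGAGAAT GACCGTTCTT TCC"
print(translate(head, offset=12))  # MRMTVLS
```

The translated fragment matches the first seven residues of the listed protein sequence. Passing the full gene sequence string to `gc_fraction` gives a value to compare against the GC content field.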