Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37466 |
Symbol | |
ID | 7202374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 208934 |
End bp | 210127 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181680 |
Protein GI | 219122703 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCCT TGCACTGCCG CTCCATCGGA ATGTTCAGTC TTATCTTGTG GAGTTTGGCG TTCGGTACAA CGGTAGCATT GTCGTCGCTT TCTGGGCCAA CAAATAGTCC AACCGCGAGC AAGAAACGTG TGCACATCGT GACGGGTGCC AGCGGATACG TAGGCCGAGC CATTGTGCAT CATATTTGCG AAAACGCTTC AATATCGCTT ATTCAATCCG AGGTACATCA TTGTCAAGAC GTTTTGTGTT TGGTACGACC AAATCGAGTG GCGACCGAGC AAGCGTACTG GAACATACTT TTGCAAGATA TCGCATCGCC CGTGTCCGTC CGTGTCCTTC CCTACGATAT GTTGGATGGT GGAGCAAGTC TTAAGGACGC ACTCGCATCT GTGGTGGTGG AACAAGATCA CGCCGAGACG TGTGTCTATC ACGTGGCTTC CGTGTTCGGT CCAACCGAAG ATCACCAACA AACGGCACTA GACAATGTAA AGGGAACGGA AGACTTGGTG CGTACCTTGG TAGATTCTGG CATGACTTGC CGGCTCATCA TGACTTCGTC TATGGCGGCC GTTCGAGGCT CTGGACAAAG GCCACGAAAC GGAAAGTATT ATACCGAACA AGACTGGAAC ACAATTAGCC TGTTGGGTGC CAACTGGGGC GCCAGTTATC AATGGTCCAA AGCGGAATCG GAACGCAAAG CCTGGGAGAT CTGCCGACAC CACAACATTC CAATGGTGGC ACTTTGTCCT TCTTTCGTCT TTGGACCTCC TCGGGATTCG ATTAATAGTA ATTCATATTC TATCACTTTG GTTGGTCAAT GGGCGAGAGG GGAATCTCAA GTGCAAAGCC GTCTTTTTGT TGATGTACGA GACGTCGCTG CAGCACATGT GGCCGCCGCC ATCGAGCTGG AGGCTGCTGG CCAACGGTAC ATCGTTTCTT TGGAAACGCG AGCTCCTAGT CAAGACATTG CGACGTGGTT GCGAGAGGTA TGCCAAACTA CCGGACTGTC TGATCCGGAA AAGGTTCATT TTGACGGAGA ATTTGACGGT GGCGCAATCC CTATCGGAAG CAAAGAAGTG GACGCAATCG ACCGGCTACG AAGGGAACTG AGAGTTACAT TGCGTCCTAT CAAAGACACG ATAAGGGACA TGGCTGGAAA CTTACTCAAA GAAACCGCGC AAAATGACTG CTAA
|
Protein sequence | MNPLHCRSIG MFSLILWSLA FGTTVALSSL SGPTNSPTAS KKRVHIVTGA SGYVGRAIVH HICENASISL IQSEVHHCQD VLCLVRPNRV ATEQAYWNIL LQDIASPVSV RVLPYDMLDG GASLKDALAS VVVEQDHAET CVYHVASVFG PTEDHQQTAL DNVKGTEDLV RTLVDSGMTC RLIMTSSMAA VRGSGQRPRN GKYYTEQDWN TISLLGANWG ASYQWSKAES ERKAWEICRH HNIPMVALCP SFVFGPPRDS INSNSYSITL VGQWARGESQ VQSRLFVDVR DVAAAHVAAA IELEAAGQRY IVSLETRAPS QDIATWLREV CQTTGLSDPE KVHFDGEFDG GAIPIGSKEV DAIDRLRREL RVTLRPIKDT IRDMAGNLLK ETAQNDC
|
| |