Gene Information |
Locus tag | PHATRDRAFT_31489 |
Symbol | |
ID | 7196051 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 200256 |
End bp | 201466 |
Gene Length | 1211 bp |
Protein Length | 391 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176543 |
Protein GI | 219109577 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGCTTCGG ATCGAACGAA CGAATTTCTT TCACTCGCCC AGAGTTTGCC GAGTGCGGCC GAATCCTCCA TAGCGCCCCT GCTCGGATCG TCCCACACGC CCTCTACATC GTCCTTCGTC GGTGTACTAC TACTACTACT ACGAATTCTA CCAAAGCTGC ACGATCCGCG CCACCGACTC CCGCGTACGC TGCCTTGCGC GAATTCCACC AAACAGCCGG CGATATCAGT CGTGACATTG CTTCGACGTC GGCCCTTCTC GCCGAACTCA CAACGCTGGT CCGTCACCAG TCCATGCTCC AAGACGACAG TGCTCCCGTC AACAATCTCG TCGTCCGCAT CAAGACCAGT ATTGAGAATT TACACAGTCG TTTGGATCAG GCCTCCAAGG TTTTGCAAAC GCAGAAACGG CAGTTGGGCA AACACAGTCA AGCCGGACAA GAAGCCACCA ATCTCGTCGA TGGACTCCAG GCGGAATTCG CACAGGCCGC TACAGGCTTT AAACGAGTCC TACAGCAGCG GACGGACAAT CTCAAAGAAA CCGACGACCG CCAACGACAA GTCTACGGAA ATGGCGATCA TGATGGTTTC CACGACGATC CCATGCCCGA CATGGGCCTC TTGGCCGCCC CACCGCCCGT CTACGGCGAC GCATCCAATC CTCACGCATC CTTTATGCTA GATTTGACCA GCAATTTGCA ACAACAGACG GGCGGTGAAC CCACGTCCAG TAGTCTCCCC CGTCCGCACG GCATTGCCGC TCCCGGATCG GGCGGTCTCG AGTACGGAGT CCGGCAACGC AAACTCGGTA ACGCGGGCAC CCCGGACGCC GCCAATTTCT ATGGCCACAC CGGACCCTTG ACCCCCCTCG ATATTCAACG CATGGAGGAA GAATCCGGGT TGACCCAGTC ACTCCAACTC ATTCCTGATC AGGATTACAT GCAACAACGT GCCGACGCCA TGTCCACGGT CGAAACCAAC ATTGTGGAGC TGGGCACCAT TTTTAATAAA CTGGCCGTCA TGGTATCCGA ACATCAAGAA ATGGTACAGC GCGTGGAAGA CAACGTCGAA GACGCCAACA CCAACATTAG TTTGTCGCTG GAAACGTTGA CGGACACCTT GACCAATCTG CGCAGCAATC GACAACTCAT GCTACGGCTC TTCTCCGTCC TGGTGGTTTT CATTATTGTT TTTGTAATCG GCTTTGCGTA A
Protein sequence | MASDRTNEFL SLAQSLPSAA ESSIAPLLGS SHTPSTSSFV AARSAPPTPA YAALREFHQT AGDISRDIAS TSALLAELTT LVRHQSMLQD DSAPVNNLVV RIKTSIENLH SRLDQASKVL QTQKRQLGKH SQAGQEATNL VDGLQAEFAQ AATGFKRVLQ QRTDNLKETD DRQRQVYGNG DHDGFHDDPM PDMGLLAAPP PVYGDASNPH ASFMLDLTSN LQQQTGGEPT SSSLPRPHGI AAPGSGGLEY GVRQRKLGNA GTPDAANFYG HTGPLTPLDI QRMEEESGLT QSLQLIPDQD YMQQRADAMS TVETNIVELG TIFNKLAVMV SEHQEMVQRV EDNVEDANTN ISLSLETLTD TLTNLRSNRQ LMLRLFSVLV VFIIVFVIGF A
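The length fields in this record can be cross-checked against the coordinates and the protein length. The sketch below is illustrative only and is not part of the source database. The helper names (`span_length`, `coding_length`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span runs from the start codon through the stop codon. Under those assumptions the span is 1211 bp, the coding portion is 1176 nt, and the 35 bp difference would be consistent with the record marking the gene as spliced.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


def gc_percent(seq: str) -> float:
    """GC percentage of a sequence; spaces between 10-base blocks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Values copied from the record above.
gene_len = span_length(200256, 201466)  # matches "Gene Length | 1211 bp"
cds_len = coding_length(391)            # 391 aa plus a stop codon
non_coding = gene_len - cds_len         # bases in the span that are not translated

print(gene_len, cds_len, non_coding)
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the reported GC content of 55%. The function strips the spaces between blocks, so the field can be pasted in as shown.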