Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48096 |
Symbol | |
ID | 7203266 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 201554 |
End bp | 203275 |
Gene Length | 1722 bp |
Protein Length | 347 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182487 |
Protein GI | 219124390 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGGTGA CCATACTTTT GGCCATTCTC AACAACCGTC GTGCTTTAGC CGCCGATGAA AATAGCCGCT TCTCCTATCC CGGTTACGTT ATTACCAGCA TTGACGTCGA ACCACTTGGA AAGGGCCATC CCACCTCTCG CGTCAAGTTT CACGCCAGAA TTCCTGGGAA AGTGGACGAC ACGACCGCAC ACGTCTTACG CGACGTTGTG GATTCCGGTG AAACCGTTGG AAAATGGCCG TTTCGGAGTC CTGTGTTTCA CGGCACGGTG GTGCCTCCCG GCCAGCTACA GGCCAGGTTG GATGCCGAAT CTCCCCGCGT GTCGGATCGT TGGAATCCAC TGCAGCATCG CAAAGCATTG CTCTGCGTCC ACGGCTTTGA CTCGCAACCG GAATCCTGGT TGCGTAGATG TGCGGAAAAC TACGGAAAAA GCGACGAGTT GGCTATTATT CCAGTTATTT GGCCAGGTAC GTTCTTCAGG TGCGGTACAA ATCACGTTTT CCTTGCGAAC GCACCTCCTG ACTCTTTCAG ACTTGCAATT CATTGATTCT CAGCCGGCGA TAAAGGGCTT CGTGACTACA TGAATGATCG AAATGTCTTC GTTCCCGGGG CTGCCCGAGC CTTTCAGCCT TTGTTGGACG TGGCCAGTTC CTTGCGCAAA TCATTGCTTT GTCACAGTAT GGGTAATTTC GTGCTCAAAC TCACGGCACC TCCGCACCAG CCATCCCGTA CCAGCGGCTT AGCGAAGCGC AAGCTTTCCG CACCACCTTT TGAGGATGTG TACATGTCGG CCGCCGATGT GCGGCATGAT GTATTTGATC GCGCCCAAAA CGATCACGAC GATGCTAATT TGGACTACGG TCGTAATATT GCTGGCATGG CGCAAAATAA GGTCCATGTG ATGCACTCCA GAAGCGATGT CCCTTTGATG GCCCGCCGAG CGCGGCATGC AGGCTTACGA GCGTTGGGCA GCAATGGCGC CAATATGAAA AACCTGCACC CGAGTTTGAA GGGTAAGGTT GTCAATGTGG ATTGCTACTC ATGGAACAAG TGGACTCGAA AGAACAATCT TTATCACTCT TATTTCTTCA AAAGCGAGGC TATCAAATAC TACGAACAGC GAGACGCCTA GAAATGAAAC TGGCGTAGAC AATTTGCTAA TTAGCTATAA GAACCGCAAT TTCTTGCACT TCCACACAAA CTCTCACTTT GGTCTGTCTC AAGTTGTATC TCTGATTTAG AGAAAGCCCA ACCAATCAGC AGCTCTCTAC TGTTAATTCG TTGTTGATAC TTACACTAAC AAGTAATTTC TACGCATGAA ACGTATAATG TGTTTTTATA ACAATACCCA AACAGCGCTT GATAGGAACG ACCATACCAC GATGACAGCA CCGCTTCCCG TCTGCTCAGC GAAGTTAAGC ATCGTCGGGC TCGGTTAGTA CTACGGTGGG GGACCACGTT GGAATCCCGG GTGTTGTTCT TTTTGGGGCT TCTGACCACG TAGAGTAACT TTTTTTTTAA AAAACAATTG TGTTTTATGC GATTAACTGA CCGTAAAACC TGGTTTTGGT TGTATTCATT TGGTTGAAAT TTTCAAACAA CACATTAGCT TTGGGTTTTG AAGTTCCCCG CGACTCGGCA CAGCGTACGA TACGCCCAGT AAATTCTTTC CCGTCACGAT CCCCCACAGC ACGGCAACTT GTGTGTTGAC GGCGTAGGTG TT
|
Protein sequence | MWVTILLAIL NNRRALAADE NSRFSYPGYV ITSIDVEPLG KGHPTSRVKF HARIPGKVDD TTAHVLRDVV DSGETVGKWP FRSPVFHGTV VPPGQLQARL DAESPRVSDR WNPLQHRKAL LCVHGFDSQP ESWLRRCAEN YGKSDELAII PVIWPAGDKG LRDYMNDRNV FVPGAARAFQ PLLDVASSLR KSLLCHSMGN FVLKLTAPPH QPSRTSGLAK RKLSAPPFED VYMSAADVRH DVFDRAQNDH DDANLDYGRN IAGMAQNKVH VMHSRSDVPL MARRARHAGL RALGSNGANM KNLHPSLKGK VVNVDCYSWN KWTRKNNLYH SYFFKSEAIK YYEQRDA
|
| |