Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39045 |
Symbol | |
ID | 7194725 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 204495 |
End bp | 205601 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183177 |
Protein GI | 219125835 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTAAAGC CTCTCACCGA TCAACCCGGA ATCGACTCGG ACGACGCCAA AATCATTTGC AATGAATTTC ACAAGCATCT CGACGTTGAC GAGATGATGG AGACTGCACG GGACAGCCCA TGGTGGGTTT CTCTTTCCGA ATCATCCGAT GTGAATGTTA TCGCGACTCC TTCATCCGAT CCAAGTGATG GCATCGGACT TATTCTGTCT AATTCCTTCA CAACCAAAAA TTTCTCCTAC ATAAAAGAGG GAGCTCTCCA GCATCCACAA GTTAAAGCTA GTTCGGACAA GGTTGCTGCA GACTGGACGA ATCCGACATC TTCGCGAAAG ATAAGTTCTG CTGTTTCGTC TCTGCAGCTA TCTGACGACA TATCAGCTTG CGTGAAACCT CATAACTCGC AAAAAAATGA AGTCGTGAAA AACCCGAATA ACAGTAAGCA GATTCAGTCT CCGTATTTGC TTTGTCCTGT CAAATCTCAT GGCGATGGCC AAGGTGTTTT GCGATTCTCG GGCTTTGATC TTCCACAAAT CAAAGCATCA ACACTCTCCC CTTCCACTTT TCCGCCGACC ATAGCTAGCG TCTTGCAGGC ATTGGAACCA ACTCCGTTCG CTCCGGGCCA GAAAAGTCTT CGCCGCGTCG TTTCCAAAAG CGAATCGACG ACATCTTCAT GCAAGGACTT AAGGCCGACC TACACTACAA GCAGGCAAAC CAATGCGGTG TTTGCTGATG CTTCTGTGGT CCAAGATCTT CCGAGCGCCG TGTGTACTGT CCCATCAACC GTTTCTCCTT GTGGCAAAAA GCTTTCGCCG TTGCGAGCGC TTTCTGCCTA CAACTTCTTT TTCAGAGACG AGCGGGAACG GATTCTCCAT GGAGGTGCCG ATGACTGGAC TGCCGCTCGT CACGCAGCCC TCTTAACCTC ACACTGGTTA AAGGATCGCA CCAAGAAACG ACAGCATCGC AAGAGCCACG GTATGATCGA CTTCACGAGT CTTTCCAAGC TCATCTCGAT GAGATGGAAA AATTTGCACC CGGACGGCAA AGGCTTTTTC CGTCAAGTTG CAGCCGCTGA CTGGAGACGC TATCAGGACG AACTTGCCAA GGAATAA
|
Protein sequence | MVKPLTDQPG IDSDDAKIIC NEFHKHLDVD EMMETARDSP WWVSLSESSD VNVIATPSSD PSDGIGLILS NSFTTKNFSY IKEGALQHPQ VKASSDKVAA DWTNPTSSRK ISSAVSSLQL SDDISACVKP HNSQKNEVVK NPNNSKQIQS PYLLCPVKSH GDGQGVLRFS GFDLPQIKAS TLSPSTFPPT IASVLQALEP TPFAPGQKSL RRVVSKSEST TSSCKDLRPT YTTSRQTNAV FADASVVQDL PSAVCTVPST VSPCGKKLSP LRALSAYNFF FRDERERILH GGADDWTAAR HAALLTSHWL KDRTKKRQHR KSHGMIDFTS LSKLISMRWK NLHPDGKGFF RQVAAADWRR YQDELAKE
|
| |