Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46243 |
Symbol | |
ID | 7201200 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 704165 |
End bp | 706453 |
Gene Length | 2289 bp |
Protein Length | 640 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180490 |
Protein GI | 219119460 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCTATCGTTC GCATTGATGT GAGGCAATCA TAAAAACTTT AGCACTTCTA CATTGGACGA CGTTTTCTTG AAAAGTCAGT GGAGCCCCCG ATATGAGTGT GACTCGCATG ATGATCCGTC ATGGCTTTCG TCTCGAAGTA ATGCCGCCCA TTATAGCCGA CGTACCCAGT TTCCAAGACA CTCTTCTTCG AAACCTTTCC TGGGAATGCC CCGGGCCTTC ACGCGCATTC CATCCGGTCT GCATCGTGGA AAAGATCTTG TACCGTAACC CTTCCGATTT TGCTTGGCAT CACATCGGGC TTCTTTTGGC GGTGTCGATT GTACCTCTCT TGCTCGTCAT CTGTTTTGCG TTCGAGTTGA AGATAGTTGA TTTCGGTCTT TTCGGTGAAC CCTATAATCA GGCCAAGACT GCTGATGTTG CGGTTCCCCG GGATATTGCT CACAAACCGA GCGAACGAGA AATCGCAGAC ACTGATTCAG AGCACAGCGA TGAAGAACCT CTAGGATGGT CTCCTTCAAG CCCACTTGCG ATTGGCGACG AAAGCGTGGT GTGTTGGACT CCAAGAACTG CAGAAGAGGA TTCATTCGTT TTGACTCCTC AGGTTGCTGA AGAGGAAATT CTTGCAATTG AAGATGGCGA GCACCAAGAT GAAGAGGGTG TAGTTGAAGA TCAAGATACT TCAGCTACAA AAGAAGGACC AGAAGAGCTC GTTGCAAATG ATGAGGTTTC TAACGAAGAC ACAGAAGCCC GTGTTTTGTT CTCGGACGAA GATGAATATG TCGAGCCGAA GTCGTTGGTT CAAGATGTCG CTGTCGAGAC AATGTGGATA AAGAGAAATC AAGCAGCGCA AGGTTGCTTC GCCGACTGGA CTGCCTCAGC CGTGGCTTTG TCAACCGACG GCGATGTAGC GGCCATGGCA ATTCCTGGAG TTGGACAGTG CAGCGTAGTG CGTGTGGTCC AATATGATGC GGAAGGTGGA CATTGGAACA GTTATGGTCA AGAAATTAAA GTATTGGTAG GCCACTCCGA AAAGAGCATA GCGCTCTCTG GGGACGGAAA GATCCTTGCG GTTGCTTCTT CCAGCACCCC TGCTACCGAC GGTTCTGCCC TCGTCAAGGT TTTCCAGTAC AACGAAGTGT CTGCTGCATG GGAAATTGTT GTTGTGATTC CGGGATTTTC AAAGGATACG CTTGGTGGAC GAATCGCTTT ATCAAAAGAT GGTTCGATGC TTGCTGCCAC TTCTTTGGCT CCTCCGAATG AGCCTTCGTT TCTAAGTAAG ACAAAAACGT ACCGAATTCT ACCACAGAAG GAATGCTTTA TTCAGATGGG TCAGGATATT TATGGAAGCT ACGGTAAGAA TGATTGCTTG GCGTTACCAT AATCCTGTTT GCGTCTGCTG ATCTTTTTCA TGATTCTATA CAGGCATTGG TGACGATCTT GCGCTCTCTG ACAACGGGAC CACGCTGGTA TTAGGAGCCT CCAAGCACAA CTCATGGCGA GGACAAGTAG CTGTATGGAG TTACGATAAT TCCGAAAACT ACTGGCGCCA ACTCGGTCAA ACTCACGAAC TGGATGGAGA GAATCCAGGA GACCGAGCGG GCTTTGTCTC CATCGCCGGG GATGGTTCGA AGATTGCCAT TGGATCTCCC GCGAACGATG AAAAGTAAGT AGTCTGTTAC ACAAGTACAT AACTTCCTTC TTGGAAAGCC GACTAACGAC CAACTTGGTA TTCTGCGCAG TGGTCTTTTA ACGGGCAAAG TCCGAGTATT ACAATACTCC AGCACGGAAT GTTGCTGGAA ACCTATGGGC CAGATTTTGG GAGATAAAGA AGGTGAAGAG CTTGGCAGTG CTGTCTCGTT GTCGAACGAT GGTTTGTCAT TGGCGGTGTC TAGTAACGAG GTCCAAAGTG ACGAACACAC TTCAGGCGTG GCTCGCATGT ACCAGTACGA CCTGCCTACT TCCTCCTGGA AGCAGGTTGG TAAAGATATC AGTGGTGTGA TGGGCCAGGA AACCGGAAGT AGAATTGGGA AGTCTTTGTA TGGCTCCGTA ATTTCTCTAG CTGCGAACGG TAAGGCCTTG GCGGTGTGCC CACGGCTCAA TACGACAGAG GATCCGCTAG GACATGTCCA GATCTTCCAC CTGCTGGAGT CGGAAACATA GGTTGTTGCA TTGCTTTGAA CGTACACACA CCAATGAAGA CACAAAGAAA GCTTTAACCA TCTCCCATGA CCACAGCTTC AATATCTTGC ATCCATAATG TTTATATAAA AATATTTGCT TCATTATTT
|
Protein sequence | MSVTRMMIRH GFRLEVMPPI IADVPSFQDT LLRNLSWECP GPSRAFHPVC IVEKILYRNP SDFAWHHIGL LLAVSIVPLL LVICFAFELK IVDFGLFGEP YNQAKTADVA VPRDIAHKPS EREIADTDSE HSDEEPLGWS PSSPLAIGDE SVVCWTPRTA EEDSFVLTPQ VAEEEILAIE DGEHQDEEGV VEDQDTSATK EGPEELVAND EVSNEDTEAR VLFSDEDEYV EPKSLVQDVA VETMWIKRNQ AAQGCFADWT ASAVALSTDG DVAAMAIPGV GQCSVVRVVQ YDAEGGHWNS YGQEIKVLVG HSEKSIALSG DGKILAVASS STPATDGSAL VKVFQYNEVS AAWEIVVVIP GFSKDTLGGR IALSKDGSML AATSLAPPNE PSFLSKTKTY RILPQKECFI QMGQDIYGSY GIGDDLALSD NGTTLVLGAS KHNSWRGQVA VWSYDNSENY WRQLGQTHEL DGENPGDRAG FVSIAGDGSK IAIGSPANDE NGLLTGKVRV LQYSSTECCW KPMGQILGDK EGEELGSAVS LSNDGLSLAV SSNEVQSDEH TSGVARMYQY DLPTSSWKQV GKDISGVMGQ ETGSRIGKSL YGSVISLAAN GKALAVCPRL NTTEDPLGHV QIFHLLESET
|
| |