Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47544 |
Symbol | |
ID | 7202781 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 70783 |
End bp | 73859 |
Gene Length | 3077 bp |
Protein Length | 935 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181993 |
Protein GI | 219123358 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCACCTTTCC CTTTCCCATC GCGTCCCCCA TTGTCATTCG ATATAGACTC GTGGCCGAAA CGAGCGAGAA TCATCCTAGT CATAAGAGAG AGCGCGAGCA ATTGCGGTAA GAAGCTATTG TAGAGGGGGA AGTGAACCCA ACCTTTGTTT CGATTCACAC GTCCAAGCCG AGCAAAAAGC TAGACCTAGG CTTACGGCAA TTATCTTTTC CCGACTTTTT TTCAAAAGGA CCACTATCAT GCGTATGCAA GTGCTTTTGC TCGTGAGCGC CATTAGTGGA ACAGCGGCTA GGCTCGGTGG AATCGCACCG ACGTCTATAC AGGAGAGTGC GACCGCTTCG GTTTCGACGG CCAGAACATC CGGCATTCGT TCGCACTATC CCATCAAGCA ACGTCGTGAG CTCGAAACGG AGCAGCTTTC GTGGGCCGAC TTCGAGGAGT GGCGCATCAG CTACTCAGAA GATAGTCGAC CCAAAGTCCT GCCTGCTTTT TACGTCGATT TGTTCGAGAC TAGTGGTGAC TTGTCGGCGG ATTCTCTTGC CCTCTACGAG ACCGCTATGG AGCAATTTCT GACCGCCGAG CTGGGTGCCG TCTACAATGA ACGCCCTCGC GTTGCAAGTG TTCGTGCCAA AGTGCTTTCC CAACGCGTCC TTTCACAGTC CGTCACGCGC CAACGCCGTC TTGACGACAC TCTTGGTGAT GGTGGTACCA CCGGAACAGA CGGTGTCATG ATAGGGACGC AGTTCGAAAC GCAAGTTAAC GTGACTTTTG TCAATGATCC ATCACCCAGT GTGTACGCCA TGCGTACCCT CATAAGTAGT ATCATGGATC GCGACATGTC AGGCTTTGTT GGAAATCTAT CGGCTGCCGC GGGCGACGAT CCCGCTCTTA CCCCTACTGA GGCTAGGTAC CGTGGCGCAG AGGGGGATGG TGTCAGCATC ACAACCGACG ATCTCGTTGG AGGATCTGAC CGGGATGACG ATATCACCGA TGACGGTGTT GCTGGGATCA TTGATGGTGG TAACAGGCCG GTCGAGGAGG ATGATCCGAA TCTCACTTTT ATTGTTCCGA TTCTCGCTGC CGCTTGCATC CTCATTGCGC TCTTGGCGCT ACTGGTGACC CGTCGTAGAA AGCAAAACAG CAGCACGCAT CTAGATGCCG TTGATGATGA CATGGATATT TACACCGTGG ATGGGAGGGA TGTTTTCAGC GAGGAAGACG TTGTCAGTCC CAAAGAACAC AATCGTGAAA CCGAAACAAA GGTTGCACCT TCGTCTCCCG TGCTAGATTC TACACAAGGC GACAAGGGGG CTCTGGAAGA CGGCGACGAC GTCTTTGCTG ATTTGCCAGA CTCTCCCCAT CGCCAAACCG GGTTCGGATC CGTCTTTTCT TTCTGGTCAA ATTTTTCAAG GGCCTCTACT ATATTGGCTT CGAATCGTAA CAAGCCCGCC AAAGATTCGG AAGGGTCCTC TAAGGAGGCT GCCGCGATTG GTACTACGGC AGGTGTTGGA GCCATTGTCG TTGCAAGATC CAGCCGTCGG CAAAGATCGC CGGAGAACAC TCCAGGCTCG CACATGTCTT CGCTCTATAC ATCTGATGAG GAGGACGGAG TTCCAGACAC GTCTGGGAGC GATATAAATT CGACGTTTGA AACGAATGCC ACTACCGACT CCGTTCTTAT GTCCCCCCAA TTTGAGAAAA AGCGTTCTTT CGAAGAGCCC TCCTTCCACA AATCCCAGAT TCAGGCACCA ACAGCTATAG AAACGACCAA GGAAGTTGAT TCCACTCAGT TTGTATCCCA AGTTCTAAAG CAGACGCAAG ATGTCTCTGT CGCGCCTCAA GACGAGGACT CGGCGAAGGC GTCTTCGCTT GCTTTGGGCC TTGTCGGCGC GACGGCTCTT TCCGAGTCCC CAGATGTTGA GGTCCTTTCA CCTGAAGCTG CTGCAAACGA TCCAGCCGAC GAGAAAAGCA ATCAGGTCCC ACTCGGAGGG GCGGCGGTCG TCCTCTCTGC TGCCGATCAC GAGTCCGTCA AGTCAAGACC TCAAGAGGAT CAACCAGGAA GTCCTAGAGC AAACAAACGG AACTCAGTGA AATTGGAGTG GGACAGTCCA GAAAGAAGTA GCAAAAACAT GGCGATAGCT AAATCCAATG TCGAAAAATT CTCTCCGAAA GCAGCTAGAA GCTCTGGCGG TCCTTCACTC ACCTCGACTC CCCGACGAAG TGGGCGGCGG CATGCTAAAT CCACAACGGG AGATGGCACT ATGGATTATC AGGCACAGAC GATGAATGCT GGAGAAGCCC AGACTTCTTT TGACGTTTCA CTTAGCGATA CAGAAACACA TGGTCCTTCC GAAAGCCACG GAATCACTCG CAGCCTGTTT TTCAATACCT CCAAGATTAA GAGTCCGAAA TCACCCAGCG AGGCACATAA CGTTGCGAGC CCATCTTCAC CAGCTTCTAT GAGCTCCTTG GGATCCCGTA ACAGCGTCTC CGTGAAATCC GGGGCATCGG AGCAGTCCGC TAGTCGCAAA GTGATAGCGG ATTTGGTCTG GCTGGAAAAG AAGATTGCGG ATGCTAGTCG TAGAATTGCT ACATCACCAC AGTCAACGAA CGCCCCTGGC AAACCTAAAA CGTCTGCCAT AGACCAGTCT AGCGATTCAC TTTCCTTTGC ATCTAAGGAA GGAGTAGTTT CTGCCTCCAC TTCATGCGAT TCAACCTTTG AAGTTGGTTC TCCCCAGAGC GGTTCCGGGA TGCCTCAAGC CGAGGAGCTC ATTGTATGCC GCGACTGCTT TGCACCTCCG GGTAAACTTA AGATAATCAT CCATTCCACC AAAGATGGAC CGGCAGTTCA CACGGTTAAA GACGGCAGCA GTCTCACGGG TCACGTGTTC GCCGGAGACC TGATCATCAG CGTCGACGAT ATCGACACGA GGTCATTTAC GGCGGAGCAA GTTATGAAGA TGATGACAAG CAGGACCAAA TTTGAACGAA AGATTACCGT ATTGCATTTC GAAGCTGCCG TGGCGAAGCA GGAAGTAACG CTGTGAAGAA ATTTATTAGG GACAGGTTAT ATTTCGT
|
Protein sequence | MRMQVLLLVS AISGTAARLG GIAPTSIQES ATASVSTART SGIRSHYPIK QRRELETEQL SWADFEEWRI SYSEDSRPKV LPAFYVDLFE TSGDLSADSL ALYETAMEQF LTAELGAVYN ERPRVASVRA KVLSQRVLSQ SVTRQRRLDD TLGDGGTTGT DGVMIGTQFE TQVNVTFVND PSPSVYAMRT LISSIMDRDM SGFVGNLSAA AGDDPALTPT EARYRGAEGD GVSITTDDLV GGSDRDDDIT DDGVAGIIDG GNRPVEEDDP NLTFIVPILA AACILIALLA LLVTRRRKQN SSTHLDAVDD DMDIYTVDGR DVFSEEDVVS PKEHNRETET KVAPSSPVLD STQGDKGALE DGDDVFADLP DSPHRQTGFG SVFSFWSNFS RASTILASNR NKPAKDSEGS SKEAAAIGTT AGVGAIVVAR SSRRQRSPEN TPGSHMSSLY TSDEEDGVPD TSGSDINSTF ETNATTDSVL MSPQFEKKRS FEEPSFHKSQ IQAPTAIETT KEVDSTQFVS QVLKQTQDVS VAPQDEDSAK ASSLALGLVG ATALSESPDV EVLSPEAAAN DPADEKSNQV PLGGAAVVLS AADHESVKSR PQEDQPGSPR ANKRNSVKLE WDSPERSSKN MAIAKSNVEK FSPKAARSSG GPSLTSTPRR SGRRHAKSTT GDGTMDYQAQ TMNAGEAQTS FDVSLSDTET HGPSESHGIT RSLFFNTSKI KSPKSPSEAH NVASPSSPAS MSSLGSRNSV SVKSGASEQS ASRKVIADLV WLEKKIADAS RRIATSPQST NAPGKPKTSA IDQSSDSLSF ASKEGVVSAS TSCDSTFEVG SPQSGSGMPQ AEELIVCRDC FAPPGKLKII IHSTKDGPAV HTVKDGSSLT GHVFAGDLII SVDDIDTRSF TAEQVMKMMT SRTKFERKIT VLHFEAAVAK QEVTL
|
| |