Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38005 |
Symbol | |
ID | 7202717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 637916 |
End bp | 640933 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182106 |
Protein GI | 219123591 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCGTG AAGTGAGACT ACTCTTGCGG CCAATGCGAG GGCTTTGTTT CCTGGTGTCG GTGGTATACG TGGTCGCCGT CTCCGGCCCC TCGTCGTCGA CGACGACACC GGGGTATGTT CCTCCTGATC GCCTTTGGTT GGACGTGGAC AACACGTTGT ACAGTGAATC CATCCTATCG GGCATCGGAA GAGGCATCGA AGCGCAAATC GTACGCGGCG TTCACGAATT TTATGAGCGT TTCGAATCGG ACGATACCGA CAAGGTCGCT GCTTCAGCAT CACCACAAGA ACGAGCCGAC GCCCTTCATC AGCAGTACGG GTCCACAATT GAAGGCTTGC GACAAACACG GTGGAAGAAT CTTTTGCCAA ACGAGTTATC GGCAAAGATG CGGAACTTTT ACGAACGTGT CTACCAAGAC GTGGACGTAA CCGCACTACT CGACACCGAC CGTTCCCGTC ACTCTCACGG AGGAGCATCT TCCACGGGTT ACTCACACAA CGCTGTGGCC CAGCAACAAA CCTTGTTACG CGATGCGTTG CGATACTCCC CCGTACCCGT TGGCTTGGCC TCTAATTCGC CCAAGCGCCA TATTTCTAAA GTCATCCAAG CTCTTGGACT GACCCGTATT CCATGGCACG CCGTCTGCAC ACCCGACTGC GCCGACGCCC CGTCTCTGGG AGCGAATATT CCTTTCCGTA GTGACACTGG CTGCAAAGAG GATTTTCCCA CAAAGTTGTC ACCAAACTTT TTCCCGAACG TGCGGGGTAT CTGTGAAGTA TTGCTGGACG ATTCACCAAC CATCCTGCGG TCTGTGGAGG CTTTGCAGAA GAATTTACAG GGGATTCTCG TGTCGGAAAA ATCTTCGTTG CTGCAGGGAC TAGGGCAGGC AATTGGTTGG ACAGATCCGG CCTTCGAATT TTCCCAAATG GAGTATTTGC GGGCCAAAAA CGTCGTCGAT ATGGAATCGA TTCACGAAGG AACCTGGCAA CGACTTGGAA TGGAGCTGCG AGCACAACGC AAGGAGGAAT TCAACAACGC GAAAGGTACT CTAAGTACAG GCTCTCCGCT ATGTGTAGTT GACGTAGGGG CCGGGCTTCT CTCCATGTTG CGACTGATAC TGCACGGACA CGGAGCCCGG TTACCTTCGT TGGTACACCT TTTACGAGAA AATCAACATC CTGGCGTCTC CTCTCTGGAA TACTACGCCT ACGAACCAAA CCGTGAGTTA GGATATGCAG CTACCGTAGA GTTGGAGCGT CTGGGTTTTG GTTTGCAACA AACGTTGCAA TGGGAAGATT CGAGTAGTCC GACGCCACAG TCTTGTCAAG AATTTATCAT GGTCAAACCC GCGAACACGA CCGACGATCA GCCAAAGGTC ACTGTTTATC TTCGTTTTTG GGACTATCAA CGCGAAATGC ACCGACCGCA ACCGACGCCA CACGTCATAG TTGGTTGCTG CTTTGCCGAT CTCATGGATC CGTACGAATT GTCCAGATCC CTCCTTCGAC GATTTCTAGC ACCGCCATCT TTGAGTCATT TTGACCATAC CCTAGTCTAC TTCCCCATAA CCTTTTGCGG TGTCACTCAG TTCTTGCCAC CCCAACCAAT GGAATGGATT GCAAACATAC CGTCGGATAC AACTGCATTC GCACTCTACG CCAAAGCGCT CCGGGAGATT CACGGACACA GCTTGGATCC GTATAGTTTG GAACAGGCAT TGGGGGACTA CGGCGCGACA TGTTTGGCAA GAGGTCAATC TGACTGGCAA ATCGATCCTT CTCGTGACGA ATACTTGTGG GAAACTATGC TGTATTTTTT CGGAACCGTC ACCTCGAGTG TACTCGAGAA GGCGGCATGG AATGCTTTGG GTTGGTTGGA ACGGACACGG GGCCTCAGAC CTTCGATTCA AGTTTCCAAC ACGGACTTGC TCTTTCGCTT TCCGCATGTG GGAAGTTGGC AGGTAAAATC TGAGCAGTCG AGTGATACAT CGCGAAATCA GACACACACG TTCCAGGAAA TCCAATTTAC GGCTCCCTTC AAAGTGAAAG CAATATCAAG AAAGCTCGTC GCCCTTGGAC CCAATCAAGT CCGCATTCGA GCCATACACT CACTAATTAG TTCTGGGACG GAATTGAAGA TATTCAAGGG GCTATTTGAA GATGCTGCTC TGGACCTTAA CATAGAGGGA ATGACAGAGG AGCGCATGTC TTATCCCCTT TCATATGGCT ATTGCTCCGT CGGTCGTGTT GTGGAGTGCG GCATGGATAT TTCCAATCCA GGGGACATTT TGGGCAAGCT GGTATTTACT TTTTCGTCTC ACGCCTCGGA GGTAGTAACG GATAGAGATG CCATACAGAT AGTACCTGAC GGCATCGGCG CTCTTGACGC AATATTTATG CCGTCGGTAG AAACAGCCTT GTCGATTGTC CACGACGCTC ATATTCGTAT GGGAGAAAAC GTGGCTGTTT TCGGCCAAGG TCTTATTGGT CTCTTGGTGA CGGCGCTGTT TTCTAAGCAA GGCTTTGATA CTTCGGGACG ATTGCGAGCG TTAACGGTCT TTGACATGCT TCCCGATCGT CTTGCGATGT CAGCACTGAT GGGAGCGACC CAGGCGCTTT TGCCATCTGA AGTGAAGACG GCGGGCCCTT TTGACGTGGC AATTGAAGTC AGCGGTAACG GCCGCGCTCT CCAAGCAGCG ATCGACAACG TGAAAGAAGG GGGGCGTATT GTCATCGCGT CATGGTACGG AAGCACTGCT GTAGATCTTA ACCTTGGTAT TGAGTTCCAC CGCAGCCACA AGATTTTAAA AACTTCACAA GTGAGCAAAA TCCCTGCCGA GCTTGGATCG ACATGGACCA AGGAGAGAAG ATTTGCGCTA GCGTGGGAGC TAGTGCGAGA ATATCGTCCG TCGCGACTCG TAACAAAAAG GACGAAGCTA GAAGACGCTC AAGAAGCTTA TGACGCCCTA GAGAACGGCT CCGAAATTGC GATTGCTTTT GATTATGATT TAGCATGA
|
Protein sequence | MRREVRLLLR PMRGLCFLVS VVYVVAVSGP SSSTTTPGYV PPDRLWLDVD NTLYSESILS GIGRGIEAQI VRGVHEFYER FESDDTDKVA ASASPQERAD ALHQQYGSTI EGLRQTRWKN LLPNELSAKM RNFYERVYQD VDVTALLDTD RSRHSHGGAS STGYSHNAVA QQQTLLRDAL RYSPVPVGLA SNSPKRHISK VIQALGLTRI PWHAVCTPDC ADAPSLGANI PFRSDTGCKE DFPTKLSPNF FPNVRGICEV LLDDSPTILR SVEALQKNLQ GILVSEKSSL LQGLGQAIGW TDPAFEFSQM EYLRAKNVVD MESIHEGTWQ RLGMELRAQR KEEFNNAKGT LSTGSPLCVV DVGAGLLSML RLILHGHGAR LPSLVHLLRE NQHPGVSSLE YYAYEPNREL GYAATVELER LGFGLQQTLQ WEDSSSPTPQ SCQEFIMVKP ANTTDDQPKV TVYLRFWDYQ REMHRPQPTP HVIVGCCFAD LMDPYELSRS LLRRFLAPPS LSHFDHTLVY FPITFCGVTQ FLPPQPMEWI ANIPSDTTAF ALYAKALREI HGHSLDPYSL EQALGDYGAT CLARGQSDWQ IDPSRDEYLW ETMLYFFGTV TSSVLEKAAW NALGWLERTR GLRPSIQVSN TDLLFRFPHV GSWQVKSEQS SDTSRNQTHT FQEIQFTAPF KVKAISRKLV ALGPNQVRIR AIHSLISSGT ELKIFKGLFE DAALDLNIEG MTEERMSYPL SYGYCSVGRV VECGMDISNP GDILGKLVFT FSSHASEVVT DRDAIQIVPD GIGALDAIFM PSVETALSIV HDAHIRMGEN VAVFGQGLIG LLVTALFSKQ GFDTSGRLRA LTVFDMLPDR LAMSALMGAT QALLPSEVKT AGPFDVAIEV SGNGRALQAA IDNVKEGGRI VIASWYGSTA VDLNLGIEFH RSHKILKTSQ VSKIPAELGS TWTKERRFAL AWELVREYRP SRLVTKRTKL EDAQEAYDAL ENGSEIAIAF DYDLA
|
| |