Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48375 |
Symbol | |
ID | 7203640 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011685 |
Strand | - |
Start bp | 268501 |
End bp | 272498 |
Gene Length | 3998 bp |
Protein Length | 1169 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182932 |
Protein GI | 219125322 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTCTA CCGAAGAATC GTCCATAGCG CTTTCATCAC GGCACGAAGC CACCGCTCTG CAGGATGCGA CCGACAAATC TTCTGGAAGC ATTCACAGTC CGGAAAAGCG ACTACAACTC TTTTCTCGGG AAAGACGAGT TGTCCGTCGA TGTCGCTTTT TGGTGTACGC GGTCCTATTT GCCACGGGCC TGATAGCGTC GTTGGGAACT TTCTTTTACA CTCGTAGTGC AGACCGCATT GAATTCCAGG AAGACGTCGA GGAAATTTCC AGCGTACTCC ATAGAAGTGT CAGGAATAAC CTCCGCACGA CCATAGAAGC TATTGACGCC TTCGCTTCGG ATGTGTCAAC GTTTGCCCAA TTTTCAAGTA ACGTGACTAC TTGGCCTTTT GTCACTATAC CACGGTTTGA AGAAAAAGGT GTGAAGCTCC AAAACGCGGT CAAAACGACC AATTTTTTAT TCCTTCCATT GGTCCCTGAA GCTGATCGTG AAGCCTGGGA AGCGTATGCG GTAAACCATC ACCAAAAATG GTTACAAGAA AGCTTGGACT ACCAGAGTGA GGTGGATGGG GCCACTCCGG TCAAGGCCGG GGCAATTTCA CCTTCAATAT ACAATGAGGC GGGGAGTGAA CTGGGGACAG GACCCTACCT TCCACTATGG CAGGTACGGA CAAATTTGAC AAAAAGGATT GGATACTGGT TCTCTTGTCC TCTCATGTAC TTTCGTTCTT TGATTTAGAC TACGCCGCAA GCACAACCAT GGAACTCATA CAACTTTAAC ATGCTCAATC AAGTGGAATT TGCTGAAGGG TTGCAACAAG TTTTATCTAG TCGAAAAGCG GTCGTGGCTG GAAGTTTTGC GGCCACCGAT GAAGTTCTCG GTCTCTTTTA CGGAAACAAA GAAAACTCGC AAGAAGGATT GCTGAGTCTG TTCTTCCAGC CAATCTATGA AAGCCACTCT GAACCTATGG GTTTGGTTGG AGTACTTGCC ACTCTCATAG ACTGGAATCT TGTTTTTGAA ACCGATGTTA CTTTCAATGC TGATGGTCTA ATTGCCGTGC TCGAAACCAA CTGTGGAGAA TCATCAACCT ATATGGTGAA CGAACATTCC TTGGAATTTG TTGCAGGTGA CCTATACAAT CACAACTACG GCCTCGTTGA TGACGAGTTA GATATCAGCT TGTTCGACTC GCTAGGACAG CTACCGACTC CGGTCGACAG CGAAAGTTGC CAGTACAGGT TTTCCACGTA CGATTCCAAA AAGAAGGAGA ACGGGGTTTC TTCTCAAGAT GCTGCTATGC ACAGCACATA CGTGGCCCTC ATTTTTGTGT TCACGATGCT CTTGTTCATC TTGTATGATT ATGTCGTACA GAAACAGCAA CAGTTGATAC TCCAACAGGC CGAAGAATCC ACGGCGCTGG TTTCGTCTCT CTTTCCTGAA GGTGTTTGCA ATCGATTGAT GCGCACCTCT AGAGTAGCAA CAGAAAACGG AGCAAACAAA ACTGGCTCCG CCATCAATGT ACCTCACGAC CATGACATTT TGGATCTGGA CGACTTTTCA GACTCATGCA TCAAAGGATA CTCAAAAGAA TCTCCCAAGT GGCGCCTCAA GTCCATGATT CGAGAGTCGA CAGCTGACTT TTCGGAGGTA ACAATCAACG AAACAAGACC TATTGCCGAC TTTTTTCCGA ATTGCACTGT AAGTCCCTTT TGAGTCAGGC TTTTTTGCTG TACTCTTCAC ACATTCGCTG AAATCATTTC ACTTTGTTGT TTCCAGGTTC TATTTGCCGA TATTGTTGGT TTCACTGCTT GGAGTTCTCA GCATTCCCCT GGTCAAGTCT TTACTTTGTT ACAAAATGTA TACGCAGCGT TCGACAAGAT CGCGCGGAAG CTTGATGTCT TTAAAGTTGA AACAATCGGA GACAGCTACG TTGCAGTAAC TGGACTACCA GACCACCAGG AAGACCATGC GCTTATTATG ACTGGATTTG CTTTAGAGTG CCGGAGCCGG ATGCGGGAAG TTACTGGAAA ATTGGAGCGG GCACTTGGAC CAGAGACGGG TGATCTTCGG ATGTAAGGTA CAAGGTTGAT CTGCAAAGAT GTTTACTTAT AACAAATATC TTATGTTTCT ATCTCCTTCT TTCCACAGGC GATTTGGAAT GCACTCTGGA CCGGTGACAG CAGGTGTTCT CCGTGGCGAT CGAGCACGCT TCCAGCTTTT TGGTGATACA GTCAACACCG CTTCCCGTAT GGAAAGCTCC GGAATGGCGG ATCGTCTCCA GGTGTCCGAG TCTACCGCCG TTCTACTGCG AAGTGCGGGA AAAGTGCATT GGATCCAAGA GCGAGCTGAG TCTATCCACA TCAAGGGAAA AGGGGATATG AAGACTTTCT GGATCAAACC AATGGAACGT GATTTGAGTT TGCCAAGCTC AGATAGCCTA GAAATCTCAT CTCAAGCAAA GTCGCCGCGC TCTTCAATGG AAAATTTTCG AGACGTGGAT GGCAATCACT CTAAGAGAAA GTTGCTTATG TCCAAAAACT CAATGGTCTG GGGTGATTCA CAAGAGTTTC TGGAGTCCAT GTCTCCGCCA ATGCTGAGAA GACACTCATC AGACACAGTT TGCCGCTTAA TTGATTGGAA CGTTGAAATG CTTTCTGGCC TCTTGAAACA ACTTGTTGCG AAGCGAGAAT ATCTTCAGGA AGCGCAATGA AAATGAAGAT GGTGCCGAAT TGGCACTTTT TACTGCAGCA TTTCCAGGCT CAACCGCATT GGATGAAGTA GTGGAGATAA TCACTTTGCC CAAATTTGAC GCAAATGCAG TTGCCAGTCT TCAAGACATT GACCCAGATT TGGTAACCTT CGAAAGCGAT GCAATTGAGA AGCAATTGCG AGACTATGTT ACTGCAATCG CTTCAATGTA TCGTGACAAC CCTTTTCATA GCTATGAACA TGCCTGTCAT GTACAATTGT CCATGTGCAA ATTGTTACAA AGAGTGGTTG CACCAGATGG GATTGACTGT GATGGAGATA CTTCCGCGTT TGCCTTGAAA GCGCATACGT ACACCTACGG CATCACTTCG GATCCCTTAA CTCAGTTTGC TGTTGTTTTC TGCGCCTTGA TCCATGACGT GGATCACACC GGCGTATCAA ATGGTCAACT TATCAAAGAG AAGGCGCATG TCGCTTCTTT GTATAGAAAT CAGAGCATTG CCGAGCAGAA TTCCGTGGAT CTGGCCTGGG ATCTTCTGAT GGATGAACGA TACATGGATT TACGAAAAGT TTTGTTTAGG ACCCAGTCAG ACTTCACACA ATTCCGTCAG CTTGTGGTGA ACGTTGTTCT AGCCACGGAT AATTTTGACA AGGAGCTTAG TACTTTGCGA AAGAACCGAT GGAACAAGGC CTTTACTGAT GTGCAAGTGG ACGAGTCGGC TAGCATTGCG GCAAACCGCA AAGCCACAAT TGTTATTGAG CATCTTATCC AGGCCTCGGA TGTATCTCAT ACTATGCAGC ATTGGGATGT CTACTGCAAG TGGAATGAGC GACTGTTCTA TGAAATGTAT GCTGCCTTCA AGTCCGGTCG GACGGACACA AATCCAGCAC CAGGATGGTA CAAAGGCGAA CTCTGGTTTT TTGATAATTA TGTCATTCCT TTGGCCAAAA AGCTGAAAGA TTGTGGAGTC TTTGGAGTTT GTAGCGATGA ATGCTTAAAC TACGCTCTCG AAAATAGAAA CATGTGGGAA GCGCATGGAG AAGCAGTTGT GGCCATGATG CTAACTACTG ACTTTTATAC GAAGTTGAAG CAAGAACGAA TGGCGCGGAT GCCTCAACGC AGGGCAAAAC TAGCTGTCGA TTTGTAACTT TACCCGACAA GCGAGTTCAA TCGGTAACAC CGTTAAGCAC AGCGTCACTT GCTTTACAGC ATAGCTTCTT AAATCTGCTA TAAAATGTAG TACTCTTCAC ATTTACAAAC TTCGTTAGCC GACAAGAGCT CTAGTAAT
|
Protein sequence | MASTEESSIA LSSRHEATAL QDATDKSSGS IHSPEKRLQL FSRERRVVRR CRFLVYAVLF ATGLIASLGT FFYTRSADRI EFQEDVEEIS SVLHRSVRNN LRTTIEAIDA FASDVSTFAQ FSSNVTTWPF VTIPRFEEKG VKLQNAVKTT NFLFLPLVPE ADREAWEAYA VNHHQKWLQE SLDYQSEVDG ATPVKAGAIS PSIYNEAGSE LGTGPYLPLW QTTPQAQPWN SYNFNMLNQV EFAEGLQQVL SSRKAVVAGS FAATDEVLGL FYGNKENSQE GLLSLFFQPI YESHSEPMGL VGVLATLIDW NLVFETDVTF NADGLIAVLE TNCGESSTYM VNEHSLEFVA GDLYNHNYGL VDDELDISLF DSLGQLPTPV DSESCQYRFS TYDSKKKENG VSSQDAAMHS TYVALIFVFT MLLFILYDYV VQKQQQLILQ QAEESTALVS SLFPEGVCNR LMRTSRVATE NGANKTGSAI NVPHDHDILD LDDFSDSCIK GYSKESPKWR LKSMIRESTA DFSEVTINET RPIADFFPNC TVLFADIVGF TAWSSQHSPG QVFTLLQNVY AAFDKIARKL DVFKVETIGD SYVAVTGLPD HQEDHALIMT GFALECRSRM REVTGKLERA LGPETGDLRM RFGMHSGPVT AGVLRGDRAR FQLFGDTVNT ASRMESSGMA DRLQVSESTA VLLRSAGKVH WIQERAESIH IKGKGDMKTF WIKPMERDLS LPSSDSLEIS SQAKSPRSSM ENFRDVDGNH SKRKLLMSKN SMVWGDSQEF LESMKRNENE DGAELALFTA AFPGSTALDE VVEIITLPKF DANAVASLQD IDPDLVTFES DAIEKQLRDY VTAIASMYRD NPFHSYEHAC HVQLSMCKLL QRVVAPDGID CDGDTSAFAL KAHTYTYGIT SDPLTQFAVV FCALIHDVDH TGVSNGQLIK EKAHVASLYR NQSIAEQNSV DLAWDLLMDE RYMDLRKVLF RTQSDFTQFR QLVVNVVLAT DNFDKELSTL RKNRWNKAFT DVQVDESASI AANRKATIVI EHLIQASDVS HTMQHWDVYC KWNERLFYEM YAAFKSGRTD TNPAPGWYKG ELWFFDNYVI PLAKKLKDCG VFGVCSDECL NYALENRNMW EAHGEAVVAM MLTTDFYTKL KQERMARMPQ RRAKLAVDL
|
| |