Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44527 |
Symbol | |
ID | 7198063 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 809000 |
End bp | 813641 |
Gene Length | 4642 bp |
Protein Length | 1533 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178309 |
Protein GI | 219115027 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGACAA ACGAACCGTT GCGTCCCGGA AGTGGGAGCA GTAGTAATGG GGGAAGGAGC CGTTGTGGTA GTGACAGTAA TAGCCACAGT ACGAACCAGA ATACCGAAGG CAGCGTCACC CAGGCTCCAG TCCTTGCACC GGAAGAAGGA CCTCCCGTGT TGCTGCGCGA ATGGAATCCG GCACCGCCGC CGCCTTCCAC TCCGGTGGCT CCCGCCGTGA TCTGGCCCGA TCGATCCCCG GAAGCGGAAG ATCGTCGACC TGCGTACTCG TATCCCGTAG CCACGTTTGT ACGCGATCTG GTCCGTACGG GGATGGACCA AATCGAGTCA GCCTGGCAGC AACAAGAGTC GTCGTCGACT GCACAGCGAG ATGGAACTGG TACACGCCCT CACGGAACCC ACAATCGAGT AGAAGACGAC GACAGCGATG CCTATCGCCC ACTCTCCGTG CGTAGTGAGT CGGACGCTCT GGCCATGCGC TCCGAGGCCG CGGCGGCATT TATGGAATCC ACGGCGACGG TGTCGGCCCG AACCATCAAT GATCTGGACG ACGAACACGA GGATGCCGAA TTCATACTCG TTGACGGCGA ACTGGTGCTC ATGCCGCCCC CGCCGCCCAA CGCGGCCGAG CAGGACGGTC TCGTCCGCCA TTTGCAGAAT CAGGAATTTG AACGACACTT TTCGTTGCGC GCCAGGTATC ACCCCGACCA TCCCGCCGTC AAGTCGTGTC AACCGAATCG CGTCCGCAGA GGTGTCCGGA AGCTTGGAAA GATTTTTAAG CGTAAGAACG GTCTCCGTCG AGCGTCAAGT AAGGAAAGCG CGCAATCGCA TGTGCACCAC GTCGATGACT CCTCGGCGCA CACCTCGCAC GACGGATCTT CGTTACCGTC TCTTTCCGCC GACTTCTTAA CCGCGCCGTC GGGAGCTGAT TCTATAACCA CTCTGAATCA CACCACAAAC AACGGTGGGA TCAATTCCAA CAATAATCCC AAATCTCCCC CCAGGAAGAA AGACAAACGC CGTGGAAAAC GACGTTCTTG GCGGCGGACT GACAGCAGTG GTGGAGAGAA CGACGACTTT GATGACGAGA CGTCGCCAGA CATATTCGAA AGCCGTTCGG GTGAATTCTC GTCGCGTACG GGTCCGCGCC CTGCCACTTT TACCACAATT TCTGCGCTCG CCAATCGGTC GGCCGCGTCG ATTCGACCCC CTATGCCTCC CATGACACCG ATTCAGGAAA CCTTGGACGC GAGCAACATG TTTTCAGGAG GAAGAGCTGC GTCCGTCGAA GCCAGATTTG TTGGGACAAC GGCGGACGTT CTCGAGGCTG GTGTCCTGCC GACCGGCTTT GCTGCGGAAG CCTACGTAGA ATACGACAGC GACGAGGCCT TGAAAAAGGA CGATTCTCTC GTAGCCAGCG TCGTACCTTT TGATGCCGAC TACAAGGTGG ACAAGGCCGA AGCCCCCGAT TTTGAAAGTA TGCTGCGGAG AGCCGCACTC GAAACCGTAT CGGATCTGGA GGCGTCAGAA AGCACGGCTG CCCAGCCCCT GCGTCCGAAA CGAGTCAAGT CAGACGGGTC TTCGGTTCCG CTCAGTCTCG CCACTGGCAG CGATGCGCAC GTACACAACG ATATGCTGAA AGTAGTCATG GTTGGAGCAC CAGGCGTGGA CAAATCCTAT GTCGCCCGAG CCATCCGGCA AAGTCATAAA AGGGGACGAA AGCGCGTCAC TTTGGGAGTC GACGTACACT CCTGGTCGCC CAGGTCGGAC GTAAAGTTTG CAATATGGGA AGTCCAAGGT GCGACGTCCC GAGACCACGG GGCGCCTAAT TTCGGTGCGC ACGCCGCAAC TCAGGCACTT TTCTTTTCTT CCTCGTGCCT GTATTTGTTG GTGTGGGATT TGGCTTGTCA AAACGTGGCT ACGAATCGTT GTCCCAGTCG TCGGCATGAC GGCGAAGATT GCTTCGACAG TTCCGAGGAC GAAGAAGAAT ACGAAGACGA TTTCTTGCGG GAAGAAGCCA ACCGTCAGGC TGATCGGGCC TTGTACGCAG ACATACAAAC ACGCCTTCTA TCTTGGGTGG ATACGATTGC CTTGCGCGGA CCTGGGTCCG CCATTTTGCC GGTAGCTCTT ATCCCTTCCC ACATGAACGA AATAGAGGTC AAGCGACGAT GTGATACAAT GCAGAATCTT CTCGAGAACC ACCTCCATCG TTTTGATGGG AGCGAATATT CTCCGAAACT GTTGCTGGGC CAAGATACGA TTCTTTGTGT CGACGAGGTG ACCGGGTCAG GGATCGAACA GCTGCAGGAA ACAATGGTGG CCATAGCCAC AGATTCATCG CGATCTGTAT TTGAACACAT GGGTGCACCA GTACCGACGG GGACAGGCAG GGTACTGGAT ACGGTCCGGC GACTCAAACA AGATCACAAG CTTATCTTGC TAGATCATTT ATTGGGTGAG CTCGGTCCGG GCTTGGATAT GGCTACAGTA GTCCAAGCTT TGCACTTTTT ATCGAGCATT GGCGAAATTC TATACTTTGG AACGTCAGAT GATGAAGTTC TATCGCGATA CATTATCCTT AGTCGAAAGT GGCTTGTGTC AGCCTTGTCA TGCATTCTCC GCAATGACCT TAAGCGTGAG TTGACCGAAA CTAGAAGATT CATGAATATG CAATGTATTT ACAGCGACCA GAAATTTCCC GAGAGCGAAA TTACAAAAGT GCTCGTCAGT AGCACGGCGA GCTGTCCATT GCTCAGCGAT AGCGATGCGC GCATGCTTTG GCAGTCCATG AGTTTCATGC GCGAAGCATC GGATCGGTAT GCCGAGCTGA CGGAAAGTGC CACCACGGCG CCGACCATGT TTTACTTTTT GGAACGACTA TTGGTCCACT CCGGTGTTCT ACTGCCCCTA CGAGCTTCGC CACCACCGAC AATGGCCTTG GACCAACCGG TGCAGTCTGA AGTGTTTTTT ATACCTAGTC TCTTGACGCA AACGGATCCT CGGGATGTTT GGACGTTCAA ATCCAGTGAA AGTTGGATGA CCACCTTGTG TTACTCCTGG TTGTTCCGCG ATGGTGCCCC ATCGGATCTC ATGGAGCATG TGTCCGTCGA ATTGCTGAAG GACCTGTATG AATTTTCTCA AGACTTCCAA GGAACTCCCA AACAGGAGTA TCCTCAGCGG TCGCACACGG TACCAATAGG ACGGGGGTCC TTGCACCAGT TTCTGGAAGA GCATGATACA CAAGCGATCG GTCGCATCAA GGTGCATCAA ATCATGTGTT GGAAGACTTC AGTTCTTGTA AAGATCGGGA CTGTATTTGC CGATCAGGAC AGTGGGGAGC TACGGGAAAG TTTTGTGGAA GTTTTCGTGA CGGTTGTTGA TCAGAGCTCA AGTCAATGTG TCGCGTCAGA TGCGATGCGC GCCAGTATGC ATCGGGTTAT AGTCAGTGGG AAAGGTCAGG TAGGCCACCA TGGACGCAAG TTGTGGAAGG GTGGCTTTGA GGTTGTATTG GACTCGGTTC GGTCATCGCT CTCTACGTAC CCCAACGTGG ATTCGCAAGT GGTATGTCCA GAATGTCTCG CCCACTCAAG TCCAAGTAAT GCTTGTACAT GGGGGTGGGA TAGTGTCGTT GCTGCCGTTG AGCGCAAAGA CGCAGTTGTT CGATGCATGC GTGGACATCG TGTCGACAGC AGCCTAATTT CTGGCAACGC CAAAGATGCC GAAGTAAAAA CGCCAGCCGT GGAATCGTCT CATTCTCAAC GAGTGTCCAA GCCTGTTCCC GAGATGCTGC CCAGCGTTGT ACTGGTGGGT CTATGGGATG CGCAGCAAAA GGAGATTCGA AACGTTGGAT CTGGATTCAT CGTCGACAAA CGTCTTGGGT TGGTAGTAAC AGCTGCTCAT GTTCTCTACG ACATGGAAGA GGGGCCCAGG TTTGGGGTTC CTTTCTTCGG TTTACCCGAC GCCAAGGTTG TGATTGGAAT CATTCCCGAT GAAGGGCACA ACGCAGTCTT CCGCTACTTT GGTGAGATCG TGCTGAGCGA CGTCCACAAC GTAGACGCAT GCGTTGTGCG TGTTACGAGC AAAATGGCTG AAGACGTTGA CGATGAAGGC ACGGGCTGTG TGAATCAAAC CGAGATTTCT TTAGACTACG AGGCTGTGGA GTCCGAAAAA CTCCGTTCGT TGAAAATGAC GAACCGATTC GAGCTAGAGG AGTCTGTCCG CATCCTCGGG TTCAACCAAG GAGGTGAAGG TGTCTTTGAG CTGGGCAAAC ACGTAAACCG GTCCGCTGAC TTTGCCAAGG GCTATATCTG CAAAAAGTTC AAAGCTGCGA TTTCCGACGA CGGATCGCAC TCATCCAATT CGTCGGGCAA GACGTTTTCG CCACGCGAAG AAATCGTGAT CATGTGCCCG ACAATTTCGG GACACAGTGG TGGTCCGTGT GTGAACGACG AAGGTCGCGT GGTTGGTATT TTGAGTCGAG CCGATCCCGT CGATCGGCAA CGTTGCTATC TAGTCCCGGC GACCGAATTG AAGCGGCTGG TAACCAAGGC CAAGAAAACG TGTGTGAGGC CTGCCAAGCT AGCGACTGCT ATTACTATGT AATTTCGATC CGGGGCAATA ATATAAGTTT CCCTCCCTTT AC
|
Protein sequence | MSTNEPLRPG SGSSSNGGRS RCGSDSNSHS TNQNTEGSVT QAPVLAPEEG PPVLLREWNP APPPPSTPVA PAVIWPDRSP EAEDRRPAYS YPVATFVRDL VRTGMDQIES AWQQQESSST AQRDGTGTRP HGTHNRVEDD DSDAYRPLSV RSESDALAMR SEAAAAFMES TATVSARTIN DLDDEHEDAE FILVDGELVL MPPPPPNAAE QDGLVRHLQN QEFERHFSLR ARYHPDHPAV KSCQPNRVRR GVRKLGKIFK RKNGLRRASS KESAQSHVHH VDDSSAHTSH DGSSLPSLSA DFLTAPSGAD SITTLNHTTN NGGINSNNNP KSPPRKKDKR RGKRRSWRRT DSSGGENDDF DDETSPDIFE SRSGEFSSRT GPRPATFTTI SALANRSAAS IRPPMPPMTP IQETLDASNM FSGGRAASVE ARFVGTTADV LEAGVLPTGF AAEAYVEYDS DEALKKDDSL VASVVPFDAD YKVDKAEAPD FESMLRRAAL ETVSDLEASE STAAQPLRPK RVKSDGSSVP LSLATGSDAH VHNDMLKVVM VGAPGVDKSY VARAIRQSHK RGRKRVTLGV DVHSWSPRSD VKFAIWEVQG ATSRDHGAPN FGAHAATQAL FFSSSCLYLL VWDLACQNVA TNRCPSRRHD GEDCFDSSED EEEYEDDFLR EEANRQADRA LYADIQTRLL SWVDTIALRG PGSAILPVAL IPSHMNEIEV KRRCDTMQNL LENHLHRFDG SEYSPKLLLG QDTILCVDEV TGSGIEQLQE TMVAIATDSS RSVFEHMGAP VPTGTGRVLD TVRRLKQDHK LILLDHLLGE LGPGLDMATV VQALHFLSSI GEILYFGTSD DEVLSRYIIL SRKWLVSALS CILRNDLKRE LTETRRFMNM QCIYSDQKFP ESEITKVLVS STASCPLLSD SDARMLWQSM SFMREASDRY AELTESATTA PTMFYFLERL LVHSGVLLPL RASPPPTMAL DQPVQSEVFF IPSLLTQTDP RDVWTFKSSE SWMTTLCYSW LFRDGAPSDL MEHVSVELLK DLYEFSQDFQ GTPKQEYPQR SHTVPIGRGS LHQFLEEHDT QAIGRIKVHQ IMCWKTSVLV KIGTVFADQD SGELRESFVE VFVTVVDQSS SQCVASDAMR ASMHRVIVSG KGQVGHHGRK LWKGGFEVVL DSVRSSLSTY PNVDSQVVCP ECLAHSSPSN ACTWGWDSVV AAVERKDAVV RCMRGHRVDS SLISGNAKDA EVKTPAVESS HSQRVSKPVP EMLPSVVLVG LWDAQQKEIR NVGSGFIVDK RLGLVVTAAH VLYDMEEGPR FGVPFFGLPD AKVVIGIIPD EGHNAVFRYF GEIVLSDVHN VDACVVRVTS KMAEDVDDEG TGCVNQTEIS LDYEAVESEK LRSLKMTNRF ELEESVRILG FNQGGEGVFE LGKHVNRSAD FAKGYICKKF KAAISDDGSH SSNSSGKTFS PREEIVIMCP TISGHSGGPC VNDEGRVVGI LSRADPVDRQ RCYLVPATEL KRLVTKAKKT CVRPAKLATA ITM
|
| |