Gene Information |
Locus tag | PHATRDRAFT_44767 |
Symbol | |
ID | 7199884 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 180714 |
End bp | 185593 |
Gene Length | 4880 bp |
Protein Length | 1557 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178945 |
Protein GI | 219116300 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TAAAGATGTA CTCACAGTAA ATTACGTTCG GCAGAGTCGA GGAATGCCTT TTCTATTGAG CTGCTGCGAG AAGGTAAAGG ATGGACTCGA ATCAGGAAAT GGTGCATCGC AGACGCTTGG CGCAAATTCT GCGCTACATT GCGAGTATGC ACACTGCGCT TAACGGAACC GTCGAAGGAG CCGCCAGTGG GAATGCGTTT GTCCGCGACG GTTTGCGTCC CGGGGTGATC CCCTCCGACT TTTTCCGGCG CGAACAACCG GAAGACGACC CCCATTGGGG CGTCTTATTT AGTACCGAGA GCTTTCGTGC GCTACTGGCC ATCGAACAAG ACCAACAACC GAAACTCTTT GCTACACCGC CGTTTACCTT TTACGCGGCT GTGGCAGGAT CCGGAATCGC TGCTTCTAGT TCCAAAATGT GGGGGAGCAC CAGCGCGCAC AAGCGCTCCT TTACGGACGC TTTTCCGGCG CCCTTCAAGG GTTCGCCTGT TGCCGTTCCC GAAAAAGCCT GTTGGCTACG TATCACGCGC TTGACGACTT CCCAATTATT ACCTCCACCC CTACGAAAGC ACCCCGCCGT GCGCCAGTAT CTCCAGGATT GGTGTACGTC GGACCCGACT ATGACTGGCG TAGACGATGT GTTGACCCTG CGTCGTTTGG AACAAGCACC CAAAATCGCG CCGACGGTCG ATTTGCAAAG TCGGGATGCA ATCAAGTCCT TTCTCCGCCG CACCAAAAGG TACTTGCGTC AAAAGTCATA CCAGAATACA CTCGAACCTC TCTACAACCG CTTATTCGAA TGGCAGCAGC AAAATGACCA GGACGAATTA GTTTGGGGAC TTGGGCACGC CAAAGTACGT GCCATCGACG GGGTCTGGAT CAATGGACCC CTGCTGGAAG TCCAGATGGA AGTCGAACTC GCTCGGGACG GTGCTTTGCT TGTACGACCA CGGGAACACA CTGGTGTATC ATTAAATCGC GATGTATTGG CGGCGCTTGG TACCTGGTCG GGGGAAGCGT TGTCTCACGT GGTGCTCTCA CAACTTCACC GGACCGTAGC TGAGCTCGAA TCGTGTCAGC TGTCGCCAGC GCAGCCCAAC ACTTACACTC CACTTCTTAA GCGTATGGCG GTGGAGCTGT CATCGGGTGG ATCCTTTCAA TCATCGTCAG TGGCAGCAAC AACAACGGCA CAACGTGATC CTGAAAAATT GGTGGTGACG GAAGCATGGT GTGTCTTTGC GCGCCCCAAA CCCAGCTCCG TTTGGGCTCG GGACGCCAAC ACGTTTGCCG ACCAAATAAT GCAGGGTTTG GAAAACCCTT TGGCATCCGT CGAATTACCA AAGGCGTCCT GGTCGTTGAC GCTCGGCCCG GGATCCTTGG ATACAGTTTT GTCTAGGCAA CAAAGCACCA ATTCAAAACG GTCGCGCCCT TGGGTGCGCT GGATTTCGGA AAAGGTTCTC GGTGTTCGAA AGTCGCCACC GGAGGAAAAA TCGGCAAGGC CTCTGTTCCC ACTTCCTACG TCTGATGCTC AAAACAAAAT CGCTGAATTG CTTTTGACCA AGAATTACCC AGCGGTCGTT TGCGAAGGTC CGCCTGGGAC AGGAAAGTCA CACTCCATTG CCAATTTGAT ATGCGCCTAC CTCTGTCAAG GCAAACGAGT CTTGGTAACC TCCAAGAAGG CACCAGCCTT GTCGGTTCTG CGGAACCGAC TCCCTGTGAC GGTACAGGAA TTGTGCGTGG ATGTATCGCG TAGTGAACTG GCGGGAATGC AACAATTGCA GCAAACCGTG GAACGCCTCG 
CCAATCGTGT TGCTAGCGCT AGTGCTGTCT TGGAAGCCGA AAAGTGCAAA TTTCTTCAGG TAGGTACAAA CTCCATGCTT TCTGCCCCTG AGTTCTGGTT CACGTCAACG TTTGTCTGCT AACCTTTTGG ATTGCTTGCT CATTGAATTA GCGAAACATT GACGAACACG AGGCCCAGCT CAAGGAGATC GATGCTAAGC TTGTGGCACG GAGCGAAAAG GTTCGAAAGT TAATGGATCA TCCCAAGGGG CAAAGCCTCG TTGAAATCTC TATGGCGATC ATTGATAGTG CCCCGTGGCT TGCCCGAACA ATAAGTGGAT GGACAGTAAA GGAAGTGAGC GCTCTCCGGG ATGTAGTTGT GTCACTTTTA CTCAAAGAGG GAGATCCAGC GCGTTCCGTG TCAGGTTTTT CGAGACCACC AACCAACGCA CTAATATCTA GAGTCGCTGC GGAAGCTGGA CATGCATTTC CTTTGATATC CAACGCTGTA AAAGGGGTGG CGGCGCGTAT TCCCATGGTT GGGTCGTTGA CTGGAATTGA ACAGCGGCGA ATGAGTGTGG AAGAAGCGCT GATGCAAATC CGAATAAACG GCCAGCGACC CTCTCTTGCA GCTGAGTGGC ATACCGTCCT TCGTGCTCTC AATCACGCTA AGGCATTGCA TTCCTTCGAG ACAATTACAT GGAGGACTCA CGAATCAGAG AACGCATGGC CACACTATGA ATTTGCCGAT AACATCGATG AACTCTTCCA GCTCAAAGCT ACATTCGATG ATGCTGTCGC CATGAAATCG CTTGCTTGGA ATTTGGATCT GGAAGATGAA ACAAATATGG CCGTTGAGTG CCGAGCACTC GATACAAAAC GTCGGATGAT TGCGACTCGG ATCTTGGGCC TTGCCGAGGA CCTCGTGGAC GCATCTGTCA TTACTGAGTT GAGTCGGTCA TTTTCTACGG ACGCTCAATC GGCTCTCATT CGTTTTGCAC AAATCGCGGG AAAGGCAAAG TTCAGTAGGA CTTCGCAGCC TTCAAAAATG ACGCAACGAC AGAGGCGCCG TCGACAAGAG TATCTCGATG CCTTCGATCG ATGCTGTCGC TTCATTCCGT GTTGGATTCT CACCGCTTCG CAGATAAGCG ACTATCTTCC GTCGGAATGT TTGTTCGATT TAGTTATTAT AGATGAGGCG TCACAAAGCG ATATTAGAGT TCTACCGGGC ATGTTGCGGG GTAAACAGTG GTTGATTGTC GGTGACGGGA AACAAGTTAG TCCGACAGAA GCGTTCGTAT CGGAAGAACA GATCGATGCA TTGCGAGCGG CACTTCCGCC GTCGCCATTG GAGGATTCGC TACTGCCTGG TCAAAGCTTT TTTGATTTAT GCGCCCAAGC GTTTCCACAA GGGAGAGTCG TCTTGAGTGA GCACTTCCGA TGTGCAGAAG AAATTATTGA TTTTAGCAAC CAGCAGTTCT ATGACGGTAG GCTTGTTCCC CTAAGACTTC CAACGAAATC TGAACGACTC ACGCCATCTC TTGTCGATGT CATGCTACGC GATGGTGTAA AGATTGGCAA AGTCAACGAA AAGGAAGCTG ACGAAATTGT CCGCATGATA CAGGACTTTA CTTCAGATCT ATTACAAGTC GCAAAGCCGC GATCAATCGG CGTGATCTCC CTTATTGGGG ACGAACAAAG CCGCCTTATA CGCGGTAGAC TTCTGGATAC CATTGGACCT CGTCTTATGG CTAGACACGA CATTCTAGTT GGCGACCCTC CGACGTTTCA AGGTGCTGAG CGAGACATTA TCTTCTTGAG TATGGTATGC 
TCTCGAGGTG CCATTCCGAC ACAGAATCAG CTTATGCATG TGCAGAGGGC AAATGTTGCA ATGTCTAGAG CAAAAGACCG TTGCGTTTTG GTTCGGAGCG TAAATATCCA CGAAATACCA AGTTCATTAG ACGTGAAAGT TCCAATCATA GAGTTCTTTC AACGAAGTGC AGCTAATTTT GAGAATAGAC ACGGGCTTGA TGAGATTGCA GTTGAGTGTC CACAGCAAAA GGGAGCCCTA TCAATCCGAT TATTACTTAC AAAACTTTTG AAAGAGCGGG GATTTACCGT CCGAGACATG GGCATCGTTT GGAAAGAAGG CCTATGTGTG GAGCATCCGG CTTCTGATCG ACGCGCTGCG CTACTGGTCG ATTGCGCAGG AGAGCCGCTG CGAGAGTGGC TAGCAAGCTA TAGCCAAGAG AAGGCTATCC AGCGAGTTGG TTGGGTGTGC TTACGTGTCG ACGCTTTGTC GTTTCTCCAC GATTTTCATG CCGCATTTCA GACAGTAGTC AAGTTTTTGT CGAGTGTTGG GATCGAGGAA TCCGCTATAC TGTACGACGA GCTCGACGAC GACCAAGATG AAAGTATGGC GCCGGAGGCT TTTGTTCAGA TCGAGGTGGA AGATCCTTCA GACGACGACT CGGAGAATCT TCATGAAAAT ATCGATCTGG AAGCCGCTGG AAACCAAATG CACGACGTTG TAATGATAAG CAGTGACGAA GAGGATACCG AGGTGACGGC AAAGAAACCT GCCGCAGTGA AAGCGGAGCG TGTAGCAAGC GAGTCTTTGG AGATCGTAGA CAACGAGGAA GTAGATCCAT CCGATTTTGG GCAAGTAGTG GACATTTCCT TTCTGCGGGG CGCGTCCATG TCCAAAGACG GCGACGATGA TGAAAATAAT GATTTTAATC TGAGTGTAGG ATCTGAAACG GAGGTAGAAA GAAAAAACGA AGAAAAAAAT GTTGCTCGCC GACCAAGCCA CAAGTCCAAC GAACTAAAAG ACGGCAAGGA CAAAGACGAA GGTGACACCT TCACCTTGGC AGACAGAAAG GCTCAAAGCC GTGTGTCGAA ACGGCGACGG TATCATCGAG TAGATAAATA CTCCCGAGAC GGACGTTGGT ATCCCGCACA GCAGCATGAA GGAGAGCAGG ATGACGAACA TAAATGGTAC GATACCGACT CCGACATGTC GGTCCACAAG GAGGATGCTT TGGTAAAAAA CAATACACAG ATGTAATGAT CGTGATTCCA ATAAAATTTG TGCAACAACT
|
Protein sequence | MDSNQEMVHR RRLAQILRYI ASMHTALNGT VEGAASGNAF VRDGLRPGVI PSDFFRREQP EDDPHWGVLF STESFRALLA IEQDQQPKLF ATPPFTFYAA VAGSGIAASS SKMWGSTSAH KRSFTDAFPA PFKGSPVAVP EKACWLRITR LTTSQLLPPP LRKHPAVRQY LQDWCTSDPT MTGVDDVLTL RRLEQAPKIA PTVDLQSRDA IKSFLRRTKR YLRQKSYQNT LEPLYNRLFE WQQQNDQDEL VWGLGHAKVR AIDGVWINGP LLEVQMEVEL ARDGALLVRP REHTGVSLNR DVLAALGTWS GEALSHVVLS QLHRTVAELE SCQLSPAQPN TYTPLLKRMA VELSSGGSFQ SSSVAATTTA QRDPEKLVVT EAWCVFARPK PSSVWARDAN TFADQIMQGL ENPLASVELP KASWSLTLGP GSLDTVLSRQ QSTNSKRSRP WVRWISEKVL GVRKSPPEEK SARPLFPLPT SDAQNKIAEL LLTKNYPAVV CEGPPGTGKS HSIANLICAY LCQGKRVLVT SKKAPALSVL RNRLPVTVQE LCVDVSRSEL AGMQQLQQTV ERLANRVASA SAVLEAEKCK FLQRNIDEHE AQLKEIDAKL VARSEKVRKL MDHPKGQSLV EISMAIIDSA PWLARTISGW TVKEVSALRD VVVSLLLKEG DPARSVSGFS RPPTNALISR VAAEAGHAFP LISNAVKGVA ARIPMVGSLT GIEQRRMSVE EALMQIRING QRPSLAAEWH TVLRALNHAK ALHSFETITW RTHESENAWP HYEFADNIDE LFQLKATFDD AVAMKSLAWN LDLEDETNMA VECRALDTKR RMIATRILGL AEDLVDASVI TELSRSFSTD AQSALIRFAQ IAGKAKFSRT SQPSKMTQRQ RRRRQEYLDA FDRCCRFIPC WILTASQISD YLPSECLFDL VIIDEASQSD IRVLPGMLRG KQWLIVGDGK QVSPTEAFVS EEQIDALRAA LPPSPLEDSL LPGQSFFDLC AQAFPQGRVV LSEHFRCAEE IIDFSNQQFY DGRLVPLRLP TKSERLTPSL VDVMLRDGVK IGKVNEKEAD EIVRMIQDFT SDLLQVAKPR SIGVISLIGD EQSRLIRGRL LDTIGPRLMA RHDILVGDPP TFQGAERDII FLSMVCSRGA IPTQNQLMHV QRANVAMSRA KDRCVLVRSV NIHEIPSSLD VKVPIIEFFQ RSAANFENRH GLDEIAVECP QQKGALSIRL LLTKLLKERG FTVRDMGIVW KEGLCVEHPA SDRRAALLVD CAGEPLREWL ASYSQEKAIQ RVGWVCLRVD ALSFLHDFHA AFQTVVKFLS SVGIEESAIL YDELDDDQDE SMAPEAFVQI EVEDPSDDDS ENLHENIDLE AAGNQMHDVV MISSDEEDTE VTAKKPAAVK AERVASESLE IVDNEEVDPS DFGQVVDISF LRGASMSKDG DDDENNDFNL SVGSETEVER KNEEKNVARR PSHKSNELKD GKDKDEGDTF TLADRKAQSR VSKRRRYHRV DKYSRDGRWY PAQQHEGEQD DEHKWYDTDS DMSVHKEDAL VKNNTQM