Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_54536 |
Symbol | |
ID | 7201272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 564266 |
End bp | 567149 |
Gene Length | 2884 bp |
Protein Length | 859 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180669 |
Protein GI | 219119835 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.253785 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTATTTCAAG CGTTCCCTTT ATTGATTCGT GTACCGTTGT GTGAAAACAT ACACAAGACA CGAGCGAACA CGCTCATAGT CGCTCACTTT CATCTACAGC TACCTACCTA GCTAGGTAGA GAGCTTGGTG ACGATGATGA ATGGACGTCG TGGATACACC GAAATTGCGG CCGGCGAGAC CGAACCCTTG CGGGAGACTT CCACTTCCAT GATTTCCCCG TCTCGCACAA CGGATCCGTC TCCCCGGGGA CGAAATTTGG TCTGTGCATT GGTGACGGTT GGTGTCCTCC TGGGAACCAC TTTGGGATGG ACCCGCGTTC CGGAAGACCT CGACTCTAGC GTCCCCGTCG GAGTGCCCGA TTCACAGGAT CCCACGGCCG CCTCCTCCAC GGACCGTTCC ATCCGGTACC GTCCGCTTTG TCAGTTTTAC GACGACACGT CTACTTCCGT ACGCATGATT CAAACTTCCC TCAAGGAACC GTCGCGGCAG TGGTCCGATC AGCCATGTTT CCCGGCATCC CGGCACACTA CATTGTTTAC CAACCAGCGT GGGCGTCCGC CGGTTTTGCC CTTGGACGCC TACGGCTCTC CCGACGCAAT ACTACGTGTC AACTTTTCCG CGGTTGCATT TCCCACACAC CAGACGCCCA TCCTAGGATT CGGTGGCGCC TTTACCGAAG CCTCCGGGCG GAATTATCAA CGCCTTTCCG AACAGGGCAA GCAGGCCGTC ATGGAACTGT TGTTCGGGGC GCAAGGACTG GGATACGCAT TGGGACGGGT CCACATCAAT TCGTGTGATT TCTCCGTCAA AAGCTACACC TTTGACGATG TCGAGGACGA TTTCCCCCTC GACCACTTTG ACGTTGGTGT CCACCACGAC GTCGAAGTGG GTATGGTGGA TATGGCCTTG CGCGCCACGT CAATCTTGCG TCAAGGCTGG CCGAGTGAGG ACGCCTACGA AGGAAATTTG CGACTCTACG CTAGTCCCTG GTCACCGCCG GCTTGGATGA AAAAACCGAC ATGGGAGGAC GCCAAGAACG CCACGCACGC CGCAAAAATG ACCTACAGCG CCGAACCCAC CTGTCTGCGC GAGGGCGTCG GTTCCCAATC CCGCTACGCC GCTTCCTGGG CCCTCTATTT CTCCAAATTT GTCACTGCCT ACCGAGACTT GGGATTGCCG CTCTGGGGCG TGACGGTACA GAACGAACCG GAATTCGCCG CACCATGGGA AGCGTGTGCC TACACACCGA CCACCCAGCA CGATTTTCTC ACCAACCATT TGGGTCCACA ACTTGAACGA GATCATCCCG ATCTGAAGAT ATTCATGTTT GATCACAACA AGGATCACAT CAATATTTGG GGGCAGAAAA TGCTCAACGC GTCGAGCAAT GCGTCCCGTT ATGTGGACGG GACCGCCTAC CACTGGTACG CTGGAGGTAT GGACCGTCTC CTCGACGGTG CTCTGGGTAA CGCCAATCTA CACCAGTTTC AAGAGGACAT GCAAACCTAC GGGATCCAAT CAAACCACAT TCTCTTCAAT TCCGAAGCGT GCCACTGCCC AACCACTGGC TACGCCGGTG GCGACATTAA CATTTACTGG GCGCGGGCGG AGCGCTACGC CCACACGATA CTGGCCGACC TGGCGGCTGG TAGTCATGGT TGGGTGGAAT GGAATCTACT CCTCGACAGT ATCGGCGGAC CTAATCATCT AGGGAACTTG TGTGAGTCGT CCCTTCTGGC CGTCCCCCAC CGCGCTTTGA ACGCAGATCC CCATACACCA CCTCTGCCCG ACTTTGAAAC CGACGGGCCT ATGGGCAAGG TGAATATCGG AGACGGTAGG ACTCGGGAAG AACTCAACGC TCTTGGTTTT CCCGCCAAGT TTCTAGACGT TGGCGTTGCC GTTCAGCCAA TTTACTACTA CATGGGACAC ATCTCGCGCT ACGTTCGACC GGGATCGGTA GCGGTGCCCG GGCTCGTCAC GGCCAACGCG CAACCTGGCG TCCGGATATT CCGGCCATCC GGGCAAGTGG TGGTCGGCGG CGGGGAGAAC GATTTGGCTC GGCACGGTAT GGAGATTACG GCCTGGCCGT GTGAAGGGTC CACCCGACAG CACTTTACCT GGAACGCCGA TTCGAAGCGA CATATTCAGG TACAAGGGCA CGACTGGCTT GGGAATCCAA CCAAGTCCTG TTTTGCCCGT AAATCCGACC CCTCGCTGGG CACGATGAGT TTGACGGATT GCAAACCGGG ACAAGTGGGA ATCTACGATA TTGTTCCAAT CGCGGAGAAG GAAGGCGATC AAAGATTCTT CCAAATTGTT TTGACGAATC ATCCCAAACT CGATCGACCC TGTTTGATCA TCCACCAATT GGGCAATGAC GGCGGAGCTT ATGGTCCCCG AGGTGGGGCT CCCGTCACCT TGGGTAGCTG TTCATCGAAA GCAGCCAGGT GGAAAGTGGA CGAGACAACA GGCGAAGCCT CGTCTACCTT CTTCTCGGAC GATGACGGCA ACGAAAATGA GGTGTGCATG ACCACAGGTT GGCCCTTTCT ACAAATGGGA GCTTTCTTAA CGCCAAACGG GGAAGTGCCT AAGACGGTAG TGATTCTCAA CGAAGCCAAA GAATCGGCCA ATTTTGCTAT TCAAGATCAA GATAAGGTTA TCGTCACAGC TTCTATTCCT CCGCGATCCA TCCAGACTAT CCTTTTGCAG TAAAGTTGCA TTGCATGCAG ATTATTTTGG TCGATTGACG AAGACCAAAG TATTGGAGCA CTTTATAGTT AGAAAGGAAA GCAGGTCTTG GTCTAAAATG CGACAATGTA CAACGTCTAA GTACACTAGA GCACCTATTC TATTTCATTG TAAGAAACGA AGTGTCAAGA TTGC
|
Protein sequence | MMNGRRGYTE IAAGETEPLR ETSTSMISPS RTTDPSPRGR NLVCALVTVG VLLGTTLGWT RVPEDLDSSV PVGVPDSQDP TAASSTDRSI RYRPLCQFYD DTSTSVRMIQ TSLKEPSRQW SDQPCFPASR HTTLFTNQRG RPPVLPLDAY GSPDAILRVN FSAVAFPTHQ TPILGFGGAF TEASGRNYQR LSEQGKQAVM ELLFGAQGLG YALGRVHINS CDFSVKSYTF DDVEDDFPLD HFDVGVHHDV EVGMVDMALR ATSILRQGWP SEDAYEGNLR LYASPWSPPA WMKKPTWEDA KNATHAAKMT YSAEPTCLRE GVGSQSRYAA SWALYFSKFV TAYRDLGLPL WGVTVQNEPE FAAPWEACAY TPTTQHDFLT NHLGPQLERD HPDLKIFMFD HNKDHINIWG QKMLNASSNA SRYVDGTAYH WYAGGMDRLL DGALGNANLH QFQEDMQTYG IQSNHILFNS EACHCPTTGY AGGDINIYWA RAERYAHTIL ADLAAGSHGW VEWNLLLDSI GGPNHLGNLC ESSLLAVPHR ALNADPHTPP LPDFETDGPM GKVNIGDGRT REELNALGFP AKFLDVGVAV QPIYYYMGHI SRYVRPGSVA VPGLVTANAQ PGVRIFRPSG QVVVGGGEND LARHGMEITA WPCEGSTRQH FTWNADSKRH IQVQGHDWLG NPTKSCFARK SDPSLGTMSL TDCKPGQVGI YDIVPIAEKE GDQRFFQIVL TNHPKLDRPC LIIHQLGNDG GAYGPRGGAP VTLGSCSSKA ARWKVDETTG EASSTFFSDD DGNENEVCMT TGWPFLQMGA FLTPNGEVPK TVVILNEAKE SANFAIQDQD KVIVTASIPP RSIQTILLQ
|
| |