Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42856 |
Symbol | |
ID | 7196446 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1325975 |
End bp | 1331557 |
Gene Length | 5583 bp |
Protein Length | 1366 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177270 |
Protein GI | 219111037 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.875616 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATAAATGTAG ACAGGAAACG ATCTGTAAAG CTGATATAAA TGCGGTGTAA TCATATGCAT ATGCACGCTC GTATACTGAC TTTCTTCGCC GTTCCTATTC CAGATGAGCA TTGGGCCCAG GAAATCAATC CGTACATACA ACGACACACT TACACTTAGG AAATAGTATG GCTGCAAGGA TTTTGACCAA TATTTTCACA TATTGCCTCT TTCCAAACAC CACATCTGAA CTCTTTTTGT AGAATTCGAC CATTGACTGT GGTGAACATT GCCGATCATC CTTTGAAAAA GCGTGATACA GTCCCCGACT CATCTGCTCG ATAGCAGGAA GGCGATCTCC TCTGACGAAA GTGATCTACG TACGTTTTGA TATTGTAGAA GAGAGAACAT TAATTGACTG CGATTGTACT CAGATCACTA CTATTCGACA AAGAAAGACT CACTTTGAAG AGTCCTACCT TAGAATAAGA AAAGAAATGA AGACTGAAAA TGGAAGAAAG CTGAACAGCT GGTATGTTCT GTTCTGTTTT GTGTCATCTA CAATGGCTGA GTACACCTAC TATGATAACG AAGGATATCA AGAAGGATAC AACACCAGCC ATACGATCAC CGGAGGTGAC GGCATGGAAT ATTACAGCAA TACCAGCAAT AGTGCTACCA ACAGTTCGGG AGGTGACATT GATTATTGGA CCAACTACGC CATCTTCCCT AAGCGATGCA TAGTTTAGTA AGTTTCTTCG ACCGGTGGCA ACTCCGTTCA CTTGTCCTCC TTCTTTTTGC CGCTCTATCT CACCTTTCCC GCACTGCCTT GTCCTCATCC TTGTGTAGCA AAAAGACAGA CTACATCATG TACGAAATGT TTGATCAGCA ATATTGCCAA GAAGAAAACC GATTGGGAAC CTACATTTCG CGCGTTCCTG ACTACATGAC GGGCCATCTG CAACAATTGG CCGAACAGAT GTCGGATCTA GGTGTCGACG ACTACACCAA ACCAGAGGTG GCGCAGTACA TTGAATGTAC ACCTTTTCAA ATCCAAGGCG CCTATTACTA TTTCCAGATT GGGTGCGCCG ATGGCGTCAC TCAAAAGCTT GCTGTAAATA TTTACTCCGA CAATACGTGC TCAAAGAAAT CAACTGTGGA CGGCTTCGAT GATTCTGTCA TTGACGTTTC GGCTTTGAAT GTAAGCAGAT GCAGTTTTTT TGTCCTGGCG ACGGATTGGC TTGTTGTGTT GCTTAATGGC AGTCTATTTA TTCCCAACAG ATTCCTTTCA AACAATGCCA AACGTGTGTT AACTGGGTAA ACGTGGATGG GGCTGACGAT CAATTCTACA TCAAGCGTCA ACAAAATGCT CCTCTTTGTA AGACTGCTTG GACGTACAAG AAGCATTGTG GGCGCCAATG TCAATCCGTT GGTCGCAATG CGGAAGTCGA AGGCTGGAAT GCTTCAGATA AAATTCTAAT ATCGGTGCTT TCCTCTTTTG CTCTCATTCT GCTTGGATCA ATTGCCTTCC GGAGACAGAA GATGCCCAAC AAGGATATAC TCTTGGAACA AGCATTTATA AACTCAGCGG GACTTCGGCA ATCACAGATT TTTGTCATTA TCATCATCGT CATGGCAGTT ATCGCAGTCC TCATCATGCT GGGTTTGAAG GACGCGACCT GGGCTTTGCT ACTGGTATTG AATACTGCTC TGTTTGCTTA CCTTATGAAG CTGACACTGG AAAGCAGTGT CAATACCGGA GAAATTATCA TCGGACCAGA TGGCACTATT CTACGCAAAG ATTCCGACGA TTCTTCCATA GAGAGTACAA GTAACCCCAA CAATGGTACC TACATGTTGC CAACGTTGAC ATGATCGAAT TGTGTACTTA AATGACATTT AGGTTTTGTG CCCTCGACTG ACGCAATACC TCGTGGACCT GTCGATCCTT TTGTCCTCGC TCGTTCCCCT CTTGTTTCTG TCATTAAGTT AAACAAGGCG TCGTGCAAAG TCTATAGTAG TTGAAAACGG CTAAAGAGTG AAATATTCAA TAGGCATGCT TTCCTAAGAC GGAATAATTG GTATTTGATA AATTTTCATC TCGGCTGAAA ATCCCTGCTG TTGGTCTTGG CCATAGGGCT CGAATTTGTT CCGCACTCAT GTTTTTTTTA TTGCTGTGCA CACAGAGTCA TATAAGACAA GCATCTGATC CTCTCTATAC TATCAATTGA TTGTAAAAAG TTTCATGTCT ATTAGAATAC TGCAACTTTG TTTGCAACAT TGTGTGTTTC ATCCGAGCAA CATTGCCTAG CGCGAGGAAT TGAATCAATT TTTTTGCTGA CTGTGATTGT CTCGCTTGGT TATTAGATTG AACCGCAGCA GTGTTGATCG ATCCAAAGGA AGAGCTTTCC GACTCAGTAT CAACCACACT CCCTTTAGCG AAAGCTTCGT TGGTTGTTTC CATACTGTCG CATTCACATC AACAGGAAAA CGCCTTGGTG ACGCCGCTCT TGCGGTCAGA TCATCATCAC GAACTGAGTC TACAAGCAAA TTCCGAGGCT TCCGGCCACG TCGCCGAAGC CGATATCGAA GGGGAAGAGG TTGGAGGTAC CAGCATGAGC AGCACTTACC ATGATCCCCG ATTTATTCGA AATCCAGTTA TTCGGGGACT ACTGCTGCGT TTGCTGCGTA TGCAATCAGC GGCCGATTCC TTACCACACC TAGTTCAAAT GATCGTGTCG AATGGAAAGA TTACAACTGT ATCATTCGTT GCTCTCTATC TCGTGTGTTT AGTCCTTTGG CTACCTTTCT GGCTGCTTGC TTTGCTAGTC ACGGAATGGG GCGTCTATGC GCTGTCTGTT GCGGGAGGGT ATTTCATTGG ACGATGTATT ATTCGCATGA TTGCTTTTCC TGGGGCTTCC CGAAAAGTCA GTACCGATAT TGAGAAAGAA TTTGCCAAAT ACTCAGTCCG TATGTTGCAG TCCGGTGTCC AAAGCTTCGC GGAAGTCGCT TCGATCATGT CAGCCAATCC GAACCATCCC CAACGAGTGA GTGTTTTGAA GGGGTACACA CTGCCGTCTC TTTGGAGTAG GGCCAAAACC TATCGAAACC GTGTGTTGGG AGTATACTTG GAAGTTCTTT TGCACATTTA CCAACAGGCA CCAGAATCAT CTACTGGGGC ACAAAGAGGA TTCACAAAGT ATGGCAACAA CGTTTTATCT GGAGATATTG GGAACATTGC CGGACTATCG GTAAGTTTTC ACGTTGTCTG CGCTGCAATT GTGTTCGTTA TTCTCAAAAT AGATAAAAAT GCTTCTACTC TATCCTGTAG GCCCAAGCCC AAAATGATGG GCTTGGTCTC ATTGAACAAC TCAAAACGGT ACTGGCTTTG GTGGACACTT TGGAAGAGCA AGCGCATACG TTCCTTGAAG GTAGAGCAGC TGTTGTAACG CCAAATGATC TTCCGGAAGA TGCACGCCGG ACGGCGCAAC ATCTTTTGGC CCGGTCTCAG GAGCTTACCA ATTTCGTCTC GTCCTTAAAA CCTCCATCTG ATAGCAGCAA TGAAGACATA GAGGATGAAT CAGAAGAAGA CTTAACGGTT GACGCAGTAC GCAGAAAACT AGAAAGGCAG CATGGTTCCA CAAGAGAGGC TATTAAAAAA GGAATTGCTT CTGTTATCCC CCTGTTGGAC CCCCCACCCC ACAACTCAAT ATTCTCGTTT GATTTACAAC GTGGTTGCAT GTTGAGTCGC TATCGAGGTG CTCGCCAGCT TTGGGTGCGT CGTCCTAGCG GCGGCATGTT GGATGTTTTG CATTTTCCCG CTCGCGATCG TTGTGCGGAT ACTCAGCGGA ACTCCAAAGC ACTTCTTTAC TGCAATCCTA ACGCAGGTTT AGTTGAAGTA GCAGCCGGGA TGAGCCTAGT TGGTGGGAAC GTGCCGTCAA ATGAGGGTAA TGAACAAGCA TCCGATGGGA GCTGGGTGGA CTTTTACACC TCCGTCGGCA TAGATGTCTA TGTCTTCAAC TATGCAGGAT ATGGACGAAG CTTTGGGTCG ACAACATGCC TAAAAGGGAA CAGCGCGATC GATAGCTATA CTCCCGGTAT CCTACCCCGA TTGATTCGAA TTATTCGCTC TACATTTTTA ACGTTCACGC CAAATCCAGA CACTCTCCGC GATGACGGGT TTGCCGTTGC GGTACACTTG CTCAAGGAAG TGGGTATAAA ACAGCTTATA ATCCATGGAG AGAGCATTGG TGGGATGGCT GCATCAAGTA CCGCTCGTCG AGTCTCCCAC GAGCCGGAGC TACAGGATAA GCTGGCTCTT CTGATTTGCG ATCGAACGTT TTGCAACTTG GAGGCTGTAG CTCAGCGTCT CGTCGGAGGG TGGACAGGAA ATGCGATTCG GATGTTAGCA CCTTTCTGGA GTACAGATGT AGCGGGCGAC TTTTTCGCAA CAAACTGCCC CAAAATTGTT GCCAATGATG CAGCCGATGC AATTATTTCT AATGAAGCAA GCCTGAAGTC TGGAATTTCG CTTTGGAAAG AATTGCATCG CGGAATCGCT TCCACAAAGG GAATTGGTTG GATGACGGAA GCACCCTTAC AGTACAGAAT GGCCGATTGG GAGAATGTCT GTGTGAATGA TTCGAAGTAC GTTACAGCAC CAGGAGTACT TCGATCTCAA GCACCGACAT GGCCTCACGA CAAGCACATA TCCGTCGAGG AGGCTTTTCA TTTTGCGGCA TGCTGTAAAC GGATTGGAAA GTACGCTAAG GCTTTCAGGA AAGGCTCGGA GGGTGGCGAT TTGAGTGGTT TGCACCCCGG CTCTAGGCGT GCTCTAATGG AAGCATGGCA GTCTCTGGCA TGTCTCGACG GTCTCACTGG AGCACCGCTC GGTGTTGCTG TCAAACAAGG GTTCGATACG ACGGTAGCTT GGCTATGTTC TTTTCTGATT TTCGGTGCTC AATCAATTGT AGCTGTCGCT GAGCGACGCA CGGAGGGTGA CGTTGGGCTT CGGGAGCAGC TAGCAATTGT CCCCTCTGAC TTTGATTCCC GACCAGCCGG CTTTGCAGCT GCAGAAGAGG GAGGCATGGT GCATCCGAAA CCACTTCCAG AAGTTATTGA ATCCCTTTTG TCATTTCAAG AATCAGGAGA TCCTTCCCTC AGAGACTGTA AGTTGTGACT GTCGTTTCGA TTTAGCTACA GGGCTATCCC TTATCTAACC TAAGTGTATT CCTCTCTTCC AGTGTCCCAT GAATTTCAGT TTGTTGTTGG CGTTCTTAAG TATTTGCAAG GCCGCCTGTC GGCATCCACG AGTATTGAAG CTGCGCAAAA AAGCCGAAAA TTGCAAGTTT TTAAGGAAGG TGTTGGCTTC TTGTTGAACC TACATTGTGG ACACAATAAT CCCTTTTCCA AGGAGGAACA ACTGCAGCTG AAAGGACTTT TGGATAGGGC CATCGGAGTG AAAGGAGGTG GATTGGTGTA GTCTTTGCAT TTACCAGAAA ATACAGTAAG CCATACTTCA CGCTGGCTTT CGGAAACGAG TTTGAGATGT ACACCGTTTT TGGCTGGCTT ACAGTTATTA CAAATCGTTC ACAACACGCG TTT
|
Protein sequence | MKTENGRKLN SWYVLFCFVS STMAEYTYYD NEGYQEGYNT SHTITGGDGM EYYSNTSNSA TNSSGGDIDY WTNYAIFPKR CIVYKKTDYI MYEMFDQQYC QEENRLGTYI SRVPDYMTGH LQQLAEQMSD LGVDDYTKPE VAQYIECTPF QIQGAYYYFQ IGCADGVTQK LAVNIYSDNT CSKKSTVDGF DDSVIDVSAL NIPFKQCQTC VNWVNVDGAD DQFYIKRQQN APLCKTAWTY KKHCGRQCQS VGRNAEVEGW NASDKILISV LSSFALILLG SIAFRRQKMP NKDILLEQAF INSAGLRQSQ IFVIIIIVMA VIAVLIMLGL KDATWALLLV LNTALFAYLM KLTLESSVNT GEIIIGPDGT ILRKDSDDSS IESTSFVPST DAIPRGPVDP FVLARSPLVS VIKLNKASCK ENALVTPLLR SDHHHELSLQ ANSEASGHVA EADIEGEEVG GTSMSSTYHD PRFIRNPVIR GLLLRLLRMQ SAADSLPHLV QMIVSNGKIT TVSFVALYLV CLVLWLPFWL LALLVTEWGV YALSVAGGYF IGRCIIRMIA FPGASRKVST DIEKEFAKYS VRMLQSGVQS FAEVASIMSA NPNHPQRVSV LKGYTLPSLW SRAKTYRNRV LGVYLEVLLH IYQQAPESST GAQRGFTKYG NNVLSGDIGN IAGLSAQAQN DGLGLIEQLK TVLALVDTLE EQAHTFLEGR AAVVTPNDLP EDARRTAQHL LARSQELTNF VSSLKPPSDS SNEDIEDESE EDLTVDAVRR KLERQHGSTR EAIKKGIASV IPLLDPPPHN SIFSFDLQRG CMLSRYRGAR QLWVRRPSGG MLDVLHFPAR DRCADTQRNS KALLYCNPNA GLVEVAAGMS LVGGNVPSNE GNEQASDGSW VDFYTSVGID VYVFNYAGYG RSFGSTTCLK GNSAIDSYTP GILPRLIRII RSTFLTFTPN PDTLRDDGFA VAVHLLKEVG IKQLIIHGES IGGMAASSTA RRVSHEPELQ DKLALLICDR TFCNLEAVAQ RLVGGWTGNA IRMLAPFWST DVAGDFFATN CPKIVANDAA DAIISNEASL KSGISLWKEL HRGIASTKGI GWMTEAPLQY RMADWENVCV NDSKYVTAPG VLRSQAPTWP HDKHISVEEA FHFAACCKRI GKYAKAFRKG SEGGDLSGLH PGSRRALMEA WQSLACLDGL TGAPLGVAVK QGFDTTVAWL CSFLIFGAQS IVAVAERRTE GDVGLREQLA IVPSDFDSRP AGFAAAEEGG MVHPKPLPEV IESLLSFQES GDPSLRDLSH EFQFVVGVLK YLQGRLSAST SIEAAQKSRK LQVFKEGVGF LLNLHCGHNN PFSKEEQLQL KGLLDRAIGV KGGGLV
|
| |