Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31815 |
Symbol | |
ID | 7196384 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 992151 |
End bp | 996118 |
Gene Length | 3968 bp |
Protein Length | 1246 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177199 |
Protein GI | 219110895 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.56287 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGAG TTCGCAAGGC AACCGGTCCT ACCCGGAAGG GAGCGACCGA AACGGTGCCG GAGGAGCGAG TGGAAGAAGA AACGCCCTTT GAGGCCGTTG AGTCGTCGTC CAAGGACAGT GACAATGAGA CGCAACCATC GTCCATGGGC GATGACAATG ACTCACAGTC TGAGATCGAG TCGTACAAGA TTGATACCGA CATTGATTTC AAGTACAACC CAAACTTTTT TGAGGACAAG AAAGCCCTTG AAAGTGTCTA TGGGATTTGG AGATATCCAT GTGAAGTCAC TCCAAAACGA AGGTTTGGAG ACCGCAAATG ATTTCTTGCT TATTTCTATG AGTGACATCA ATGATCTTTG CAACAAGCTT TTGTTTGCAA CAGTTTACAG GGCTCGCCTA CGGGCATTTG CTACATGGTT ACGTAGTCAA CCCGACAACG TAAATATTAC CCAAGAATGG ACAATTCCAG TTATGCAATT GGAAATGCAG ATGAAGGCGC AAGCGTCTCC ATTTGGAACC TCCGAGACCA ACAAAACAGA CAAGTCAGTC TCCAGTCTGG TGCCTGATCC CTTTGATGGT ACACAGAAGA AGTGGCTCGC CTTTCGATAC AGTTTTGAGG CATGGGCCGG AGCAAGTGGG CAATCTTTTG ATGCCTGCAT CTCACATGAC TCGGAGCGAT ATTCCCGTTC AGAACCAACA GCGACCTACA ATGACATCAA TGACGAACCT GATTCATTTA AATATGACTG GAACGTTAAG TCAGTTCGCA ATTCAAATAT CTTTTTTATG CTCAAGTCGC TCACGAGCGG GGGAGATGCA TGGGGCCTTA TCGAACCTTA CGAGGTTTCA AAAAATGGCC GTCATGCCTG GATCACCTTG TGTGCGTTCT ATGAAGGAGC CAGTCAGGTG GGCTTAACCA CGGAAGAAGC TCGCACTACA ATTCTGACAG CGAAGTATAC CGGACAATCC CGGAACTTCA CTTTTACCAA GTATGTTCAA AAGCATCTTA CTGGTAACAA CATATTGGCT CGCAACAAAG AGGCCTACAC GGACTCACAG AAAACAAACT TTTTCCTACA GGGAATTGTT GATCCTGAAC TTATGGCATT CAAGGCAGCT GCTGAAGCTA ACCTAAATGA ATGGAAGTTC GAACGCGTTG TCACGTACAT GCGTACTCAA GCCGCCAAGC TCACGAGCAA GGACGGTAAG GATTCCCGAA ACATTCGTCA GGCTACGGGC TTGTCGAAAA ACAGGAACAA CAAAAACAAC CGGCGCAAGC GCTCGGAATA CCAAAGCCAA GGCAAAGGTA ACAAAGAGTC GGGCAAAGGA AACAATGCTC CTAGTACTCA ACTCCGCAAG GACATCTGGG ATGAATTGTC TCCCGAGATA AAGGATGCCA TCAAAGCGGC AAAGCGTAGA GCGTCTACGG ACCCGCGCAC GGCTAAAAGA GCCAAGACTA GTAGTACGGA TAACTCTAAC GCAAGCGTTG AGTCCTACTC GCCTGATTTA AGGTCAATGT CTACTGAAAT ATTTAAAGCA GATGGTGACA AGGACTTGGC TTCAGGCCAG CCTGAGGCGA ACGATACACC ACTTCATTTG GAACTTGAAG ATACGCTTAA GAAACCTACA TATGGAGCAG GTACCCTATT TGGGCGATCT GCTGACAGGG TCTCCTTTAA TCGTATGGTA TGCAGTTCAG AAGAAAACAA AGTCACTCCT TGGCGCATGT CAGAACTACG GCTTGCGGAT GCAACAATAA GACGCATTTG TAAGAATCGC ACACGAAATC CTACCGGCCG TTCAACATGG GGCGAAGCTG CCATTGATAC TGGTGCCGAC ACAATTTGCA TTGGTTCAGG CTATACTGTA CTTGCTCATA CAGGTTGATA TGTGAGTCTG CGAGGTTTTC ATGACAGTGG TGATACTCTT GATCGAATTC CAGTTGTGAC GGCTGCTACA GCATATGACT ACGATGACGG AACCACCGTT ATTCTGGTTT TCCATGAAGC TTTGAATCTT GGGCCTACAC AGTCCACATC TCTCATCAAC TTGAATCAGA TTCGGCACGC CGGACATCAG ACTGATGACA TTCCGAAGTT TTTATCCCAA GGGAAATCTC TTCACGGAAT TGAAACAATT GATGGCGACT ACATTCCTTT TGCATTGAAG GGACGCACAT CATTGTTGTA CTCACGAGTA CCTACTCGCC ATGAGCTTGA GAACTGCCTG CACATTGATC TTACATCTGA TCAACCCTGG GATCCAAACA GCAAAGACTG GGAGGATAAT GAGCAGCGCT ACACGCGTCA TGACCGACAA CGGAATGCAC GCTATACCGC AACTGATAAT GAGGATGAGG AGAACTTTTA CCATGGGTAT TTCTCTCTCC CTGACTCTAA GGAGTTCCCG GTTCTACCGG CAAACAATAA TGTTATGAAC CCACATGATG TCGTACGCGA GATCAAATAT GCTACTGCAC GGGTTTCAAA ATCTAGCCCA CGGGATCTAG ATGTCGATCG AGACAAACTT CGCCGCATCC TGGGACATGT TCCTATGGAA GTAGTTGACC GAACACTGGA AGCTACAACA CAACTTGCGG AACGCTCTGG CAAAATGCCA CTGCATCGAC GTTTTAAAAC GAAGTTTGAA CAATTGCGAT ACCGCCGGTT GAAGTGTACG TTATATAGCG ACACTTTCAA ATCTACTGTT AAATCCTCCC GAGGACACAC GCATACCCAA GGGTTTGTAT GTGGTGATTC TTACTTTGTA TACCACTTTC TTATGAAAGC GGAATCCGAA GCAGACCAAG GTCTTGCGTC AATTATACAA GATATAGGAA TTCCGGCACA AATTCACACC GACAACGCAA AAGTGGAAAC CTTAAGCAAA TGGAAGAAAA TCACTTCCGG TCACTGGATA AAAGTCACAG TCACGGAACC ATACTCACCG TGGCAAAACC GTTGCGAACA CGAATTCGGT GCGGTTCGGA TCCAGACACG ACTTGTTATG GAAACGACAC AATGTCCAGA ACAGCTTTGG GACTACGCCA TTACCTACGT GGTAATTGTG CGTAATAATA CCGCTCGCAA AGCCTTAAAT TGGCAAACGC CCTTAACGGT TATGACAGGT GACACGAGCG ATATTTCAGA ATTGTTGGAT TTCGAGTTCT ACGAACCGGT ACAATATTTT GACAATCCTG AAATTAAATT TCCACAAGCT AAGGCTAAAG TTGGTCGGTG GCTTGGTATT GCAACAAATG TTGGACAAGC TATGTGCTAC TATGTCCTAA CAGACAAAGG AACCGTGATA ACGCGTTCCA CAGTCACACC ACTTCACAAA GTTGATTCGA CTGCTTTGCA AACCTCTCTT ACAGCTTTTG ATGCTATGAT AAGGGAGATT TATCAGCCTA CTGATTTTGC TCACAGCACT AAAAAGCAAG CTGCCTCGTT ACGACGAGAT GAAGCAATGA AGGTTGCCAG AAAAACTGGT GAACCTGAAG ATCCAGGAGT CCGTAATAGA CATGTTCTGT ATGACTTAAA TGAGGGAGCC GACCATGACC AAGTGGAACC AGGACTATCA GTTGATGATT ACTACGGTAA CGACGACGAA AAAGAGTCTG GTTCGTCGGA TCTCCTTGTC GGCAGCGAAG TACTCCTTAC TAAGGGAGGT ATACAACATC TAGACAAAGT CACCAAGCGT GATAAAAATG GCCAGCCCAA GGGCTCAAAC GAAACAACCA ATTATGTTGT TGAGTTCAAT GATGGTACTG AAGAGATTCA TGGATACAAT GCTCTGCTTG ACGCTGTGTA TAAGCAAGTC GATGATGATG GTAATGAATG GTATACTTTT GAAGATATTG TTGACCATCA AAGGCGCCCA CGTGGCGGCC GAGGACGAAC GAAAGGTTGG TTCCTCCGTG TTAAATGGGC CAATGGTGAA TACACCTGGG AGCCTCTTAC CTCTTTAA
|
Protein sequence | MARVRKATGP TRKGATETVP EERVEEETPF EAVESSSKDS DNETQPSSMG DDNDSQSEIE SYKIDTDIDF KYNPNFFEDK KALEIYRARL RAFATWLRSQ PDNVNITQEW TIPVMQLEMQ MKAQASPFGT SETNKTDKSV SSLVPDPFDG TQKKWLAFRY SFEAWAGASG QSFDACISHD SERYSRSEPT ATYNDINDEP DSFKYDWNVK SVRNSNIFFM LKSLTSGGDA WGLIEPYEVS KNGRHAWITL CAFYEGASQV GLTTEEARTT ILTAKYTGQS RNFTFTKYVQ KHLTGNNILA RNKEAYTDSQ KTNFFLQGIV DPELMAFKAA AEANLNEWKF ERVVTYMRTQ AAKLTSKDGK DSRNIRQATG LSKNRNNKNN RRKRSEYQSQ GKGNKESGKG NNAPSTQLRK DIWDELSPEI KDAIKAAKRR ASTDPRTAKR AKTSSTDNSN ASVESYSPDL RSMSTEIFKA DGDKDLASGQ PEANDTPLHL ELEDTLKKPT YGAGTLFGRS ADRVSFNRMV CSSEENKVTP WRMSELRLAD ATIRRICKNR TRNPTGRSTW GEAAIDTGAD TICIGSGYTV LAHTVVTAAT AYDYDDGTTV ILVFHEALNL GPTQSTSLIN LNQIRHAGHQ TDDIPKFLSQ GKSLHGIETI DGDYIPFALK GRTSLLYSRV PTRHELENCL HIDLTSDQPW DPNSKDWEDN EQRYTRHDRQ RNARYTATDN EDEENFYHGY FSLPDSKEFP VLPANNNVMN PHDVVREIKY ATARVSKSSP RDLDVDRDKL RRILGHVPME VVDRTLEATT QLAERSGKMP LHRRFKTKFE QLRYRRLKCT LYSDTFKSTV KSSRGHTHTQ GFVCGDSYFV YHFLMKAESE ADQGLASIIQ DIGIPAQIHT DNAKVETLSK WKKITSGHWI KVTVTEPYSP WQNRCEHEFG AVRIQTRLVM ETTQCPEQLW DYAITYVVIV RNNTARKALN WQTPLTVMTG DTSDISELLD FEFYEPVQYF DNPEIKFPQA KAKVGRWLGI ATNVGQAMCY YVLTDKGTVI TRSTVTPLHK VDSTALQTSL TAFDAMIREI YQPTDFAHST KKQAASLRRD EAMKVARKTG EPEDPGVRNR HVLYDLNEGA DHDQVEPGLS VDDYYGNDDE KESGSSDLLV GSEVLLTKGG IQHLDKVTKR DKNGQPKGSN ETTNYVVEFN DGTEEIHGYN ALLDAVYKQI LLTIKGAHVA AEDERKVGSS VLNGPMVNTP GSLLPL
|
| |