Gene Information |
Locus tag | PHATRDRAFT_31544 |
Symbol | |
ID | 7196082 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 326727 |
End bp | 330132 |
Gene Length | 3406 bp |
Protein Length | 1033 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176564 |
Protein GI | 219109619 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTTC ACACCAGGAT TGTCACGTCC ACCACCAATA GAAGGCTCCA TCTGCTAGGG CTAAGGCGTG GTGAGGAATT ACTTGGTATG GACAGTGGGT GGGAAAATAC GACAAAGCTT TTTCGGCATT TGCAATGTCG GATTCTTTCT TCACAAAGGA CTTCAAGCCA ACCGAATCCC CGATACGATC CCACTGCGAA GCGACCAAAC AAGGTTTGCG ATCCATACGG CCAAGGCGGA AAAGCCATGC CTCTGTCAGA TATTCAGGCT TTTCAGGCGA CCATAGACGA TCAATGGAAA GTGACGGAAG ACGGGACGGC TCTTGTCCGG GACTTTGTAC ACGCAGACTT TCTGACAGGC GCTCGTTTTG TGCAAAAGAT TGCCGCCGTC TCTCAAATGA ACAGCCACTT TCCAATAATC GGACTACAAA GGTGTATTGT CAAAAAAAAT TGGCAAGTGG TCACTCGTAT TGAATGTAGT ACAATGGTTT TGGGGGGCCT GTCGGCTCAC GATTTCCATT TAGCTATGGT ACGTGCACAT CACCTCTCGT GGTGTATCGC GAGATGCAAA TATGAACGGT AACGAATCTG ACATTTTGAG ATTGTTTATG CTACTAGCTG ATTGATGTTG AAACCGCGCG ACCGGAGGTG CATGCTCTAC TGGCGGCTGC GGACGACAAG TAATGAAGAG AGGACATTGT AGCATCTTAA AAGGTATCTA TATGGACGTT GTGCTCTGCC AAAACACCTA TCTGACAAAG GTAGCGCTCT TGCTTTATAC GAAAACAGCA CGTGTTACTG TTGGAAAATG TAAAGATAGC CTGACATATG TTCCCCAATT GTGACATAGC TTTCGTGAAT GAATATCAGA AGGCTTTTAA GATCGATCGT CGACGTGTCT CGGGCCGCTG GTTTTCCGTG TCAAATTTAT GTCAGTCAAC TCGAATCTCG TCTCCAACAT CACGTGCTAA CAGCGGATAC AATTGCTTTT GTAGCGTCGA ACATCAAGCG CTGCAATATA AACGCAAACA ATTCGTTTGA TGGCGAGGAT GAAGACGAAG TGTGTGGTAT GGGTATCTAA TGCCATTGTA GCCGCGTTGG CATTTCAGCC GTCCTATCAG AACCTTCCCC TTAGACCTGC GCAGCACCTG CGACTGACTA CATCCGGATC TAGCGATGAT GGTTTCAGCC AAGAAAACTC GGAGACCGGT ATTTCTCTTT CTCTCCGAAA CACGGGTACG AATGAATATA CCAACCCCGT GAAGCAGCGA ATGAGCTATC CTTCATCGAA ACGCCAAAGG AAACAACACC GGGACATTGT GATCGTCGGC GGAGGTCTGG CAGGCTTGTC AGCAGCCCTG TACGTCTCGC AGATTGACCC GACACGACAT GTAACTATTC TTGATAAGCA GGATCCGGAG TCACACCGTT CTAAAGATTC TACTGTTGCT AGCTACGCAG CGGCGGGCAT GTTGGCCCCG AATTCAGAAC GCTTACCAAA AGGTGATTTA CTTAATCTAT GTTTGGAAAG CAGAAACATG TATGGAGACT TTTGCGACAT GGTCGAGTCT TTGGCTCAAG AATCTGGAGA AGAAGGAATG AAATATTTGG CAACATCCTC ACAGAGCGCT GACGGATTAG AACCCTGGAG TATTGGATAC GTTGCTTCAG GTGGTTTCTT GGCTCCAGCA TTTGCAGGTG ATTCAGTCGC CACGTGGGCC CCACCAGATG ACGGTGGGGC AGCAACATGG CTGGACGCTA CCCAAGCACG AGAGCTAGAA CCTAACCTGC ACCCAGACGT 
TGTCGGCGCC TACTGGTTTC CCGAAGACGC TAGTGTCGAT GCTCGGCGAT TGACGAATTC TTTGCGGGCT GCCTGTGTAG CGGCAGGAGT CCAGATTCTG CACGGACCCT CCAACGAAGT CACATCACTG GATCTCTCAG AAGGGATCTG CAAAGGTGTC CGGTTGCAAA GTGGACGTTA TCTGAGTTGC AATTCGATTC TGGTCGCCAA TGGTGCATGG ATGCGCAATC TTTTACCGGT TCCTATTGAG CCACACAAAG GCCAATCCTT GTCGCTTCGC ATGCCGAAAG ATCGTCCGCC AATTCTTAAG CGCGTTCTCT TTGCCCAAGA TTCATATATT GTACCGAAGG CAGACGGTCG CATTGTTGTC GGTGCGACTG TAGAAGCAGG AAGCTACGAT CCTAACGTGA CGCCTGGCGG TCTTTTGCAC ATTTTGACAC ACGCATTGCA GCTGGTACCC GCATTGAAAG ACCTTCCCAT TGAAGAAACA TGGGCGGGAC TTCGTCCAAC CACGCCGGAT AAAGGTCCAA TATTGGGAAA AACACCGTGG GAAAACCTGT ATTTGGCTGG AGGGTACTGG CGAAATGGTG TCTTGTTGGC TCCAAAAACT GGAGAACTAC TGGCTGCTCT CATGACCGGA CAAGAAATTG ACGAGCAGGA TCAGGCGATG TTGGATGCGT TTGCTTGGGA TCGCTTCACG AACAAGGACG GTGGCGATCG CCTCTCAGCC AACGCAAGGT ACGCCGCCTC GATGCACCCG ATACATAGTC GAAAGTCTGG TGCTGGCGTC GCAGCCTCGG TTGGAACGGA ACTCGGAACT TACTCAAGCG CTCGTTCGGC GAAAGAAGAA AGACAACAAG ATCGTAATTC ATTGTGGAAC GAAAATGGAG ACGGAGACGT TGCTTTTGAG CGGGCAGCAA CAATGGGACG GAACGACGGG GCGGCGTACT CTTTTGGAGA CGATGAATCT CCGTATGAAC GAAAATCCGT TTCACAGTCA ACAGCACAAA CAACTCCTTC GTTTGAGGAT CCCAAAAGTT CAAAGAGGTC ACTGAAGGCT TCTGATACTG TGGATGCGTA TACGGTAGGA GCGTCGGACG AGATTCAGGA CTCTCATTCT GCGGAGACAA AGGCTTCCGA TTTGACTGAC ATGTATGAAA AAATTAGGGC AAACAAGGCA AAGAAAACTA CGACCTTAGG TGAAAGCGAT GGCGACGAGG AGGTACGTCC CGATCCTGGC TTTCGAATAT TTTATAAAGA TCCAGAAACA GGTGAACGGC ACGAAGTCCC TCCGTACACA TCGCCCGGAG TGTTCCAGCA AAAACTGCAT GCGAGGAAAA AGTCAGAGCG ATCCGCGAAC GGAACCAGGA ATGATGTCCC AATCAGTGAT GTTGTTGCAC CTTCGCCAGC GGCGAATGGC AACAAGGAAG CGCAGCAATA CAGCGAAACC ACCTATGACG GTTACCAAGA GATTCAGTCG GCTAACTCAC GACAAACTCG AGCAGAAGAA TTAGAAGCGA TGCGAATGGC ACGACAGAGT AATCGTGTTG GCCAAGAAAG CATCAAAGAG TCGGATATTG GCGCCCAACC GATGGGCGAC GAGTAG
|
Protein sequence | MSFHTRIVTS TTNRRLHLLG LRRGEELLGM DSGWENTTKL FRHLQCRILS SQRTSSQPNP RYDPTAKRPN KVCDPYGQGG KAMPLSDIQA FQATIDDQWK VTEDGTALVR DFVHADFLTG ARFVQKIAAV SQMNSHFPII GLQRCIVKKN WQVVTRIECS TMVLGGLSAH DFHLAMLIDV ETARPEVHAL LAAADDKRLL RSIVDVSRAA GFPCQIYVSQ LESRLQHHVL TADTIAFVAS NIKRCNINAN NSFDGEDEDE VCAALAFQPS YQNLPLRPAQ HLRLTTSGSS DDGFSQENSE TGISLSLRNT GTNEYTNPVK QRMSYPSSKR QRKQHRDIVI VGGGLAGLSA ALYVSQIDPT RHVTILDKQD PESHRSKDST VASYAAAGML APNSERLPKG DLLNLCLESR NMYGDFCDMV ESLAQESGEE GMKYLATSSQ SADGLEPWSI GYVASGGFLA PAFAGDSVAT WAPPDDGGAA TWLDATQARE LEPNLHPDVV GAYWFPEDAS VDARRLTNSL RAACVAAGVQ ILHGPSNEVT SLDLSEGICK GVRLQSGRYL SCNSILVANG AWMRNLLPVP IEPHKGQSLS LRMPKDRPPI LKRVLFAQDS YIVPKADGRI VVGATVEAGS YDPNVTPGGL LHILTHALQL VPALKDLPIE ETWAGLRPTT PDKGPILGKT PWENLYLAGG YWRNGVLLAP KTGELLAALM TGQEIDEQDQ AMLDAFAWDR FTNKDGGDRL SANARYAASM HPIHSRKSGA GVAASVGTEL GTYSSARSAK EERQQDRNSL WNENGDGDVA FERAATMGRN DGAAYSFGDD ESPYERKSVS QSTAQTTPSF EDPKSSKRSL KASDTVDAYT VGASDEIQDS HSAETKASDL TDMYEKIRAN KAKKTTTLGE SDGDEEVRPD PGFRIFYKDP ETGERHEVPP YTSPGVFQQK LHARKKSERS ANGTRNDVPI SDVVAPSPAA NGNKEAQQYS ETTYDGYQEI QSANSRQTRA EELEAMRMAR QSNRVGQESI KESDIGAQPM GDE
|
| |
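The derived fields in this record (gene length, GC content) can be cross-checked against the coordinates and sequence with a few lines of Python. The sketch below is illustrative only: the function names are hypothetical, the coordinates are assumed to be 1-based and inclusive (which is consistent with the 3406 bp reported above), and the intron estimate assumes the gene span runs exactly from the start codon to the stop codon with no untranslated regions.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def join_blocks(blocked: str) -> str:
    """Strip the spaces and line breaks that group the sequence into 10-base blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a (possibly blocked) nucleotide sequence."""
    seq = join_blocks(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def coding_bp(protein_len_aa: int) -> int:
    """Coding bases implied by a protein length, counting the stop codon."""
    return 3 * (protein_len_aa + 1)


if __name__ == "__main__":
    span = gene_length(326727, 330132)
    print(span)                    # 3406, matching "Gene Length"
    print(coding_bp(1033))         # 3102 coding bases for a 1033 aa protein
    # Under the no-UTR assumption, the remainder is intronic sequence,
    # consistent with "Is gene spliced | Yes".
    print(span - coding_bp(1033))  # 304
```

To check the reported GC content, pass the full "Gene sequence" field (both lines) to `gc_percent`; the blocks and the line break are removed before counting.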