Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_21929 |
Symbol | |
ID | 7203051 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 111359 |
End bp | 115412 |
Gene Length | 4054 bp |
Protein Length | 1270 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182326 |
Protein GI | 219124051 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAACCA AGTTTGAGTC CAAATCGGCG CGGGTGAAAG GTTTAGCCTT TCATCCGGTG CGTCCCTGGG TATGTGCATC TTTGCACAAT GGTGTAATTC AACTGTAAGT TGTTTAACTG TCGTTGTGCC ACGGAGCAGC GGACAAAAAT GACCGTTCGT CTGCGGTGAT TTGTGGGGGC GTTGAGAAGG ATCGTGCCCG AATGTTTGTC TAACCTTTTC TGACTCGTGT TTGTTTCTGT ATCTCTTTGT GTAGTTGGGA TTATCGAGTA GGCACCGTCA TTGATCGATT TGAAGAGCAC GAAGGTCCGG TCCGTGGTGT CGATTTCCAC GTCTCGGAGC CGTTGCTCGT ATCCGGCGGT GACGACTACA AAATCAAGGT CTGGGATTAC AAGCTCCGTC GTTGCTTGTT TACCCTACTG GGACATTTAG ACTACATTCG GACGGTTCAA TTTCACAGTA CGTTTCCCTG GATATTGAGC GCGTCCGACG ATCAAACGCT GCGTTTGTGG GATGTTGATC GTCGCACCTG TTTGAGTGTC TTGACCGGAC ACAATCACTA CGTTATGTGC GCGAGCTTCC ATCCCACCGA AGATCTCATC GTGTCAGCGT CGTTGGATCA GACGGTCCGG GTGTGGGATA CCACCGGTCT CCGTAAGAAA CAAACGGGTG AGGCCAGTGG TGGGGGACAC ATGGATGGAT CCATGCGTCC GCCGTCCACC GGACTTAACG TGCAAGCCGA GCTGTTCGGA ACCAACGATG TTGTCGTCAA GTATGTACTC GAAGGTCATG ACCGAGGCGT TAACTGGGCG TCCTTCCACC CTACTTTGCC ACTCCTCGCC TCGGCAGCGG ATGATCGACA GGTCAAGTTG TGGCGTATGA GCGAAACCAA GGCCTGGGAA GTCGATACCC TCCGGGGACA CGCCAACAAT GTGTCCTGCT GCTTGTTTCA TCCCAAACAC GATCTTGTCG TCTCCAATTC GGAAGATCGC TCGATCCGTG TTTGGGATGT CAGCAAACGT GTCGGCGTCC AAACCTTTCG CCGTGAAGGC GATCGCTTTT GGATTCTCGC CGCGCACCCG ACGCAGAATT TGCTCGCGGC CGGACACGAT TCCGGTATGA TCGTCTTCAA ATTGGAACGG GAACGTCCCG CTTCGTGCTA CGGACCGACT TCCCAGCTCT ACTACGTACG CGGTCGGGAA CTTCTCCTGC ACGACTACGG ACGCGGTAGC ACCGGAGTCG ACGTTCCCAT TACCAGTCTC CGCCGTATGG GAACGCAGGC TCAAACGGAC GGTATAGGCT CAGCCCCGCG ATACCTTACC TACAATCATC ACAACCCGTC CGAAGGGAAT ATTTTGGTCA CTTCGGATGT CGACGGGGGT TCCTACGAAC TCGTTACCTT CAGTTTGAGC AATGCGAGTG GTTCCGTCAC GGACGGAAAA CGTGGTTCCT GCCTTGGGCC AGGTGTTTTT TTGGGACGCA ATCGCTTCGC CATTCTTGAT CGCCAGCGAC AAATTGTAAT CAAGAATCTA CAAAATGAGA CGACCAAACG TGTTCAACCA CCTGTTCCCA ACGTGGACGG ACTGTTGGAC GGTGGCGCTT CGGGTCGCGT ATTGTTGCGC GCCGAAGACC GTGCCATTCT GTTTGAAGTC CAATCCCGAC GCGTGCTGGG AGAAATCACG GCTCCCAAGA TCAAATCGGT TGTCTGGAGC CCGGATGGAA GCAAAGTGGC CATTGTCTGC AAGTACGGTG TCGTTATGGC GGATCGCAGC CTCGAGCAGT TGTGCTCAAT TTCGGACAAC GTCCGCATCA AGTCGGGTGC GTGGGACGTC AGTCCCACGG GCGGCACGGC CTCGGAACTA TTTGTCTACA CCACCCTCCA TCACGTCAAG TACTGCTTGC CGTCGGGAGA CACTGGTACG ATCCGTACGC TAGATCAACC GCTCTACGCG CAGCGTATCG TCAAGGACCA GCTTTTCTGT CTCGATCGCG AAGCCCGACC CCGCATTCTG AGTCTCGACA CGACCGAAGC CCTCTTCAAA CTGGCGCTGT CGCAGCAAAA GTACGGCAAA GTTATGCACA TGGTCCGTCA CTCTCGCTTG TGCGGGCGCG CCATTGTTGC TTATTTGCAA AACAAGGGTT TCCCGGAAGT CGCATTGCAC TTTGTGCGGG AACCCCGGAC CCGTTTTCGC CTCGCCTTGG CGTGCGGAAA TATTGAAGCC GCCATGGAAT CAGCCTTTAC ACTGGAGCAA AAAGCTCAAG CGGAAGGCAA GGATACCGGA CGAGACGTTT GGGGCGAATT AGGCAGTGAA GCGTTGCGTC AAGGCAATCA CCAAGTTGTC GAAATGAGTT ATCAACGCAC GAAAGACTTT GACCGCTTGT CGTTTTTGTA CTTGATTACC GGTGATACAG ACAAGTTACG CAAAATGCTC AAAATTTCGA ACATGCGTCA AGACATTATG GGTCGCTACC ACAATGCCTT GTTGTTGGGC GATGCAGCCG AGCGTGTGCA CGTCTTGGAG GAGTCGGGAA ATTTGCCGCT CGCCTACATC AGTGCAACCT TGCATGGTTT GATGGAAGAC GCGGACCGGA TCAAGATTAC TATCGAAACA AATGGTGGCA GTGTGGATGG CCTTATGGAC AAGGTTTCCG CCGAGGCTGG GGATAGAAAG ACACACTGTT TGCTGCAACC CCCCACTCCT ATCCTCCGCG CCAACAACTG GCCAACACTC GAGGTACAAA AGACGACTCT TGAAGACCTA TCGGCAGCCG ACGGTGAAGC TCACGAAGAG GACGGTGGTG AATATCACGA TGCAGCAGCT GCGGCAGCGA CTGAGTTGGG TACGGAAGAT TGGCAAGACG ACGACGAGGA TATGGGTATG GGTACCGGCG CCGCGGCAGC GGCTGCTAAT GACTTGGACT TTGGTGCCGA CGACGATCTC GGCGACTGGG GCGACGATCT GGATGAACTC GGCGACCTGG GTGAACCGTC ACATCGTGAG GCTGACGAAA TGATAGACGT TTCGGAAGTC GGAGAAGTTG GTGACTTTGT TATGCCTACT TCTGGACGCC CTCCTGCTGG TTGTTGGGTA GGCAATAGTT CACACGCGGC CGATCATCTG GCAGCCGGAG CTGCGTCTTC AGCGTTACAA TTATTGAATC GTCAAATTGC GGCGAGCGAA TTCGCTCTAC TCAAGTCAAA TATGATCGCT TGCTATTTGG GTTCCATGAC GAGCGCTCCT GGTGTTTCGG GCAGTCCGAG CATGTCCATT CCGTTGCTAC GAAATGATGT TAACGGACAT CCGGGTGCGG AAAGTCTGCC TCGTACACCC TTGACTTTGA AGCAAACGGT AGCCGGGATT CGCAATGGAT ATCGCTTCTT CCAGGGTGGA AAGTTCAACG AGGCCAAGGC AGCTTTTGTA TCAGTGTTGG CCGAAATTCC GCTTGTAGTT ACCGGCAACC GGGCAGAAGG CAACGAAATT AAGGAAATGC TCAGTATTTG CCGCGAATAC ATTACAGCAA TTCGGATCAA AGCGGAAATG GCAGCAGCTG CGACTGACCC GGTCCGCTCC ACTGAGCTGT CAGCCTACTT TACTCACTGC AACCTGCAAC CGGTCCACTT GCTGCTTGCT CTTCGTGCTG CCATGGGAAC GGCCTTCAAG AACAAAAACT TTATCGTGGC TGCCAGCTTT GCGCGTCGTT TGTTGGAGCT TCCAGACATG AGTAACGAAC GCAATGCAGA ATTGCGAGTC AAGGCAACTA AGGTGTTGCA GAAGAGCGAG CAAATGGCCC GAAATGAGCA TCAGCTGAAC TATGACGAAA CGAAGACATT TGCGATTGAC TGCAAAGACT TTGTCCCTAT TTATTCGGGC GACAGCTCGA CGCAGTGTTC ATACTGCGGA TCTTCCTACG CGGACGAATC TATGTCGCAC AGTCTGTGCT TAACATGTGG ATTTTGTGCT GTCGGGATCC AAACCATCGG GCTCGTCACT GGATAAATTT CTGCCGGATG TTCTACTCTA TGTTGGCATA TCTACTAATG CAAGTTTTAA ATTC
|
Protein sequence | MLTKFESKSA RVKGLAFHPV RPWVCASLHN GVIQLWDYRV GTVIDRFEEH EGPVRGVDFH VSEPLLVSGG DDYKIKVWDY KLRRCLFTLL GHLDYIRTVQ FHSTFPWILS ASDDQTLRLW DVDRRTCLSV LTGHNHYVMC ASFHPTEDLI VSASLDQTVR VWDTTGLRKK QTGEASGGGH MDGSMRPPST GLNVQAELFG TNDVVVKYVL EGHDRGVNWA SFHPTLPLLA SAADDRQVKL WRMSETKAWE VDTLRGHANN VSCCLFHPKH DLVVSNSEDR SIRVWDVSKR VGVQTFRREG DRFWILAAHP TQNLLAAGHD SGMIVFKLER ERPASCYGPT SQLYYVRGRE LLLHDYGRGS TGVDVPITSL RRMGTQAQTD GIGSAPRYLT YNHHNPSEGN ILVTSDVDGG SYELVTFSLS NASGSVTDGK RGSCLGPGVF LGRNRFAILD RQRQIVIKNL QNETTKRVQP PVPNVDGLLD GGASGRVLLR AEDRAILFEV QSRRVLGEIT APKIKSVVWS PDGSKVAIVC KYGVVMADRS LEQLCSISDN VRIKSGAWDV SPTGGTASEL FVYTTLHHVK YCLPSGDTGT IRTLDQPLYA QRIVKDQLFC LDREARPRIL SLDTTEALFK LALSQQKYGK VMHMVRHSRL CGRAIVAYLQ NKGFPEVALH FVREPRTRFR LALACGNIEA AMESAFTLEQ KAQAEGKDTG RDVWGELGSE ALRQGNHQVV EMSYQRTKDF DRLSFLYLIT GDTDKLRKML KISNMRQDIM GRYHNALLLG DAAERVHVLE ESGNLPLAYI SATLHGLMED ADRIKITIET NGGSAGDRKT HCLLQPPTPI LRANNWPTLE VQKTTLEDLS AADGEAHEED GGEYHDAAAA AATELGTEDW QDDDEDMGMG TGAAAAAAND LDFGADDDLG DWGDDLDELG DLGEPSHREA DEMIDVSEVG EVGDFVMPTS GRPPAGCWVG NSSHAADHLA AGAASSALQL LNRQIAASEF ALLKSNMIAC YLGSMTSAPG VSGSPSMSIP LLRNDVNGHP GAESLPRTPL TLKQTVAGIR NGYRFFQGGK FNEAKAAFVS VLAEIPLVVT GNRAEGNEIK EMLSICREYI TAIRIKAEMA AAATDPVRST ELSAYFTHCN LQPVHLLLAL RAAMGTAFKN KNFIVAASFA RRLLELPDMS NERNAELRVK ATKVLQKSEQ MARNEHQLNY DETKTFAIDC KDFVPIYSGD SSTQCSYCGS SYADESMSHS LCLTCGFCAV GIQTIGLVTG
|
| |