Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45922 |
Symbol | hUfd2 |
ID | 7201009 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 677620 |
End bp | 681914 |
Gene Length | 4295 bp |
Protein Length | 1121 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180093 |
Protein GI | 219118650 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGCTTTGGA ATCCGTATCA GTTCACCAAA GACCTAACTC TATAAAAAAG CCCAGCTGTT TCTAAAGGCA ACGCTATATT TGCAATAAAA AAGAAAGAAC TCTTGTCCGA ATCATCAAGG TTGGAAGAGT ATGAGTGGAT TTTTACCCGA TCTGGCGAAC TGGGCCTTGC GCGGGGGCGC CGGCTCCGGC CGGAACGCGG AAGAGGGAAG CTCCCACGAG GAAGAAAGCC CGCCGGATAA TACGGGTCCG GTCTTGACGC CGGAGGAAGT GAGGGCACAG CGCTTGGCAC GAATGGACGC ACTGCAACAG GCGCAACGCC AAGAGCAAGC CGCCAGTGTC TCTCGTTCTA CTGATGGGGA TGCGTTGGCG TCCCGCGAAG CGCAATCCTC GGAAGGCAAC AACAACAACA ACAACAACAA CAAGTTGGAA TCTTCCACGG GCAGCGCCAC TCCGCAACCC ATGGATACCG ATGAGACGAG CTCGTCTCCA CCCATATCGA AACCCACTGA CGAACGGAAT TTGTCCGAAG CGAAGGAACA GACAAAGAAG AAGGCCAAAA ACGAACCGAT GGATCCCGAA AAGAAACTAC AACGGAACAA AGAAATGTTG CTGCACAAGA TTTTGGAAAT TACGCTCAAA GGGAGCAGTA TGGCCAAATC CAACTCGGCG TCAATGGCGC TCAGTATGAA TGCTTCTTCC TCTGCTGTTG TCGTCGACAT TGGCGACACC GCAATTACAG CACAGACGAT TGCCGAGATT TTAGCAACGC GTCTATCGCT GCCGGCCATT GATCCGGCCC TCAATACGGT CCCTCCTCCC AAACCCTTAC TGGTTTACCT AGGTCTGTGT CATCGCAGAG CGTCGGAGGA ACTGAAAACA TTGCGCCAGT CGTCCAAATC ACCTGACACC GAGATTATGG ACATTCTAGA GGAGTGTCAA CGTCAGGTAG TCAACTACGC CGCTTCAACT TTGATGGAAC CCGACTTGTT CGAACTGGGT GCTGATGGTG CTCTCCAGCT TGCCAAATGC TTACTATCGA GCGTGACGGA ACATAACACG GCAATAACCT TTGGTATGAA TGGGGCGGCG TCATCCTTTT ATCATTTGGT CTGCGACGAA CTCGTACAGC AGGATTCCAG TGCATTGTTT ACTGCGATCA ATTCAGTGGT GGACTACTTT TCTGAGCTTT TAAGAAAGTG CGAGTCCGTA GTGGATGGCG TAGAGGGTGC GGACGGTAGT CCTATGCTGA TAGTATCGGC TTTGGCCTCC ACGTGCCAAC ACAAAAAGGC GGCCGAAGCA GTCTCGCAAA GTGCTTCGTT TTTGCTCCCG GCCGCAGGAA CACCGGAAGC GGCCCAGATT GTCCGACCAC CAATGCCTGC CAATATGACT GGTAACAGTT TCTTGCAAAT GCTGGCGGGG GAAGGGAATC GGCCTTACAT GAAACGCTCC GGACCGGGTC TAGAAAAGGA AACGCTCCTG GGTTTGATTC TGCGGGTCGG TGCGCCCAAG AACAATACGG CCTTTTCGCC GTCTTCCGTG TTACGGCAGT CACTTGTATC AATGGAAAAC ACCAATACCA CTCAACGAAG TCAGTTGCGT GCACATCAAG AGGCGTGTAA AAATCTAGTT CTAAATTTGG TCAAATCAGG TGCGAGCGCG AGGAGTCAAG TGATGCAGTG GTTTATGGAT GCCTTGGACG TGAACGCGAA TGCGTCAGCA ATGCGACCAG ATCCGTCCAA AGTCAGCTCT TCGTCGCTGC TACTCAACAT GTCCGTTGTT TTGCTCAAGC TGTGCGACCC TTTCGTCGAC GACGGAAAGA AGCAACATTT GATTGACCCT GGCTTTGTCT CATCACTCGA AGCTCACAAT GGCGTTTTCG CAACCAGCGG AGAACACGCC GTCAGTCGAC TTGGAGAGAT GGATGACAGT CGAATGATAG ATTCATACAG TCCCAAGAAT TCTTTCATCC CGCAATGTTT CTTCTTGTGT GCCCGGTCGC TGCATTTCGG CATTGTCCCA CAATTATCCT ACCACGAATC TTTGTTGCGA CACATTTCGC ATCTCCATTG GCAGATTTCT AACCGCAACG GTGATCTACA AAGCGACCCG CAGTTTGCGC TCATGGTGAG CAAGCAACGA TCCAGCGAAG TTGCGCTGTT TGAGGAGGAA ATGGTAAAGG ATACACTTCG TTTTGGCAAC TTTGTTGCCA AGGTTTTATT CGATATGGAT GATGACACAC TTCGTACTAT GCCCGAGGAC TTTGTGAGCG ATATGTGCGA CATCATTATG GCGATTGCGA AACTCAAACC GAAGATGCTT CGCAATTTGG AGTTCCGATA CGTGTTTAAA CTGGTGGTCA AGCTCCTATC GGCTAAATAC GCTAGTGTAA GTCTGATGAT GGCGATTGTC GTTGTCCCTG ATAACATGGC ACATCTAACT TGATCCTCAT TTGATATTAC AGATGGTACG AAATTACAAT TTACGCGCCA TGCTCGGCGA TGTTTTGTAC GAATTATTCA TGCCACCAGA GACAGGCGAT CGCCGGGACG TGCCGGCATC GGTGTCGACG GATCTACTTG CGGGGGGACA AACGTTTGTG CTTTCCGATA CTGCGGCGCA AGAAACTCTT GCTCCATCCT TATTGCTGCT TTACGGCGAG GTTGAACACA CAGGATACTA CGATAAAATG AGTCACCGCG CCAAGATTGC TTCTCTAATA AAGTATTTGT GGAACAGTCC GGAGCATCGT CCTGCCTTCC GTCGGATCAC ACAGGACCGC GCTTCTTTCA TCAAGTTTGC TAACGGGATC ATTAACGAAA CTAATACCCT AATTGCCACT GTCATGCAGA AGCTTCCGGA AATTCGCGAA GCTCAAGAAA AGATGAAGAA TCAGCAAGAT TGGGGACGAC TGACGGAAGA TGAACAAAGC CAAGTTTCGA GCCGTTTGGA CGATAACGAG CGAGAGGTAA AATACGCATT ACCACTGTGC AACAAAACTT TGCAAATGTT TGGCTACTTG AATACAGATG GCGACATCCG GGAGCTGTTT CTACTGGAAG AACTGTGTCC TCGTTTGGTG GCCATGCTGT TGCATGTTCT CACAAAATTG GTCGGTGCCA AAGGACTGGA CCTCAAGGTT GATAATCCGG AACAGTACGA CTTTCGTCCC AAAGAAATGC TCCGTGACTT GTGCGCCATC TTCAGTCTCT TTGCGTCATC TTCCGTCTTC CAAGTTGAAT GTGCCAAGGC GGGTTGTGAC CCCAATCTTT TGCGGTCGGC TGTGAAGACC ACTCGCAAGC TCAACCTGCT GACAGGCGAG TCCATGATCG CTTTCGAATC GTTACCGGAA CTAGTGGAGC TCGCGAGTCG GACGGTACTG GCCGACGAAG CCTTTTTGGC GGACGCCCCC GACGAATTCT TGGACGAAAT TTTGTCCACC TTTATGAAAG ATCCCGTGGT TTTGCCGTCT GGTCACTTTG TGGATCGCTC GACAATCACG CAACATTTGT TGAACGATCC GATTGATCCG TTTAACCGCG AACCCATGAC GGTGGAAGAT ATTCGCCCCG CGACCGAGCT GAAGGCCCGG ATGGACGCGT GGTTGGCGGG GAAGAAAGGC TTTGCTCCGT AAGGACTAGA CGTGAGCGCC ACATTTTTTT GCGCCCACGT ACTGAGCATC AGCAACGCAT TTCTATACAC AGGTAAAGAA CCATTAGAAA GAGCAACAGA AAAACTATTG GTCTCTTGCT AACGCATCGT GCATAAATGA AGAAAGGGGT TCGGTTGCGT AGAGAGGGGT TTGTTTGTCG GTATAAAAAC GTAGATTTAG GGAGTCGCCT TTTTGTTTTC TCAGAAAGGT CACGTAGCCG CTACTTGGAC TTTGCGATTG CGCAGCGTGA GAACCCGCAG AGTATGGTTG TTGTTGTTGG TATGAGAATC GATCGCTATC GCTGGCGCCG TATCCGTGTG CTTTGTTTTT TTGGACGGCG ATCCTTTCGG AGTGATCGTA CTCGTACCAC GGTCCTCGTC CAAATCCGGC TCCACTCCGT GCGACCTCTT GAGATGCACC GACCAGAGTT CTTCACCGAC GGTGGGCCCG TCGCGTTTGG GTCCTGGGGA GAGCCTGTGC GGCACACGGG GATCGTCGTC ACGATCCTTG TCGTCGGGTC CCGTTTTTTG CTGACGGTGC AAATGGACAA GGTACAGCTC TTCCCCTACA CTGGTAGGTT TGGTCGCGGG CGTGGGGACT TTCCGTCCGA CAAGTTTGGA CTTCTTTGCC ATGGTGACAC AGTGC
|
Protein sequence | MSGFLPDLAN WALRGGAGSG RNAEEGSSHE EESPPDNTGP VLTPEEVRAQ RLARMDALQQ AQRQEQAASV SRSTDGDALA SREAQSSEGN NNNNNNNKLE SSTGSATPQP MDTDETSSSP PISKPTDERN LSEAKEQTKK KAKNEPMDPE KKLQRNKEML LHKILEITLK GSSMAKSNSA SMALSMNASS SAVVVDIGDT AITAQTIAEI LATRLSLPAI DPALNTVPPP KPLLVYLGLC HRRASEELKT LRQSSKSPDT EIMDILEECQ RQVVNYAAST LMEPDLFELG ADGALQLAKC LLSSVTEHNT AITFGMNGAA SSFYHLVCDE LVQQDSSALF TAINSVVDYF SELLRKCESV VDGVEGADGS PMLIVSALAS TCQHKKAAEA VSQSASFLLP AAGTPEAAQI VRPPMPANMT GNSFLQMLAG EGNRPYMKRS GPGLEKETLL GLILRVGAPK NNTAFSPSSV LRQSLVSMEN TNTTQRSASA RSQVMQWFMD ALDVNANASA MRPDPSKVSS SSLLLNMSVV LLKLCDPFVD DGKKQHLIDP GFVSSLEAHN GVFATSGEHA VSRLGEMDDS RMIDSYSPKN SFIPQCFFLC ARSLHFGIVP QLSYHESLLR HISHLHWQIS NRNGDLQSDP QFALMVSKQR SSEVALFEEE MVKDTLRFGN FVAKVLFDMD DDTLRTMPED FVSDMCDIIM AIAKLKPKML RNLEFRYVFK LVVKLLSAKY ASMVRNYNLR AMLGDVLYEL FMPPETGDRR DVPASVSTDL LAGGQTFVLS DTAAQETLAP SLLLLYGEVE HTGYYDKMSH RAKIASLIKY LWNSPEHRPA FRRITQDRAS FIKFANGIIN ETNTLIATVM QKLPEIREAQ EKMKNQQDWG RLTEDEQSQV SSRLDDNERE VKYALPLCNK TLQMFGYLNT DGDIRELFLL EELCPRLVAM LLHVLTKLVG AKGLDLKVDN PEQYDFRPKE MLRDLCAIFS LFASSSVFQV ECAKAGCDPN LLRSAVKTTR KLNLLTGESM IAFESLPELV ELASRTVLAD EAFLADAPDE FLDEILSTFM KDPVVLPSGH FVDRSTITQH LLNDPIDPFN REPMTVEDIR PATELKARMD AWLAGKKGFA P
|
| |