Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47504 |
Symbol | |
ID | 7202281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 832197 |
End bp | 834751 |
Gene Length | 2555 bp |
Protein Length | 812 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181634 |
Protein GI | 219122609 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.467202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGTTG TACGGAACAC CGCCATTCGA CAATCGCTAC TCTTCCCTAC ACAAGCCTTT TTCTCCGGCA CCAGGCTACC GTCGTCCATT ATGGAGCGGT GTCGGTCGTT GAGGATCATG ATGGTGCTGC ATCTAATTGT ACTGGTAGTA TTAATGAGTG CACCAGTGAA TGCTGCCGTC TTTCCCGGGC ACAGCAGCAG CAGCACGGCA CCGTCAACAA CAACAAAAGG GGCGACAACG ACATATATTC GTCGAGAATT GCAAGAGGAG AAAAAACAGA GCATTGTACA ACAATACGAG CTATGGGAGG CGGAAGAAAT CAGTTCCCAG ATACAAAAAT GGGCTTCCCA CTATCCCGAT CTACTCCGCG TCACTACCTC GCAAGAAGCC TACGGCTTGC CTCGAGCCGG AGGGGCCGAC GATTGTCCGT TCGACAAGGG CGGCGACGGT TGTCTGAACT ATATACTTAC ACTACAAGAC TTTGTGAAGC ATCCGGAGGG ATCGGCGACG TCCAATCAGC TGCCGGAAGT CCTCTGGAGT GGTGAAGTCC ACGGCAACGA ACGGGTCGGC CCGACCGCCG TACTGGAAGC CGCACAGCTT CTCATGGAAG CGGCTTCGTG CATCGCACAT CCCCGGGTAG CACTGCGAGA TAATCCCAGT GCCTGGAAGC TAGAATTGAC CAAAGCACGC TCGTGTCGGG AAGAAATGAA GCACATGGGC CTGGATGAAA GCCACATCCA GTGGTTGGCC CGCCTTGTGT CGACTCGACG GATTGTGATT ATCCCGACCG CGAACGCACT CGGGTATTTC CGTAAAGTGC GCGAAGAAGG CAATGTCGAC CCGAATCGAG ACTTTCCCTA TGATTTGACT GATCCCACGC TCTGCATGCA ATCAGTTGCC GCCCGGACTT TGAACGAGGT CTATCGGGAG CACTTGTTCC AACTTGCCCT TACCTTTCAC GGTGGTATGG AAGCGATTGG GTATGAATGG GGGGCTCCAA CTTGGAAATC GAAAAAATCG CCGGATGATT TGGCACAACA AGAGATCGCC GACGCGTACA GTCGCTACGC CGGGGGATGG TTCGGAACGC GAAACTACCA ATACGGCCCC ATGAACGATC TGGTGTACCC CGTGCGTGGT GGGATGGAAG ACTGGGCCTA CGCAGGCTCC TGGGATACTG ATAGAGTCAT TGCTTGCCGA CCCACTACAT TTGGGGGCTA CCCAGAAGCC AAGACCGTCT ACGACAACTC CACTTTACGG GTCTTTAATA TGCTGGTGGA AACCAGCAAT GACAAAACTC CGCTCAAGGA TCAACTCGGC ACATCCTTGG ACGTTTTGAA CTCTGATACT ACCGGTAACG GCCACGTGTC CCGCAACATT CGATTGGCCT TGTTATCGGC GGATTTGGTA CAGCCTTATG TGACACTGCA ACGAGTGAAC GACTTGCACC TGTCCGACGA TGTAGTATCG CTCTCACGAG AGGATGGTCA CAGTTGTCAA GGTACACGCA CAGTGACGAT CTCCTCAGAA CGCCCAACGG TCACGTTGGA GTGGACGGTT GGAGGGGCAC TACAAACCCA TGAGACACAA GTGTTTTACG CCAAATGGAG CGATCTTCCC TTGGAAAAAC TCGACTGTGT GAACACACCG AACATTCAAG ACGTGGAAAG TCTCATGATT GAAGGAACCA TGACTTCAGT TACGTCCGGT ACCAACCATT TCGCCGACGC CAAAGCGACC ACGTTCAAGT CGACCATCGA TGTGCAAAAC TTTCACGCCC ACGACAAGAT TGTGGTTATT GTCATGGCCA CGGCTGATCA AAATTGGCAT ACACAAGATC CCAAGGACGC TGTTGGTCCG GAGAATAGTC CACCTCAATC GCACATTGCC AACGCCCGCA TCAATGCGGA TTGGTACTTT GCCAAAGAGA ATGGCAAAGT CGTTCAGGGC CGCCGCGAAT GGTTTTCCCA ACCTTTGACT ATCGAGATCG GTGAATTCGC CACCAATGGC ATGGGCGCCC ACGGAAGCCT CGTCGTCGAA ACCTTCGAAT TGTCGAATCG CCTCGGGGAA ACGACGGGCG GCGGTTTCCC CACTGGTGGT GTGCGTCCCA ACGCGGGCGT GTCTCCCGGT ACGATACAAC CACGGTCGCT GTTCCGAAAG GTCGCCGCCG TTGGTATGCT GGTATTTGCA GCGATCGCTG TGGCATACGG GGGGCGGCTC TACCTACGGA ACAAAATGCG GTCCAGTCGA CGAACGCAAA TTCGTAACTA CATTCAGGAC GAAAGTGCAC CGAGTCCGGG GTTGCGCGAT ACAGCGCGTG TCAACGGTGC CAGCAAGAGT GGATACGTTC GCTCGGCATT CCGAGACGAT TTGGATCTCG AAGAAGAAGA TCCTCGACGA GAGCAAAGAA GCGAAGTGGA ACTGGGACAG TACACTTAGT AGGTCCACTA CTTACCGCGG CAAACGGCAA AACCTAATCT GATTTGTCAA ACATCATAAA TTGTAGTCTA ATCCATGAGA CACTTTTGAT CGCAACTAGC CAGAGTAATA CATTT
|
Protein sequence | MGVVRNTAIR QSLLFPTQAF FSGTRLPSSI MERCRSLRIM MVLHLIVLVV LMSAPVNAAV FPGHSSSSTA PSTTTKGATT TYIRRELQEE KKQSIVQQYE LWEAEEISSQ IQKWASHYPD LLRVTTSQEA YGLPRAGGAD DCPFDKGGDG CLNYILTLQD FVKHPEGSAT SNQLPEVLWS GEVHGNERVG PTAVLEAAQL LMEAASCIAH PRVALRDNPS AWKLELTKAR SCREEMKHMG LDESHIQWLA RLVSTRRIVI IPTANALGYF RKVREEGNVD PNRDFPYDLT DPTLCMQSVA ARTLNEVYRE HLFQLALTFH GGMEAIGYEW GAPTWKSKKS PDDLAQQEIA DAYSRYAGGW FGTRNYQYGP MNDLVYPVRG GMEDWAYAGS WDTDRVIACR PTTFGGYPEA KTVYDNSTLR VFNMLVETSN DKTPLKDQLG TSLDVLNSDT TGNGHVSRNI RLALLSADLV QPYVTLQRVN DLHLSDDVVS LSREDGHSCQ GTRTVTISSE RPTVTLEWTV GGALQTHETQ VFYAKWSDLP LEKLDCVNTP NIQDVESLMI EGTMTSVTSG TNHFADAKAT TFKSTIDVQN FHAHDKIVVI VMATADQNWH TQDPKDAVGP ENSPPQSHIA NARINADWYF AKENGKVVQG RREWFSQPLT IEIGEFATNG MGAHGSLVVE TFELSNRLGE TTGGGFPTGG VRPNAGVSPG TIQPRSLFRK VAAVGMLVFA AIAVAYGGRL YLRNKMRSSR RTQIRNYIQD ESAPSPGLRD TARVNGASKS GYVRSAFRDD LDLEEEDPRR EQRSEVELGQ YT
|
| |