Gene Information |
Locus tag | PHATRDRAFT_48735 |
Symbol | |
ID | 7195023 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 103670 |
End bp | 106592 |
Gene Length | 2923 bp |
Protein Length | 820 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183421 |
Protein GI | 219126347 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.60554 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTTTGC CCGTTACTGC CTATCTAACC ATCGAATCCT GTTTAAAATG TGCAAACGGA ACAATATTGG TACCCAAAAA ATCTGTCACT GGTTGTCATG AGTATCAAGG TTACAATGAT GGATTCTTTT TTGTAATTAA CACATGGAAG TTATTGGGTC GCGTAACGTT TGGACGTATT CTCAATCGGC AAGCGGAGTT CGTTCTTTCA GCGAGCCCGC CGATTCGTAA GTAGAACAAG GTTGATGTTT GAACTGAGCT AGTCGTAGAG TAAGGTAACG AGTCCCCTCC AACGGTACCA CAATACCTTG CGCATTTTCA CTGACAGTGA GGACGATTAA TAGATTGATA AGATCAAAAG TCTTCAACAC GACTTTGAGA GTTCCGTTTG CTGCCTTTTG CTTTTTCGAG AAAGATGATA ATGTTTCGTT CCGTCGTAGC CCTTCTTGCG TTGGGCATGA CATCAGGCCA AGCGCCGACC TGTGACGTTG CGGACGCCGT GCCTAGCTTG CCGTTTCGAG TTTTGGAAGA TACGCAAGTG GTGAGCCCAA CAGCGCAAGC CGGAGCCGAT GTGTGTGGAA TCTTGCCCGC TAGTAACGTA CAAGGACATT GGTTCTCCTA CACAGCTCAA GCAGACGGCT GCTTGGACGC CGAAGTCATT GGTCTCTCCG CAAGTGATGC TATGACGCCG CCTCTTGACC CCATCCTGTT GGTCTATACG GGGACTTGCA GTGCGCTTAT GTGTACAGCC ATGGCCGACG ACATCTCCTT GCAGAACCTC AATAGTCTCG TGGAACTTCA AGCCACTGCT GGAACAACCT ACTATTTCCT GGTCACTGGT TTTGGGGGTG AAAGTGCTGG GCCCTTTTCT TTCACCATTG TGGTAAGTTG CGTGGTTTCG CGTTGCTGAT TGACGCTCTT TTGTACCCTA CGTTGGAATT GTACCTAACC ATCTTCATCC TTTTTGTCTC ATTTCAGCCT TCTGCGTCTA CTCAATGTGA AAATCCCTCT GCCAACCAAA TCTGTCCCGT CTGTCCCAAC GGTGGCGAGC CGGCTGCCGG CGCCCTCTTC GATGAAGACG TGCTGTGTTC CGATGCCGCG GTAGACGGCT CAATCCTCGA TGACGGTGGG GAAAGCTGCG CCATTTTGCA AATTGCTGGA ACCACCATTT GCGGATGCCC CACCTCTCAA GAATCTTGCC CACTTTGTCC CGGTGGAGAG GACGTTGCCA ACCCAGATCT GGTCGTTTTG GGCGATGGAA GCCTGACATG CGGAGTCTTG AACAGTCTTG ACGGAGCCGA CACCTGTGGA GCCGTTACCG CCGGGTTCGC AGCGGAATGT GGCTGCCCTG GGACAACCCC CTGTCGTCTT TGCGATGAAA CGGCAACCAA TCCAAATCCT GAACGCCTCT TGTTCAACTT TCCACCGGAT AACGCTCCGT ACACCTGTGC GGATGCCGAA GCGGATATTA GAGCCTCGGC TGTGCTCAAT CCCATTGAGC AAGGCTGCAG TCCAAGCCTT GTGGACTTCG TCACCGGATT CGATGTCGTT GACTTTTGCT GTTTCAACGG AGACTTCCCC GGTGATCTGA ACATTTCAAA TTGCGATTCT GCTAGCGTCA TTACGGCCTT GCCGTTTACG GTCAGTGGAA ATACAGGAGA TGCCACTCCA GAAGTGAACG CCCAGGCAGA ATCATGTGGC CTCCTGAACT ACGGAGAACA CCAAGGAGAA TGGTATACCT ATACCGCTGA TGCAAACGGC TGTGTTACCG TTCGGACTTC CGGAAACTTG GACTCCATGT 
TGTTTGTGTA CTCTGGAGAA TGTAGCGACT TAACGTGCGT CGCGATGAAC GACGACGCCG TTTTCACAGT CTCCGGAAGT GAACTGACCT TTGACGCGGT TGCGGGGACG AGGTACTTCT TTATGGTCAC GGGATCTTCC TCCGACGATG TTGACACCTA TACCCTAGAG ATTTCGGTAC GTGCTCGAAT ACCAACTATG CAAACAATTA AATTCGCGCC AACATTAAAA AGGAGCCGGC TTACCTAAAT ATGCCTCGTG TCTTACTACA TTCTTCCCCT CTGTTTTCGT TTCGACAGCA AATTGCAGGA AACTGTCCGA GCCCGACCGG GGGCGAACTT TGTCCCGTAT GTCCAGACGG TAGTGACCCC GATCCTACCG CATTTTACAA TGACGACCTT TTGTGTATTG ACGCAGCAGC CGAATTTGGA GTTGTAGACG GAGACTCACA AGATTGTGTG CTATTCCAAA CTATCGGAGC ACCCATTTGT GGTTGCGAAG TTGTTGCAGC AGACACGTGC AACTTGTGTC CGGATGGAGA AGATGTCCCG GCTGTCGCGG CCGACAAGAG TATTCCTGAT GCCCTGACGA CCTGCTCGCA GCTCAACAAT GTCGCAGGTA CCAGCACTTG CGGAGACGTC ACGGCTGGTG TAGCCAACTT CTGCGAATGT CCAAGCTCCA GTCCTATTTG CACTTTATGT GACGCCAGCT CCACCATGTT CAACCCTGAC CTTGTTCTGT TGGAAGATGG AAATTACACG TGTGGAAATG CCAACGAAGA TACACAGTAC TATTACCTTT GGTACCCTCT TGACGCCGAG GGCTGCAACC CAAGCATTGC GGTATCCTTC ATTGATAGTG GAATCAACGT GATTGACTAC TGTTGCAACG GCGGTCCTCT GACGGGAACC CTCGGTCCCA CGGCTTCTCC GGCGACTGGT CCAACGGCTT CCTCGGACGT CCCGACTACC GATTCCGGTG CCGGTGCCGG CTCCGGCAAT ACACCGGAGT CCACGTCGAC TTCCGTTGCA GTTTCGTTTC GAGGTGGAAT CGCCACAGTG TCCTTGCTTT GTTTGTTTGT CTTGCTCAAC TAATAGGTTA GACTATTCTT ACTGGATGTT TTA
|
Protein sequence | MTLPVTAYLT IESCLKCANG TILVPKKSVT GCHEYQGYND GFFFVINTWK LLGRVTFGRI LNRQAEFVLS ASPPIPLLAL GMTSGQAPTC DVADAVPSLP FRVLEDTQVV SPTAQAGADV CGILPASNVQ GHWFSYTAQA DGCLDAEVIG LSASDAMTPP LDPILLVYTG TCSALMCTAM ADDISLQNLN SLVELQATAG TTYYFLVTGF GGESAGPFSF TIVPSASTQC ENPSANQICP VCPNGGEPAA GALFDEDVLC SDAAVDGSIL DDGGESCAIL QIAGTTICGC PTSQESCPLC PGGEDVANPD LVVLGDGSLT CGVLNSLDGA DTCGAVTAGF AAECGCPGTT PCRLCDETAT NPNPERLLFN FPPDNAPYTC ADAEADIRAS AVLNPIEQGC SPSLVDFVTG FDVVDFCCFN GDFPGDLNIS NCDSASVITA LPFTVSGNTG DATPEVNAQA ESCGLLNYGE HQGEWYTYTA DANGCVTVRT SGNLDSMLFV YSGECSDLTC VAMNDDAVFT VSGSELTFDA VAGTRYFFMV TGSSSDDVDT YTLEISQIAG NCPSPTGGEL CPVCPDGSDP DPTAFYNDDL LCIDAAAEFG VVDGDSQDCV LFQTIGAPIC GCEVVAADTC NLCPDGEDVP AVAADKSIPD ALTTCSQLNN VAGTSTCGDV TAGVANFCEC PSSSPICTLC DASSTMFNPD LVLLEDGNYT CGNANEDTQY YYLWYPLDAE GCNPSIAVSF IDSGINVIDY CCNGGPLTGT LGPTASPATG PTASSDVPTT DSGAGAGSGN TPESTSTSVA VSFRGGIATV SLLCLFVLLN