Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_46341 |
Symbol | |
ID | 7201613 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 43919 |
End bp | 46897 |
Gene Length | 2979 bp |
Protein Length | 948 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180936 |
Protein GI | 219120394 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACAAGC CCAACCCAAC TCGGAATTAT CCGCGGCAGT TTAAGAATCC CGAGCTGGAT CCCAAGACAC ACCAAGCCAG CACTGCTCCT CTACCCTTAT CGAATGCTCC CGTCGTTGCT ACTACTACCG CTACTACTAA TAACACTATT ACTACTAGTC GAGCCGTGAC GGGGGTACCA GAATCCACCA GTGTTACTTC TGACGATCAT CGCAACGCGC CCCAACACAA GCTGACGGTG AATGCAACCA CACAATCGAC GCAAGAATCA GAACTCGTCA CGGTGGAACT CTCGCATCCG TACGCACCGT CGAATGAAAT CCGTGAGACT GTGGTAGAAG GCGATCGAGA CGCATCGGTA CGACGCAGAC CATCAACGGA CTCGTCCTTG TCCGAACCGC CGCCGTACAC GAGTTCGGAC GAACGGGAGA CAGCCGGCAA GACCGGACCG GCTTTGCGTC ACAAAGCATC ACCACGTCGA CTCGCACAGC GCCGTCGTCG ACCTAGACCA AACGTTACGA CTGTCGTGGA GCCGTCGATC ATCCAGTTGC GTCCAGCGCA GCGTCACGAC CAATCGACGG CCTGGTCCCC CGTAGACGAT GACACCGTCA CGGGTAACTG GAGTCTCGAC GACGATTTCG ACCATTTGCA AAATTCGCAA CGACGAGCTG ACCAACGTGG GTCCATGGAC AATCCCCGCT CCCGCCTCCC GTCAATCGTG TTGTCGGCTC AGGATTTGGC CCTCTGTCAG CGTCTGGATC AGGACTACGA ACGTGCTTTG GAATCACTAC AAGTCGGCTA CAGCGCTCGC TACTATTCCG TACGACAATC CGCACTGTGC AGCGTCATCT TCATGCTCGT ACTCCTGACC CTCGGGACTA TTTTCTTTTT GCGCCAAGCT CCCTTTTGGA GTCTGGAAGA GGCCCTACTT TTTAGCGTAT ACACCATTAC GACGGTAGGC TACGGGCACT TGCAGCATCC CGAAACAGCC GCCTTTCAAC TTTACACCGT CGCGTACATT TTTGTCGGTA TTGCCACCCT GACCATCATG GTGGCGCAAG TCTACCAATG CGTGGCCTTG GAAGCGGCTC GGGCGCAACA CGCGGCCGCC GACGAAGGCA ACCGTCGCAA CGCACAAATG CCGGACCGGG ACGGCATTGT CCGTGACCAG CGCCACAACC AGGACCCCCC GAGCCTCCAC CACCACGAGA CACGGAGCGA GAGTTTCTCG TCCGATATGG TGTGGGAATA TTCCAGCTCG GTCTGGGATT CCGTGACGGC CATGCTACGG CGAGCCTACC GCTACTTTCG CCAAGACGAA TTCGGTCGGA GCTTGTCGGT CATTTTCCCC ATGACTGGAC TCGTTCTCAT TGGAGCCGTT GTCATTGGCG TGTTGGAATC CTGGACCTGG CCGGAAGCCC TCTATTTTGC CGTCGTGTCG CTCACCACGG TGGGCTTTGG CGACTACTAT CCCACCAACC CAGCCGCCAT TTGGTTCTGT ACCTTGTGGT TGCCCTTTTC GGTGGGGTTC ATGAGCGTCT TCTTGGCCAA GGTGGCAGCC TTTTACATTC GACTGTCCGA CACCAACATT TCCCGGATTG AACGAGCCCT ACGACAAGAC CTGGTACAGA CCAAGCGCCA GGCGGCGCGT GAACGGCAAG CCGCTCTCGC GCGGGCGATG AGGGGACAAC AATTGCGTGA TATCGAGAAC GACCACGGTC ACAATGGTGA GAGTCACGAT TTGGCTTTGA AGGAATCAAT TGCGCTAGCG AAAGAAACAG TGTCACAACA TCGACGACGG AGGCGAGGTT TTGATACCGT CCCTACGCAA TCAGTCGAAG CCGGAAAGGC GGTGGTCTAT AAGGACAATG ACGGCGAAGA CTGTGAGGAT TCGACCGCCG ATAATAGCGC CGGTTTGTCG GTCGACTCGG AGTCTCGTCG ACACCTGTTT GGATCTCCGG AAGCTCAAGA GGATCCGGCG GAGACTCGTC GTGAGCTTGT CCTGCGCAAC AGTTTGGCGT ACAGTACTCA CTCCGAGGGA GATGAGCTGG TCGACGAGGA TCAGGAGCTG GAGGATGGTC GATCCGAAAC GAGTGACGGC ACCGTGTCCC GCCCTCGCGG CTCGACTCTG TCTACAATGA AAGACGTCTT ACGTACCGTA CACGGAAGTG ATAGGGATGT GCGGTACGGT CCCGATTCGG AATTCTTGTC CGTAACGTCG AAGCAGCCAC TCCACGCACA ACACCACGCG CTGCGCCGTC GATCCCAAAG CCTGCTAAAA CCGTCCTTTG CCTTGCGCGC GTTGGTACAG GAACGGTTTG CCGAAATAAT TGCGACCGAC ATTGCCGGTT ATCAGAACGC GATTGAAATA AAAGACAACT CCATGACCGT CACCATCCTT CGTCTCAAGG CCGTGGCCGA CAAATGGTGT GTCCCCAGAC GCGCCCGTAA GGCATTTCGG GCCGTCGCCT TTGAAACACT ATATTTCGTG GGCGAGCACG ATTTAATTGT CGAGGGCGCC GATGCCTTGT TTGCCTTGTC ACCGTTGGAA TTTCACAGCC TGTTTGCGCC GCTCGTGGCC GCTCTAGGAG ATGCTACGAC CATGGAAACG TGGCTGGAAC AAACGCAAGT CTTGGCGGAT GTGGACTTGA TCAGTCGCGA TGAGAGAGTG TCACAAAGCA TGCAAGAGCA ACGAAGTCGA CGACGGCCGA GGCGTTTGGG TCGAAACGAC TGGGGGGATG TTGAAGAAGA TGGAATCATT AAAGGGGAGG AAAGGCTATA CCGAACTACT GACAAGTCAA CACGGGGTAA TGCCGCGATT CCAGGAAGCG AGTTACATCT GACGTGACGG TAGCCTGTTC AAAGTCTACG TACAGGTTCT TTCAATTATG GAATGGTCGT TGTGGCAATT TTTTCTTATT CCGATCCATG GATCAGTTTG CCAAACCGTC CATAGGAAAA CGAAATACCC ATTCAAAAA
|
Protein sequence | MDKPNPTRNY PRQFKNPELD PKTHQASTAP LPLSNAPVVA TTTATTNNTI TTSRAVTGVP ESTSVTSDDH RNAPQHKLTV NATTQSTQES ELVTVELSHP YAPSNEIRET VVEGDRDASV RRRPSTDSSL SEPPPYTSSD ERETAGKTGP ALRHKASPRR LAQRRRRPRP NVTTVVEPSI IQLRPAQRHD QSTAWSPVDD DTVTGNWSLD DDFDHLQNSQ RRADQRGSMD NPRSRLPSIV LSAQDLALCQ RLDQDYERAL ESLQVGYSAR YYSVRQSALC SVIFMLVLLT LGTIFFLRQA PFWSLEEALL FSVYTITTVG YGHLQHPETA AFQLYTVAYI FVGIATLTIM VAQVYQCVAL EAARAQHAAA DEGNRRNAQM PDRDGIVRDQ RHNQDPPSLH HHETRSESFS SDMVWEYSSS VWDSVTAMLR RAYRYFRQDE FGRSLSVIFP MTGLVLIGAV VIGVLESWTW PEALYFAVVS LTTVGFGDYY PTNPAAIWFC TLWLPFSVGF MSVFLAKVAA FYIRLSDTNI SRIERALRQD LVQTKRQAAR ERQAALARAM RGQQLRDIEN DHGHNGESHD LALKESIALA KETVSQHRRR RRGFDTVPTQ SVEAGKAVVY KDNDGEDCED STADNSAGLS VDSESRRHLF GSPEAQEDPA ETRRELVLRN SLAYSTHSEG DELVDEDQEL EDGRSETSDG TVSRPRGSTL STMKDVLRTV HGSDRDVRYG PDSEFLSVTS KQPLHAQHHA LRRRSQSLLK PSFALRALVQ ERFAEIIATD IAGYQNAIEI KDNSMTVTIL RLKAVADKWC VPRRARKAFR AVAFETLYFV GEHDLIVEGA DALFALSPLE FHSLFAPLVA ALGDATTMET WLEQTQVLAD VDLISRDERV SQSMQEQRSR RRPRRLGRND WGDVEEDGII KGEERLYRTT DKSTRGNAAI PGSELHLT
|
| |