Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49624 |
Symbol | |
ID | 7198264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011691 |
Strand | - |
Start bp | 230979 |
End bp | 233546 |
Gene Length | 2568 bp |
Protein Length | 855 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184425 |
Protein GI | 219128448 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCACATC GAATGAGGTC TTTGTTTCTT GGACTCTGTC TAGCCGTGCT CGCTGTATTC GTAGACTCGT CGTCTCCCTT TGGCATTCAA ACCAATGAAG ACACAGCGAG TACCTTCGCG TCAAGCATGG TGTATGACGC TACGCTTTCG AGACTATACA TTACCGGTGC CACCTACGGT CGCGAATTTG ACAAATCTGG GAGCACTACA CCGATCTCAC AAAATACCAG CGATTGCTTT TTCGGCATTC TACAACTCCC AGTAAAAGCC GCCAACACCC TACCAGTCTG GATCATTCGC CAGCAGATAG GTCGAATGAA CACACAAGAG GTATGCTCTG CAATATACAC CACAGGAAGT GGAAGTGACA GAAAAATGTA CGTTCTTGGC CACTCTGTGG AATCTCCGAG TGTGCTAGCA CCTTTGAAAT TAGAGGGTCC GAACAGGACA GTCTATGGAA TGGTTCTTGA TTTAAACTGG AAGGGAAAAG TACATGGTGG CCATCTGATA GAGTCTGCAG CGGTTCAAAT CCCGATTGCT ATGCAATTCG ACGATGGCGA CTTGCTGATA GCCTCTCTGA AATCGGCTTC CCCTGAAGCT CAACAGGCCG TCCAAAGCCA ACAGTCTCAA GATTCTACTG CCGACCCTAC CAATGATGGA GCGTATTTCC TGTTACAGAA TGACAAACCT CTGTCTCTTT CTCTTCAAAG ACTACATCGT ATGGGTAATC GCTCAGTTGC CCCTCGTTTG GATTCCGGCA GAATTGAAGA TGGACCTATC CTCCAAACTC TAGATTCTGA ATGGTGGAAG GAATTTGCGA CGGACCAGCT CGCTAGTGTT CAACTTTCCA GCATGATTCG ACTGAACGAC TCAACTACGA TTCTTGCTGG TTCCACACGA GGTACAGGCA TTGCCTTTGG TGGTGGATTT GCGGCTAACG CATTGGACGG GTTCTTGATG TTGTTTAAAT CAAGAACAGG AAATACAGTG AGTTCGAAAA GAGTTTCTTC AAATGGAATG GATCGAATTC TCGGGCTTTG CCAAGCAGCC GACACAGGTT TCCTTTACGT AACTGGAATG ACAGATGGCA ATCTACTACA AGGTGAAGCT GTGCCGGTGC CTTCCTCGCA ACCCGGACAC TATCAAGCCT TTCTACATAA AATCAACGCA ACAACACTGG AAACGATTTG GACCTACCAG ATTGGAACAA TCCTCGTTGG AGATGATACG TTGCAGACAC CACAAGTACA TGGACTGGCT TGTGATGTCA CTTTGGACGA TAAACTGGTC TACATGACTG GAATCGTCAA AGACGGTGCC GTTTTGACAA CTAATGGGGT GACAGGTATT GTGCCGCAGA GTGCAGGAAA GGACGACATT TTTGTGGTTC AGCTGAACAC GGATGATGGC TCTCTTAACT TTGCCCTACA AATCGGTACA TCAGAAGACG ATACTCTTGC ATCTGGAATT GGGGTACTTT GCGATAAGGA CGGAAACGCG ATGTTGCTTT GCAATACGAA AGGTTCGATG TTTAGAGAGA AATCTGTGCA ACCGAACAAC TTTGTGCACA GCAACATAGC TGTTCTCTCA GTGGATAGAT TGTCAGGTGC TTTGGCATCT GAATTCATGC AGAATACCGG GGCACCTACT CTTGTTCCGA CGGTGGCCCC GATGGCTCCA CTACTCACCG TCATTCCGTC TTCAACGCTC TCATCCGCTA CGGCGCCTGC TCAGACCGTT AATGGATCGT CATCAACTGA ATTATCGAGT GAAGGTCCAT CTTTACCGTT CGTTGCAACT AACCTTCCAA CAGCTGTACC AGCACAGGCC ATAACTTTTT CGCAGAGACC CCAGATTTCG CCTATGCTTA TCCCAACTAA TCATCCTTCA ATAGAAGCTC GAGTATCAAC GAACGTGCCA ACTAGGGGGA AAGGAAGCTT TGTTTTAGAG TCAGACTTGG GCAGAGATGA AGACGAAATA GCGGCGATCA ACATAACCAA CCCAACTACA GACAATGCAG GAGTCAGCAC AGGTCGAGCG ACAAGTAACC TTTACATTAT GAGCGCGATC GGCTTGATCA ATATCATCAT GGGCAGTCTC ATCGTCATAC TCATTCTTCA GCGACGTAAT AAGAAAAAAG TTGGTGCTGA ACATGCTCTA TCTTCTCGTG GGTTACAAAG CCACAGTACC ACGGGATACT CTGATTCAAG TTTGCAACAT CACTTCTTTA ATGACTCAAA TCGAGAAGTA GAAGAACCAC TCTATTTGGA TGATGGAGTC TTCCACGATG ATGCCACTTT CCTATCCGGA ACTACGTTGT ACGAAATTCC AAGAAGCTTC GAAAGCAGCG AAAGTCGACT AGAGAATGTC GATACTTTAC TGAGACTTAA ACCTGCCTAT ACTATGGCGA CAAGGAACCG CGATGATGAA GCCTTCAAAC GGCTAAAGCA AACACGAACA CGAACCATCG AGCTTGAAAG ATTTGAGCCT GTACTCGAGG CCAACATTGC ACGCAATGGA ATTTACAGGC CGAGTAGGGA TCCTTTCGAT CTGTCACGTA ATGCGTAG
|
Protein sequence | MSHRMRSLFL GLCLAVLAVF VDSSSPFGIQ TNEDTASTFA SSMVYDATLS RLYITGATYG REFDKSGSTT PISQNTSDCF FGILQLPVKA ANTLPVWIIR QQIGRMNTQE VCSAIYTTGS GSDRKMYVLG HSVESPSVLA PLKLEGPNRT VYGMVLDLNW KGKVHGGHLI ESAAVQIPIA MQFDDGDLLI ASLKSASPEA QQAVQSQQSQ DSTADPTNDG AYFLLQNDKP LSLSLQRLHR MGNRSVAPRL DSGRIEDGPI LQTLDSEWWK EFATDQLASV QLSSMIRLND STTILAGSTR GTGIAFGGGF AANALDGFLM LFKSRTGNTV SSKRVSSNGM DRILGLCQAA DTGFLYVTGM TDGNLLQGEA VPVPSSQPGH YQAFLHKINA TTLETIWTYQ IGTILVGDDT LQTPQVHGLA CDVTLDDKLV YMTGIVKDGA VLTTNGVTGI VPQSAGKDDI FVVQLNTDDG SLNFALQIGT SEDDTLASGI GVLCDKDGNA MLLCNTKGSM FREKSVQPNN FVHSNIAVLS VDRLSGALAS EFMQNTGAPT LVPTVAPMAP LLTVIPSSTL SSATAPAQTV NGSSSTELSS EGPSLPFVAT NLPTAVPAQA ITFSQRPQIS PMLIPTNHPS IEARVSTNVP TRGKGSFVLE SDLGRDEDEI AAINITNPTT DNAGVSTGRA TSNLYIMSAI GLINIIMGSL IVILILQRRN KKKVGAEHAL SSRGLQSHST TGYSDSSLQH HFFNDSNREV EEPLYLDDGV FHDDATFLSG TTLYEIPRSF ESSESRLENV DTLLRLKPAY TMATRNRDDE AFKRLKQTRT RTIELERFEP VLEANIARNG IYRPSRDPFD LSRNA
|
| |