Gene Information |
Locus tag | PHATRDRAFT_44864 |
Symbol | |
ID | 7199574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 473981 |
End bp | 477004 |
Gene Length | 3024 bp |
Protein Length | 492 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178783 |
Protein GI | 219115976 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCTCACAGTC GACATCCTCG AAACTGAAAG AGAAGCGTGT AGACAACCGC TTTACTGTTA ATCGTACTTT CTACTACAGG TCGACTTTTT TCTGAAGCGC CATGAGGCAT CACATTCTCG CCCTACCCTG TATACTTTCA ATGGCTACTG CTTTCTCTGG TACGCAACCG TCGATTCGAT CGTCTTCAAC AGCACCGTCG GCAGCGACGA TGCTGTCGAT GAGCGACAAT TCCAGTAGCA GGAGTGACGA ATTGAGTCGT CGCCAACTAG GTGAGCTGGC CTTTGCAGCT AGTGGACTCG GGGTGACGTA CTTCGGTACT CGCGAAAGAG ATCCCTTGGA CTACGGGCTT TGGGGTGTTC TACCAGTCGG CACCTACAAA AAGAAGAAAA CGATCCTTGA TACTATTATA CCGGATAATA TGTGGACCTT TGACCAAAAG TTTGGAATTT TGGACGTTCA GGTTCCTTTG CGTATGACCG TCACGCGATT ATCGTCCGGC GGTCTGTTTG TGTACAATCC CGTTGCGGGA ACGCCCGAAA TGGTTGGCAT GTTGCAGAAG CTTGTGGACC AGTACGGGCC GGTCAAACAC ATTGCCGTCG GATCGGTGGC ATTGGAACAC AAGGTCTATG CCGGAGTTTT GGCGCAGAAG TTTCCGTCCG CTCAGGTCTG GTTGACACCG GGGCAGTACT CGTTTCCACT CAATCTACCC GAATCCTTCC TAGGCTTTCC AAAGTCGCGG ACGCGCGCGA TGCCTGCCAG TATAGACGAC GCTCCTGAGG ACTGGAAGGC CGACTTTGAT GTAGCCGTTC TAGGACCAAT TATTTCGCGT GATGGAGCTT TCGCCGAAAC TGTATTCTTT CACAAGCCGA CCAAAACCTT ATTGGTGACC GATACGGCGG TACAAGTGAC AGAAGAGGTA CCCGCCATAT ACGACTCGGA TCCTTCGCCA CTCCTATATC ACGCCCGCGA CACAATCACC GACAACGTAC AAGACACACC CGAGACTCGT AAAAAAGGAT GGCGGCGTAT TGTTTTGTTC GGACTCTATT TTACGCCGAG TGCGATCACT ATTAAGGATT TTCAGTCAGC AATTCAGGAG CGTCGACCAG ATATCAACTC GGACTTTGCA GGTATTTATC CGTGGGACTG GGATGGTGAC GAAATCGCTA GTTGGCGTGC TCTTACCGGT GACGGCACCA AACCACTGGT TGCCCCTATT CTACAAACGC TTCTTCTGAA TCGCAGCCCG GTAGAGGTTC TGGACTTTGC CGATAAGGTA TCGCAATGGC CATTCACCCG CATCATCCCG GCTCACTTGA AGAACAACAT TGCCATGACT GGTGATGAGT ACCGGAAATC GTTCGGGTTT TTGGAAGAAA AGGGGGTGCC GCTAGGCTAT CCCAAACCGT TGCAGTCGGA TTTACAATTG CTTTTGGATG CGGAGCAAAG TCTAGTCGAA TCCGGCGCCA TTAAACCTGC ACCTCCTAAA GTTGGAGGTC AATACTCCCG CGCCGAGATT ATTGCAAAAA CGTCGTACCA ATGTCGAGCC GGAACCTGTG CCCCCCAGGC CAATCCGTAA AAGGATACAC AAAGCGAAGT GTGCGGCATC ATAGCTAGTG CACCAACAAT ATAGAGAGAA AAGACACCTG GACGACGACT TGATTACACG TTGGTTGGCC ATTTTAACGG AACAGTACTC CACATTAGCG GAGAATTTTC ACCGACGCAG TTTTTGACGA CTCTCTGACT TCATGTACTC CGCAGACGCT ACCGCTACCT CTGTCGTTCG ACGTTGCCCA 
ATTTTTCGCT GTCTACGATA TTGCGAGCTC CACGTTTCCT GTAACTCGTT GTTCTTCCAC CAATTGTCTT CCACAACTGA GCATACTTCA TTCGACTGAT CTCGTTCAAT GTCGCTCAGT ATCCCTGATC GTCTCCGGTT AGCTCCGGCA CAAGTCACAA ACTCCGGCCC ACAACGCTTT TCGTCGCAGT ACGTACAACG ATTCAAACTA GTCGGATGAT CGTCTTCGGC CCCGGTCGAG TCTACTTGAG AAAAAAGTCC ATGCCAGAGA CAGACAAAGA CACACTTGCG CAGATCGCAC ATTATGTTCT CTACCCAGCA ATCATTGCAA TCCGACGTAA AGCCAACACG GTCTTCCATG CAATTGCTGG CCGTTTTGCG ACCCCAGATT AGACCGCGTT TGGCGCACGC CACGGAATCG TCGTATAGGG TATTTTTGGT TTCGTCGTAG ATGCCCACGT CACGGGGATT GGAGCATTGT CCGCAATCAC CGCAGTGTGC GATAAAGGAA TTCCCCAATG TGGATGTGTG TGGAGGACTC GTTTGCTGTC TAACCGCTTC TATTACCGAA GCGTTTGGCC AGGTTTGTAT CACAATAGAA GATGCGATAG TCGCATTATC CTGGGCACTG GTGCTGTTCC TATCGTTGTT GAAGGTAATG CCACATATTT GTGGTTGCTG GTAGAACCAC AATGTATCGT CAGCTTGACG GACTTTAATT TCTTGGAATT CCATGCCCAG CCAAAGCACC AAGATGAGGC TGACGAACCC TAGTAAGCCA TACTGGATGA ATACGCGGAC GAGACGCCGG GTGCGCGGAT TTATACAGCA CGTGGGAGGT ATTAGGCGCG CCAAGTGGAC ATTTGGCCTT TTGCGGACCA ATGGCCAGCA ATTCAACCAG CGGCGACGGT AAGGCATCAC CACAATCAAT TCCGAAAGAG TTGTTCCGAT GGCCAGCGGG CCGAGAAGAG AGTTCAAAAC TTTGTCAAAT ACTTACCCTT CGATTTGGTA CGAAACGCTC GACGGCTGGA TAGCAGGAAG CGAATTCCTG GAGTCGAAAC GTACCGGAGT CTTCGATTGT GTGATTGTGA AAAGTTTATT ACAAAAGCTC CAGCAACACC ATACTCATGG CAAAGTTCTA GTGGCGAAGT GCATCCTCGG ATGTCGTCGA AGGAAGTTTG TTGTGCCTCT CGTTTTTCGT TTATGGTGAG AAGGAATTCG TTTC
|
Protein sequence | MRHHILALPC ILSMATAFSG TQPSIRSSST APSAATMLSM SDNSSSRSDE LSRRQLGELA FAASGLGVTY FGTRERDPLD YGLWGVLPVG TYKKKKTILD TIIPDNMWTF DQKFGILDVQ VPLRMTVTRL SSGGLFVYNP VAGTPEMVGM LQKLVDQYGP VKHIAVGSVA LEHKVYAGVL AQKFPSAQVW LTPGQYSFPL NLPESFLGFP KSRTRAMPAS IDDAPEDWKA DFDVAVLGPI ISRDGAFAET VFFHKPTKTL LVTDTAVQVT EEVPAIYDSD PSPLLYHARD TITDNVQDTP ETRKKGWRRI VLFGLYFTPS AITIKDFQSA IQERRPDINS DFAGIYPWDW DGDEIASWRA LTGDGTKPLV APILQTLLLN RSPVEVLDFA DKVSQWPFTR IIPAHLKNNI AMTGDEYRKS FGFLEEKGVP LGYPKPLQSD LQLLLDAEQS LVESGAIKPA PPKVGGQYSR AEIIAKTSYQ CRAGTCAPQA NP
|
| |
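The length fields in this record can be cross-checked against the coordinates and the protein sequence. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (consistent with the listed gene length) and that the spaces in the sequence fields are only display grouping. The helper names `span_length` and `ungroup` are hypothetical and are not part of any database tooling.

```python
# Sketch: sanity-check the length fields of the PHATRDRAFT_44864 record.
# Assumption: coordinates are 1-based and inclusive.

START_BP = 473981
END_BP = 477004

# Protein sequence copied from the record, still in blocks of ten residues.
PROTEIN_GROUPED = (
    "MRHHILALPC ILSMATAFSG TQPSIRSSST APSAATMLSM SDNSSSRSDE "
    "LSRRQLGELA FAASGLGVTY FGTRERDPLD YGLWGVLPVG TYKKKKTILD "
    "TIIPDNMWTF DQKFGILDVQ VPLRMTVTRL SSGGLFVYNP VAGTPEMVGM "
    "LQKLVDQYGP VKHIAVGSVA LEHKVYAGVL AQKFPSAQVW LTPGQYSFPL "
    "NLPESFLGFP KSRTRAMPAS IDDAPEDWKA DFDVAVLGPI ISRDGAFAET "
    "VFFHKPTKTL LVTDTAVQVT EEVPAIYDSD PSPLLYHARD TITDNVQDTP "
    "ETRKKGWRRI VLFGLYFTPS AITIKDFQSA IQERRPDINS DFAGIYPWDW "
    "DGDEIASWRA LTGDGTKPLV APILQTLLLN RSPVEVLDFA DKVSQWPFTR "
    "IIPAHLKNNI AMTGDEYRKS FGFLEEKGVP LGYPKPLQSD LQLLLDAEQS "
    "LVESGAIKPA PPKVGGQYSR AEIIAKTSYQ CRAGTCAPQA NP"
)


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span."""
    return end - start + 1


def ungroup(sequence: str) -> str:
    """Remove all whitespace used to group a sequence for display."""
    return "".join(sequence.split())


if __name__ == "__main__":
    print("Gene length (bp):", span_length(START_BP, END_BP))
    print("Protein length (aa):", len(ungroup(PROTEIN_GROUPED)))
```

Under these assumptions the two values should agree with the Gene Length and Protein Length fields listed above.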