Gene Information |
Locus tag | PHATRDRAFT_45740 |
Symbol | |
ID | 7200770 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | + |
Start bp | 92030 |
End bp | 94626 |
Gene Length | 2597 bp |
Protein Length | 825 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179974 |
Protein GI | 219118400 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
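The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the coding region is three bases per amino acid plus one stop codon; the record states neither convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 92030
END_BP = 94626
PROTEIN_LENGTH_AA = 825


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein, optionally counting one stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


gene_length_bp = span_length(START_BP, END_BP)
coding_bp = coding_length(PROTEIN_LENGTH_AA)

print(gene_length_bp)              # 2597, matches the Gene Length field
print(coding_bp)                   # 2478
print(gene_length_bp - coding_bp)  # 119
```

Under these assumptions the span matches the reported Gene Length of 2597 bp, while the protein accounts for 2478 bp. The record does not explain the remaining 119 bp; one possible reading is that the listed gene sequence continues past the stop codon.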
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.635371 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGACAGT CGACCCGTAC TTCCCAGAAC GACTCTTGGG TACCCCGGCA TTTCCGGCAC GTTTCTTCTA CCCAAACGGT CCCGACATGC CCGCACCACC GACAATTTGG AGTCCCTACC GTAGGCTTCC CGATTCACCG ACGGATGCTG GTGGCCCTAC TGCTCGGGGT ACTGGGGACG GCGTCCGGGG ATACCAACCC ATTTTCGTAC ACCGCCAGTC GAGCAGGGAC CGTCTCGGGA ACCTTGCGTC AATGGGAAAC CATTACGTTG GCTTTGCGGG GACCTCCCAA CAGTAACAAC GACAGTAGCG CATCGTCGAA TCCGCGCATG GACAATCGAC TGGACGTGAC CTGGGCTCCC GAAGGATCGC GCAGTTCGCT CACCGTTCCG GGATATCTCG CGACCACAAG TACCAGCACC AGCTCCAATA CCACGAACAA TAGTACTACG ACGAATCCTG AGACTGACGG GAGCGTTTGG TTGGTGCACG TCACGTTGGA ATCGGCCGGA ACCTGGCACT GGGAAGCCTC CTTTCGAACC GGTCCCAACG TGGCTATCCG GAGCATGGAC AACACCGACG CCAACATCGA TACCGACACG GACACGAGCA CCTCGGGCTG GGGGGACCGC ATTACTGGAT CGATCGTGGT GGCACAATCC GCACACGAGG ATGAATTTGC GTCGGAAAGT GCCCCTGGGG AGACCTTTTG GGAACTAGGG AAACTGGAAT ACGTCGGAGC ACACCACGCT TTGTTTGCTC GGGGTCACGG ACGGTGGTTT CTGCAGTCCG GAGTCGACCT GGGCAGCACG CTGTTGCGTT GGTTGGACTA CGGCACGGAC GATTACCCCA CCGAAAGCTC TACCACTACT ACTAGTCTCC CACCAGGCTC TGCGTGGCTA ACACGTCACC AAGCCTCCTT CCCGATATCA CACCACGCTC CGTCGTGGAC TCACGAAATG GGGAATCAGG CCATGGGACT TCTCGAGTAC CTCGCTCGAC ACGGGGTCAA CGCCATTCGT CTGAACGTGT GGGACGATAC GGACGCCTCG ACATCCCACC AATTCCTCTT TGCGGCCGAC CAGGACCCCA ACAGTATCCG TGTCTCCCAA CTCGCACGCT GGCAGGTGGC TCTGGAGTAC GCGCAACGAC TAGGCCTCGC ACTACAGATT CAATTGCACG GCAGCGACGG TCCGGAAAAC GAATCACCAG ACGTTACTAC TGAACAACGA CGCAAGCTCT ACTACCGTGA ACTCGTGGCC CGTTACGGAC ATCATCTAGC CTTGTCGTGG CATTTGGGTC CCGCCGGACC CACGTCGGAA TCGCGAGCGA ACGATTTGCG TTCCATCGAT GCGTACCAGC ACACAATTGT GGTGGATTCC ACCACGACGG AACGAGACTT CCCTACTCAC AATTTACTGG GAGTCGATAC CGTGGACAGC GTAGTCGTGC CGACACCGAC CCGCAGTCTC GCTCTCACCA ACACGGTAGT CCGGGATGTG ACGACGTGGC GGGACTTGTC GCGAGCCCGC AAGCACCACT GGATCGTTCA CACCCAACAC GCGTACACTC CCAGTGCATC GTTGCCACCG CAAGTGCGGC TCCGGGACGA CGATGAATAC GTGCAAAATA TTCGCCGAAA CGTCGTATGG GGCAATCTCC TGGCCGGTGG AGCGGGAGTG GAATACGGTC TGCGCGAGGC GATTGCGAGC AATGGTGCCG ACACGCTCGT GGACCTGGAG CGTATATGGC TGTACACCCG ATTGGCGTTG GATTTCTTGC AAAACTTAAC CCCGCGGATC 
CCGTTCTGGT ACATGGTGAA CGAAAATTTT CGCGTCACGT CGTCGACTGC GGGGGCGCTG TGTTTGGCGC AGCCCCGAGG CGACGTGATG GTTTTGTATT TCCCCAAGGG AGGCACGGCA CGAGTGCATT TGCCCAAGGG TGACGACGGG ACGTCGTACA CAATTCAGTG GTACGACCCT CGTGTCGGCG GTGAGCTCCA AACGGGTAGT ATCGCGAGTG TGCAAACCGT AGCGTCGACC AAGGTGTCTC TTGGAACGGC ACCCAGTGTA CCGCATCAGG ACTGGGTGGT CTTGTTGCGT AGAGTAGTCG CACCGATCCC CGTCACTCCA AAGACCAGGA ATGGACCACG GCCGTGGTGG GGTGTAGGCA TGACGCTGGG GACAGCGTTG GTGTTGCTAC TGACTGGTGT GTTTGTACTC TACCGGTATT ACGGTCCATC GGGACGACAA CGACGAGGAC GGTGGAATAC CGATGCCTGG TACGCCCCAC GATCGGGACG GGGTCGTGTA TCGGCACTGT CCCGACGACG GGGCACCACC ACCCGTTCCA ACACCAGCGG TAGTAACAGC AGTGCGAACG TGAACGTGAG TGCGCATACG TTCGTCCCAC GCACGTACGA TTCCCATCTC CGCCAAGGTC CACCGGGAAG AGCCGGCCAC GGCCATTCCC TCGTGTAACG GACGCAAGGT GGGTGCTGGG GAACGGGGAC ACTGTGGGGC TAGTGGTTAT AAGAGTGGTT GTGGTTGTTA GGGTCGTTGT TACAAGAATC AATAAACGAA AAAAGACTAA CCGTATG
Protein sequence | MGQSTRTSQN DSWVPRHFRH VSSTQTVPTC PHHRQFGVPT VGFPIHRRML VALLLGVLGT ASGDTNPFSY TASRAGTVSG TLRQWETITL ALRGPPNSNN DSSASSNPRM DNRLDVTWAP EGSRSSLTVP GYLATTSTST SSNTTNNSTT TNPETDGSVW LVHVTLESAG TWHWEASFRT GPNVAIRSMD NTDANIDTDT DTSTSGWGDR ITGSIVVAQS AHEDEFASES APGETFWELG KLEYVGAHHA LFARGHGRWF LQSGVDLGST LLRWLDYGTD DYPTESSTTT TSLPPGSAWL TRHQASFPIS HHAPSWTHEM GNQAMGLLEY LARHGVNAIR LNVWDDTDAS TSHQFLFAAD QDPNSIRVSQ LARWQVALEY AQRLGLALQI QLHGSDGPEN ESPDVTTEQR RKLYYRELVA RYGHHLALSW HLGPAGPTSE SRANDLRSID AYQHTIVVDS TTTERDFPTH NLLGVDTVDS VVVPTPTRSL ALTNTVVRDV TTWRDLSRAR KHHWIVHTQH AYTPSASLPP QVRLRDDDEY VQNIRRNVVW GNLLAGGAGV EYGLREAIAS NGADTLVDLE RIWLYTRLAL DFLQNLTPRI PFWYMVNENF RVTSSTAGAL CLAQPRGDVM VLYFPKGGTA RVHLPKGDDG TSYTIQWYDP RVGGELQTGS IASVQTVAST KVSLGTAPSV PHQDWVVLLR RVVAPIPVTP KTRNGPRPWW GVGMTLGTAL VLLLTGVFVL YRYYGPSGRQ RRGRWNTDAW YAPRSGRGRV SALSRRRGTT TRSNTSGSNS SANVNVSAHT FVPRTYDSHL RQGPPGRAGH GHSLV
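The sequences above are printed in space-separated blocks of ten characters, so they need normalizing before any computation such as the GC content field. A minimal sketch follows; the helper names are hypothetical, and GC content is assumed to mean the fraction of G and C bases over the full sequence length, which may differ from how the record's 58% was derived. Only the first two blocks of the gene sequence are used here for illustration.

```python
def normalize_sequence(raw: str) -> str:
    """Strip the whitespace used for block grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence as printed in the record.
first_blocks = "ATGGGACAGT CGACCCGTAC"
clean = normalize_sequence(first_blocks)

print(len(clean))          # 20
print(gc_fraction(clean))  # 0.6
```

Passing the full gene sequence through the same two helpers gives a value to compare against the reported GC content.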