Gene Information |
Locus tag | PHATRDRAFT_39429 |
Symbol | |
ID | 7194946 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011687 |
Strand | - |
Start bp | 500717 |
End bp | 501772 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183495 |
Protein GI | 219126502 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.266609 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGGATCCA ACGTTACATT GCTCTTGTTG GCAGGAGCCT CGTTCGTCGT GTCACTGACT ACCTTTATGG AATCACGCAT GGTCCTACTC GCGTCGCACC GTACGGTAGA AGAAGTTGTG GCTATCGAAT CGAATCTGGC TGATCTTTAC AGTAGTAGAC TCGTCAGCAC GGAACAGTCG CTTGGTGCCC GTTTGAGCGA GACCAGTTGC ACACTATCAG ACATATCCTT TGTCCGAAAT CTATCGCTTT CAGTTTCACA GCCAACTTGG CTGGCTTCCT TTCCCGGTTC GGGCTCCGAA TTGCTCCGCG AACTCGTACA AGCCTCCACT GGATTCGCCA CGGACGAAGT TTACAACCGC GACACGGACT GTCAGGACTC CTCCGTGATC ATGTGCAAAA CGCATTGGCC CTTGCGCATC GGTGGTGATG CCCGGAAATG GGGTGCTCCA CCGGTGGCCC GGGCTCCTCG TTTTGCCAGT GATGTATTCG TGTTGGTCCG CCATCCGGCC GAAGCTTTGC CATCCTTTTT CAATTACAAG TGGGAAATGC AGCATCACGT TCAAGATCAT TCGCAACAAG GGACCGAAGA GGAATGGAAT GCTTGGAGGG ATCGACGTTT CGATCAAGAT TTAGAAAATT GGAAGCGACT TTTAAGGGTG TGGGCGACGC AAATGACACC CTACAACGTG GCGCACGTGA TTGCTTACGA AGATTTGGTG GATGAAAGGA AAGGTCCTCG TTTGTTTCAA ACCATCACTG AACATCTCCA AGCCACCGTT TCCCGACCCA TATATGGATT TGACGACCCA GAAAATGCCT TCAACGACAA CGCCGTTACA CAATGGACGT GTCTATGGAA AAAGATAGTG CAAGAACGGG CAGCAAAGAA GCGACGCACT CGAGGGTATC AGCCATCGTA CCATCTGCGG CAAAAGACAG CTTTGATCCG AATCCTGCAG GAGGCTATGG ACGAGCTGTC CGACTACCCC CTCTTTCTAC CAATTTTGCA ACGCTACAAG GACGATACGG AACGTATTCT TGTCTCGCAA AGTTAA
Protein sequence | MGSNVTLLLL AGASFVVSLT TFMESRMVLL ASHRTVEEVV AIESNLADLY SSRLVSTEQS LGARLSETSC TLSDISFVRN LSLSVSQPTW LASFPGSGSE LLRELVQAST GFATDEVYNR DTDCQDSSVI MCKTHWPLRI GGDARKWGAP PVARAPRFAS DVFVLVRHPA EALPSFFNYK WEMQHHVQDH SQQGTEEEWN AWRDRRFDQD LENWKRLLRV WATQMTPYNV AHVIAYEDLV DERKGPRLFQ TITEHLQATV SRPIYGFDDP ENAFNDNAVT QWTCLWKKIV QERAAKKRRT RGYQPSYHLR QKTALIRILQ EAMDELSDYP LFLPILQRYK DDTERILVSQ S
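
The length fields in this record can be cross-checked against its coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the unspliced CDS ends in a single stop codon (the gene sequence above ends in TAA). The function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
START_BP, END_BP = 500717, 501772

gene_length = span_length(START_BP, END_BP)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # prints "1056 351"
```

Both results agree with the Gene Length (1056 bp) and Protein Length (351 aa) fields listed under Gene Information.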