Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38016 |
Symbol | |
ID | 7202721 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | + |
Start bp | 670869 |
End bp | 672197 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181950 |
Protein GI | 219123269 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0971741 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGAGTCC GCCGACCCTC GTCCACCGGG TCATTCAGCG CTAGCACCAC AAACATGCGA AGGAAGTTGT TGCCATTCAA TCCGGTGGGA CGCAGCAGTA TGTTGTGTTC GACGGAATTC AATCCTTTGC AAACGTTGGT TTGGAACGCA ACCTCGGGAT TTTATGACGT TCCAGACAAC ACGATCAACA CTACGAATAC TACGGCTAGT ACAACGAATG CTGTTGGTGG AAACACTGGA CAACGGTGGC TACAATCCTT GTTTTCGTGG ACCAACGAGG ACGAATCGTT ACAAAATTCC GAGGACGTCA GCGGCATTCA CAGCCGTTCA CGTCGCGACG CTCCTATTCA AACGGAACAA TCGTCCCGTC TCTGGACGAG CCACAAGGCG GACTATGATT ACCAGGGACG TGTGGATTCT TACGTGCCTC ATCGCCGTTT GGAGGATCCA CCGACACCGG TCTTGTACGG CCGTCAGTGT TTGTGCTCCC CGCTCCCCGA CACCTACTGC CCCATCGGGG CGCACCACTG CAAGATTGCC TTTACCGACG CCTCGGCTCG CCGGGATATC TTCGAAATAT CTTGCGCCGC CGATAGCAAA GACTCCTTTG TTCGCTTCGT TGTGCCCCTC ATGTTTTTCT TCTGGTTCCT CGTACTCTGT AGCTGCGTCT ACTCGCCCAA GGGTGCCTAC GCACGAGGAT ACCTCCAGCG CGTTGTCTTT TGCTGGCAGA CACCCCGGTA CGAACAAGCC CTGCAGGAAA ATCTGGATCG CATCGTACGA CGCAATCGCC AACGCCGCGA AGCACAGGCC CGGATTCGAA GGCGGACGGT CGCGAGCCAC CGTGTGGTTC TGGGACGCGA TCGCCCTCCC GGCACGGACT CCTTATACCC ACCGCAGTTG GATCACAGCC GTCCCGGTCG TCGTGTCGGT AGCCCTGTTG TTACCCACAC TACGGATCCA CCGACGGAAC TTCCACCCGC AGACGTGGAT TGGACGGCTC AAGCCGGTAT GGAACTGGTC CAACGGTCCG TGGTGGTCTT ACGCACCCGG CGGTACCGGG ATCGACCGAC GGTCCACCAG TCCAGTGACC ATGCCGACGC GGCGCAGCAG GAAACGGCAC CGCACGATGA CGTGTGTGCC ATTTGTCTCA ACGCCTTTGC CGACGCCGAT CGTGTGGGTG ACTTGCAGTG CCAACACGTC TTTCACGTCG ACTGCCTCAA GTCGTGGATT CAACATAAAA ATCACTGTCC CTTGTGCAAG GCGGATGATC TCGCCACTCC TCCCGAAGCA CCGCGTAGCA GTCCCAATTT GGAACGGCCA TCGTCGTAA
|
Protein sequence | MGVRRPSSTG SFSASTTNMR RKLLPFNPVG RSSMLCSTEF NPLQTLVWNA TSGFYDVPDN TINTTNTTAS TTNAVGGNTG QRWLQSLFSW TNEDESLQNS EDVSGIHSRS RRDAPIQTEQ SSRLWTSHKA DYDYQGRVDS YVPHRRLEDP PTPVLYGRQC LCSPLPDTYC PIGAHHCKIA FTDASARRDI FEISCAADSK DSFVRFVVPL MFFFWFLVLC SCVYSPKGAY ARGYLQRVVF CWQTPRYEQA LQENLDRIVR RNRQRREAQA RIRRRTVASH RVVLGRDRPP GTDSLYPPQL DHSRPGRRVG SPVVTHTTDP PTELPPADVD WTAQAGMELV QRSVVVLRTR RYRDRPTVHQ SSDHADAAQQ ETAPHDDVCA ICLNAFADAD RVGDLQCQHV FHVDCLKSWI QHKNHCPLCK ADDLATPPEA PRSSPNLERP SS
|
| |