Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45748 |
Symbol | |
ID | 7200897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 124859 |
End bp | 126581 |
Gene Length | 1723 bp |
Protein Length | 497 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180181 |
Protein GI | 219118829 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTCCTT GCAACACGGA GCCCATACCG CTTCACGAAG GCGTGACATC GCGCCAATTG AACTGGCGGC AACAAAGCCT CGTGCGACGA GCTGTTTCCT TTGTGCAGGA CGATCTCCAG CATGTCTTGA GTTTTTCCCC CTTTCTCCGC TCTCCACCCC ATCACAACCA GAGCGCCCTT TTGCATCGAA CCGAGATTCA AACAGGCCAA CTTTTGGGCT ACGGTGGTTT TTCGGAAGTC CATGAAATCG TTGGATTTGC CCTGAATCCA AGCGTGTCTC ACCAATTGAC ACCCTTTCAG CAAAGAGCTC GCCTACTGTT ACAACGTCAA TGTATAGATC CAGTCACAGG ACGGGGTCGT TACGCGATCA AACATTTGCA GGCACGATTA CTGGAGAAGA ATTCCGACGA ATTTGCCCAT GCGGCATCAG ACTTGGCCGT GGAAGCCCAA TATCTTGGCG CCATTGAACA CCCGCATATT TTGTCAGTCC GAGGACTTCC AATTGAAGGT ATTCACGCGC TAGCAGACGG AAAGCACGAC GCATTTTTCA TAGTCACCGA CCGCTTGGAA GACACTCTCG ATAGACGGGT TCAGAGTTGG CAAACCAGAG ATGCAGACGT TCTGTACAAA GCCGAGTTGG CTCTTCAGCT CGCCTCGGCT TTGGAATATT TGCATGAACG AAGGATTGTA TTCCGGGACT TGAAACCCCA AAACATTGGA TTCACGTCAG ATGGTACCCT GAAGCTATTT GACTTTGGTC TGTGTCGCGA GCTGCCCTCG CCCGATCTTA CTTGTTTTGC GGACGTTTAC GAAGTCTTTA GCATGTCGGG AGTCGGCACC CGCCGGTACA TGGCACCGGA AGTTGCGAAC GAAGGCAAGT ACAACTGCAA GGCGGATGTC TACGGTTGGA GTATGGTAGT ATGGGAAATG TTGAATACCT CGAAACCATA TCCTACATAT TCGCTTGAAG ATCACAAACA GCGGGTTTGT ATAGAAGGCG AACGACCTCC CATCAATCGG TCGTGGCCTT TGCAGCTTCA AGGCTTACTA ACGCAAGCCT GGACACCGTT GTTGCCTGAA CGATTGACTT CACGAGAAGT CTGTGGTATG TTACACGGTA CGATCAATTC CATTAAATTA TCTTTATTGT TGGATTCGCC GGAGTCTCCG ATTGCGGTTG CGGAGTCGGC TTGGGCTATT GACCACACTT CAACTGCTAT CACATCGATG TCCTCCTCTT GGGGCCTCGC CAAAGCCAGT TGCTCTCCCG ACAGTATACG GCTCAGCTTG CCTCCAGATC TTTTGAATGC TTCGTACTCC TCAGACACTT GCAGTTTGTT GGATAGAAGC TTGATGGCCT TGACTGTCTC TTCCAGCTCG TCCATGGAAG GGTTTGAAGT GACCTCCACG GCAGAACCTT ATCAGCATCA TCAGTTTCCC TCCCAAGATG AAACGTATTG GGGCCGCAAC AAATTGGTGC AATACCATTT CTAACCGCAT GTGCCATTGA GTGTAAGAAA AAGTACATTA GTATCTAGTC CTTGTACAAA TATATATAAT TAAAGCCGCT CATTTTGTTT ACACTACACA TTCGACATGA ACTAATTGAT GTGCGGATCT CCCGATAATA AGTCACTATA GTAGCCCCAG ATGAGATCAA TGTGGCCGTA GTGATCGGCG TCAACGGGAT TAAGGAGAGA AGAGTATTTC TGGTCGGGCA TGG
|
Protein sequence | MAPCNTEPIP LHEGVTSRQL NWRQQSLVRR AVSFVQDDLQ HVLSFSPFLR SPPHHNQSAL LHRTEIQTGQ LLGYGGFSEV HEIVGFALNP SVSHQLTPFQ QRARLLLQRQ CIDPVTGRGR YAIKHLQARL LEKNSDEFAH AASDLAVEAQ YLGAIEHPHI LSVRGLPIEG IHALADGKHD AFFIVTDRLE DTLDRRVQSW QTRDADVLYK AELALQLASA LEYLHERRIV FRDLKPQNIG FTSDGTLKLF DFGLCRELPS PDLTCFADVY EVFSMSGVGT RRYMAPEVAN EGKYNCKADV YGWSMVVWEM LNTSKPYPTY SLEDHKQRVC IEGERPPINR SWPLQLQGLL TQAWTPLLPE RLTSREVCGM LHGTINSIKL SLLLDSPESP IAVAESAWAI DHTSTAITSM SSSWGLAKAS CSPDSIRLSL PPDLLNASYS SDTCSLLDRS LMALTVSSSS SMEGFEVTST AEPYQHHQFP SQDETYWGRN KLVQYHF
|
| |