Gene Information |
Locus tag | PHATRDRAFT_40031 |
Symbol | |
ID | 7195503 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 641059 |
End bp | 642501 |
Gene Length | 1443 bp |
Protein Length | 480 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183908 |
Protein GI | 219127367 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.686468 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACATTG CCGTTCGTCG ACGGCGTAGA CCGCAAGAAA TGGTCTGCTG TTGCCCACAC AGTTGTCTCA CTCTGACGAG TCGAACAAAA TCAAGGCGCG GGGTATTCTG GCTGCACTTT TGGACGGCCG TGGCTATTGT ACAAACCGGG ACAAGCCGAC CCGATCCTTC CTCTCCCACC GTGGGCAATC ATCAGAAAAA TCCGTTACAA ACTTGGATAG ATGCTCGTTG CGGACCCAAC GGACACGCCG TTTGGGCGTA CTCTGGCAGT CTCTTCGATC CGCTGAACGG AAAAAAGATT GCGAACGTGG AAGGTTTGGA ACTAGTTCGC AGACTTGCCG AGACGGACGA CGACGTCGAG CACAAGCGAA GGTACCAACG ACGCTGTGGG GACTTGAAAA TGGCCCAGGC CATCTTGCAG GAGAGCTGCT CTTTGGATTA CGCGGGAACA ATCCTTTCTC GCAAAATATT CTGCTACAAA CCTGTCGACG ACCCGAAATC GCTCTTATCG TCCGTGCGAT TACGACCTCA GGGTCCAGAA AAAGCAATTC CAACGGATCA AGCAGCAACG GTCTTCGACA CGGCTATTAC GGTCATTCAG AAAGGCCCAT CCTGGTTTGT TCATGGTGAA CTCCCCAACG GAAACGTTGT ATGGAATCAG GCCGAAGTTA AGCTAGCAAG CGGAGAGTCA TCTCGCACAC ATTTTGATTA TACTATTTAC AGTCGACCGC GGCATTCAAA ACGCCAACAA ACTCCAGACT TGACAAGCCA AGCTGTACAA GAGACCGTAT CGTCGGAGTC GAACAGCATT TCTCCCGCGC GATCGTCTCT CATATCGTTC GGACCGAGCA AGGCGGAAAG TCTGGGCAAA TTTGGCGCGC GGGAAACCTA TCAATACACG ACCGAGGAGA CGGGTGCCGG CGGACACTTG CTCGACTCTT TATGGGGACG AATGCAATTT GCTGTCTCGT CTGTTTTCCA AAGAGACAAA AAGGCTAACG CATTGATCGT GCCAACTCGC TGCAGCGTTC GATACACACG ATACGGCGAA GGTCCCGTTT GGTACGGGCC GAACCGATTG TGTACCTTGG AACTCCAAGG GCATCGACTA GAAAATTTAT CCCAGGCACC GCAGCTAGCC GCAACCATTG CCGCCACGTG TGTCCCGGGT TTCTTATCGA CTCATTCCGC CGTTGCCCAA GACGACACCG GGGCTCGAAG AGCCGTGGCG TGGTTCCGTG GAGAAAACTC GGTCCAACTA CAGATCACGC ACGACTACAA CAATGCCGAC AGTATAGAAA GGTCACATTC AAGTGGTGTT CGGGGTTTAT ACGCGAAGGG AGCGGCTGTG ATTGAGCGTC TACATGCAGC CACGACAACC AGCACTGGTG GTTCACTGTC AATCTATGAA GAAGATTCAA GCTATTTTAA AACGGAAAAG TAA
|
Protein sequence | MNIAVRRRRR PQEMVCCCPH SCLTLTSRTK SRRGVFWLHF WTAVAIVQTG TSRPDPSSPT VGNHQKNPLQ TWIDARCGPN GHAVWAYSGS LFDPLNGKKI ANVEGLELVR RLAETDDDVE HKRRYQRRCG DLKMAQAILQ ESCSLDYAGT ILSRKIFCYK PVDDPKSLLS SVRLRPQGPE KAIPTDQAAT VFDTAITVIQ KGPSWFVHGE LPNGNVVWNQ AEVKLASGES SRTHFDYTIY SRPRHSKRQQ TPDLTSQAVQ ETVSSESNSI SPARSSLISF GPSKAESLGK FGARETYQYT TEETGAGGHL LDSLWGRMQF AVSSVFQRDK KANALIVPTR CSVRYTRYGE GPVWYGPNRL CTLELQGHRL ENLSQAPQLA ATIAATCVPG FLSTHSAVAQ DDTGARRAVA WFRGENSVQL QITHDYNNAD SIERSHSSGV RGLYAKGAAV IERLHAATTT STGGSLSIYE EDSSYFKTEK
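The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends with a stop codon (the gene sequence above ends in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # subtract the stop codon


def strip_blocks(seq: str) -> str:
    """Remove the spaces that split the displayed sequence into 10-character blocks."""
    return "".join(seq.split())


# Values copied from the Gene Information table.
length = gene_length(641059, 642501)
print(length)                  # 1443
print(protein_length(length))  # 480
```

Both results agree with the Gene Length (1443 bp) and Protein Length (480 aa) fields listed in the record.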
|
| |