Gene Information |
Locus tag | PHATRDRAFT_47058 |
Symbol | |
ID | 7202132 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 299208 |
End bp | 301144 |
Gene Length | 1937 bp |
Protein Length | 606 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181347 |
Protein GI | 219122008 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGTAAAATA CGAAATCTGC ATTGATAGGG CCCGGCCTTT ATTTAGGAAT TGAGTTCCTA TCTTTACGAT CCGTCATGGC GATCAAGAGC TCTTCTAGTA ACGTGACAAG CAAGCGCAAG GATGGGGACA GCAAGTTTTC CTCGCAACCC TCGAAGAAAA GTAAAGGAAG TGGTGCCAAA CACTCTACAG TGCCGCCTTC TACAGTGGGT TCGAAACGTG CTGTTAAGCA GGAGCGACAG TCCCAGCGCA AGCACGCCGA CGTTGTGAAC GACGCCAAGC GGATTTGGAA TCAACTGCGT CTCAAAACAA ACACGTCCGG ACAGAACCGT CAATATATGG ACACGCTCAT GCCCTTGATT ACGGGCAAGG CCAATGAAAT CGCGTTACAG CACGACGCGG CGCGTGTCGT TCAAGCCGCG ATTCAGTTCG GAACGGTTGA AGAACGGCGT CTAATATTAC AAGAACTGTG CGCGAAGCAA AACAACTTTG CGGAACTCTG TAAATCTCAA TACGCACACT TTTGCGCTTT GAAAGCGATC AAGTATTGTC ACAGCGACCC AGCCTCCGTG AAATTGATCA ACAAAGCACT CAAAGGACAT ATGCCTAGAT TGGCTGTGCA CGCCGTCGGA TCTCGGGTTG TACAGTCAAT TTTCAGCACA ATGACTCCAA AACAAAGTGC GGTGCTCAAG CAAGAGTTTT ATGGGCCACA TTTTGCTTTG TTTGCGCTGG ACCTGCCACG AAATGATGCT GTGCCGACGT TGGCGACGAA CATAGCTGAG GCGCCGGAAA AGAAAGAAGC AACACTCATA TTTGTACGCA ACTTGATCAA TAAAGGCATG GAAAAGACTC TCTACGGTTT CACTTACTTT CAGGATCTGT TTGCCGAGTA TTGCGAGGTG GCCGACCCGC GAGAGATTCG CATTTTGGCG GGCACGGCGG CCGACAACTC AATTCATCTC TTATCTGGTC GCGCTGGGAC TCGCGTGGTT GCTTCCTTGA TTTCCTATGG CACTGCTAAA GATCGGAAGC GCATCATGAA AAGCTTGAAA GGCTACACCA AGTCAGGACT ACTGCATCAC GATGCATACC TAGCAATCAT CCGTTTGGTT CAGTTGACGG ATGATACAGT GTCTATTCAC AAAAACATTT TCAACGAACT GCTCTTACCG AGCGATAAAT CCGACGAAGA ACTATCTTGT CCACTTTTGG AGCTGGCGCT TTCCGATACC GGCTCCAAAC TTCTTTTAAT GCTTCTTGTT GCGGATCCGG AAACATTGAA GAAGTTTTTT GATCCCTACG AGCTTTCGGT ACTTTTCGAA AATCCGACTG TGATAGACGA TGGTCAAGAA GTTCTGACAA GCAAGAAGGA GCCAGAAATA CGACGAAAAG AACTCATCAA GTATCTTCGA GAGCCGTTAA TTGAGATGTG CGCCAAAAGT GCGAATGAGT TGATCAGGTC TCGGCCGGGA GCTCTTGTAC TTCGAGAAGT GTACCACTCG TATCGACCGA TCTCGGTAGT TGAAGCGATT GTTGGAACGT GCCAAGCAGC ATTGAATCAG GATTCCTCGA AAGGCGACGA AAAGGATGTC CAGAATTATC GCCTCTTTGA AGATAGAGAT GGACATTTGG CAGTGAAGAA TCTTTTATTG GCTGATTCCG CGAAAGAATC AGAGGCCAAG CTGGCGTCAG CTTTTTTCGA AACCTTCCAA GATCGACTGA TGGAGATTGC CCAATCCAAT CGCGGGGCTT TCGTTATTAC AGCTCTTTGT AAAGTTTTGG CTGTAAGAAA AGGCGCTATT TCCAAGTTGA ATCAAGCACA ACTGAAAAAA TTGGCCGACG GCAAAGGTGC TACTGCGGGG TTTAAAGCAC TAGTGAATGA GATGGATGGC AAATAGCTTT TCTGCGTAAT TTCTGCATCG TAACGATTAC ATATCAT
|
Protein sequence | MAIKSSSSNV TSKRKDGDSK FSSQPSKKSK GSGAKHSTVP PSTVGSKRAV KQERQSQRKH ADVVNDAKRI WNQLRLKTNT SGQNRQYMDT LMPLITGKAN EIALQHDAAR VVQAAIQFGT VEERRLILQE LCAKQNNFAE LCKSQYAHFC ALKAIKYCHS DPASVKLINK ALKGHMPRLA VHAVGSRVVQ SIFSTMTPKQ SAVLKQEFYG PHFALFALDL PRNDAVPTLA TNIAEAPEKK EATLIFVRNL INKGMEKTLY GFTYFQDLFA EYCEVADPRE IRILAGTAAD NSIHLLSGRA GTRVVASLIS YGTAKDRKRI MKSLKGYTKS GLLHHDAYLA IIRLVQLTDD TVSIHKNIFN ELLLPSDKSD EELSCPLLEL ALSDTGSKLL LMLLVADPET LKKFFDPYEL SVLFENPTVI DDGQEVLTSK KEPEIRRKEL IKYLREPLIE MCAKSANELI RSRPGALVLR EVYHSYRPIS VVEAIVGTCQ AALNQDSSKG DEKDVQNYRL FEDRDGHLAV KNLLLADSAK ESEAKLASAF FETFQDRLME IAQSNRGAFV ITALCKVLAV RKGAISKLNQ AQLKKLADGK GATAGFKALV NEMDGK
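
The coordinates and lengths listed under Gene Information can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `cds_length` are hypothetical, and it assumes 1-based inclusive coordinates, an assumption that agrees with the listed Gene Length of 1937 bp.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def cds_length(protein_aa: int) -> int:
    """Coding bases implied by a protein length, counting the stop codon."""
    return 3 * (protein_aa + 1)


# Values copied from the record above.
length = gene_length(299208, 301144)   # Start bp, End bp
coding = cds_length(606)               # Protein Length in aa
non_coding = length - coding           # bases not accounted for by the CDS

print(length, coding, non_coding)
```

Under these assumptions, the listed gene span is longer than the coding region implied by the protein length. Because the record marks the gene as not spliced, the difference would correspond to non-coding sequence flanking the coding region in the listed Gene sequence.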