Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44116 |
Symbol | |
ID | 7204043 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 1024742 |
End bp | 1027015 |
Gene Length | 2274 bp |
Protein Length | 757 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186450 |
Protein GI | 219113733 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGCAA AGGTGGCTTT CAGAGCGTTT GATGACCCTC AAATGGAAAG TGACGAAGGG ATTTTTATCA CGAAACAACA CAATCTTGAA TCCGAGCGTC CCTCAGCGCT ACGCCGAGCG AGGATCAGCA TCAGGAGGAC TAAGAAGATG CCGACACATG CGAAAAATGC TCCGAAGCAA TCTCGACCGT TTGGTTTTCG CGCCATTCGT AAGGGGTCCC AAAAAAAAGG AGTTGCCGAC GACGGCAACG GCAATAATTC TGACGACGAT CCTGTTATGT TGCTGTTCGA GGCTCGTTCC ACACAACGTG TTCGCTTTTC GTTCGAAGAT CCAAAAGAGT CGCGCCGCTT TACCAAAAAA GAGAATCTGG GCGGGACTAA CACCCCCACA AGTTCTGCGG TCAGTTCGAG TCCGTCTTCC AGTATAAAAA AATTCATGAA AGGAGCCCTG TTCTCCAAAG GCAGCAAGAT GAGCGAAGAA AACGAAGTTT TCGATATCGA GTTGCGTAAA GAAATTGAAA CTGCAAAAAA GGATTCCGGT CTAGTATGGG AAAATGACAA TGCTTCTTTA AGTTCAGATC TATTGTTCCC AGGGAAGCAA CAAGAAGCTA CCTTGCTCTC CTTTTGGGAA GATCAGGAAA CACGGGCAAA TGTCATCAAA CTCACAAACA AGGCACGGCG TGCGCAAGTC GTACACTTTC GTTTTGAGTA CGCTGTGAAG TGCTACGTCA ACGCCTTAGA ACTCCTGAAG GCTGCAAAAT ATCCAGACGA CCACCCGATG GTGGTAAAGA CGATGGTTTC GCTTAATCAT GCTCATCACG CCGTGAATTC ATACAAAAAC TCGGCAAACA TTGTCAAAAT GGGAATCAAA TACGAGGACG CCGGGGAGCT TGTTCGTGCT TTAAAAATGT ATACCATTGC GTACCGAATC CGTCGAGATA CTCTGAGCCG GGCGCACCCC AGCTTGGTAG TTTTATTGAA TATGCTCGGA ACCATTCAAA TCAAACGTGG TGAGCTTGAA GAGGCGATGC AAATCTATGA GCTTGCGCTC AGAGAAACTC CGCCAACCCT TGAACTGGAG GGAGCCAAGG AAGAACTTCA TGCAAAACCG GACAACTTAC TTGCTCGTAG CGTCACCTTT CGCGAGATGG GTACTATTTT CGAACGTTGG GGGAAAAATG TGCAGGCTCT TAAAAAGTAT CACGAAAGCC TTGACTGTGT CGCCCAATAT AAAGGAGTAG TCATCGCGGA CCATGCACAC GAGAATGAGA ACGTCCACCG TTTGCAACGC TCAACGAGAG ACCAAAACCC GGACACAAAA GATATGCAGC TGTCTGAAAA ATCAATTATA GAAGCATTTG AAGATAACGA AGAGGACAAC TGCAGTTTGG AACTTCATAT TGGCTCAGGA GACGTGCACG AAACAGGTCA ACAGAAGAGT AGTGCAAAAA CCATGAGTGA ATATGACAGG TTCTTTCCTC CTGTCGTAGA GGATGTGGTA CAAGCAAATA TGAGAAGCAG AACAGAATCG GAGGAACACC GCGGAGATTT CGCCGATGTC AATGTTGCAT TGACGCTTCA CCAAGTAGGA CAACTACATC GGGCAGAAGG TGAATACGAT ATGGCTCTCT CTGCCTATAC AGTGGCCTTG CGAGGTATGC AATACGCGTT GGGGGAAAAA CATCCTAATG CCGCTGCGAT TCTTGGCAAC ATGGGAAATC TGCAGAAGGA AATGGGTGAC ATGGATGCAG CGTTCGAAAC ATATCAGCAG GTGTTAGGCA TTGAATCGTA TCGTCTTGGT TTGAGTCATC CAGATGTCGT TGTTACCCTC CACAATATCG CGACCATAGA CGCTGCTCGT GGAAACAACG AACACGCGTT GGCCTTGTAC AAACAGGTGA TCAATTTGCA GCGAACATTG TTCGGAGAAG ATAATCATGC TGTTTCGGTA ACGTCCGCAT GCATGGGAGA CGTTTATGAG CGTGTTGGGG ACATAAAGCA AGCAATCGAA TGTTTCGAAG AAGCAATTCG TATCAAAACT ACAGCTCTTG GACGGCATTC ATTAGAAGTT GCGAGGCTGC TGCACAAGCT TGGCAAACTT TCTGCTAAAA AAGAAGAATT TCACCATGCC AGTTCATTTA TTTCCCGTTC CATTCTTGTT TATCGATTGA ATAAGTTATC AGAAGAAGAC GAATGGGTCG TTGATGCTTA CCGAGATGCT GCCGATATTG ATGGCGCTAT TGCGTTGGGA AAAGGAAATT CGTTCGAGTG CTAA
|
Protein sequence | MAAKVAFRAF DDPQMESDEG IFITKQHNLE SERPSALRRA RISIRRTKKM PTHAKNAPKQ SRPFGFRAIR KGSQKKGVAD DGNGNNSDDD PVMLLFEARS TQRVRFSFED PKESRRFTKK ENLGGTNTPT SSAVSSSPSS SIKKFMKGAL FSKGSKMSEE NEVFDIELRK EIETAKKDSG LVWENDNASL SSDLLFPGKQ QEATLLSFWE DQETRANVIK LTNKARRAQV VHFRFEYAVK CYVNALELLK AAKYPDDHPM VVKTMVSLNH AHHAVNSYKN SANIVKMGIK YEDAGELVRA LKMYTIAYRI RRDTLSRAHP SLVVLLNMLG TIQIKRGELE EAMQIYELAL RETPPTLELE GAKEELHAKP DNLLARSVTF REMGTIFERW GKNVQALKKY HESLDCVAQY KGVVIADHAH ENENVHRLQR STRDQNPDTK DMQLSEKSII EAFEDNEEDN CSLELHIGSG DVHETGQQKS SAKTMSEYDR FFPPVVEDVV QANMRSRTES EEHRGDFADV NVALTLHQVG QLHRAEGEYD MALSAYTVAL RGMQYALGEK HPNAAAILGN MGNLQKEMGD MDAAFETYQQ VLGIESYRLG LSHPDVVVTL HNIATIDAAR GNNEHALALY KQVINLQRTL FGEDNHAVSV TSACMGDVYE RVGDIKQAIE CFEEAIRIKT TALGRHSLEV ARLLHKLGKL SAKKEEFHHA SSFISRSILV YRLNKLSEED EWVVDAYRDA ADIDGAIALG KGNSFEC
|
| |