Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_41850 |
Symbol | |
ID | 7197631 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 1093879 |
End bp | 1096886 |
Gene Length | 3008 bp |
Protein Length | 685 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178361 |
Protein GI | 219115131 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAAGG TAGTTCCTTT GCATGCGGCG GATCACGAAG AATCCCGGTC GGGAGATCTG GTGCTCCTCC AAGCAAAGGA ACCACCTAAA GTTAAGGCCG TTACCTTTGC GCCGGTTGAA GACAGTCTCA AGCAGTTGGA ATCGGCGGAA GGAGGCGACG AAATTGCGGA GGAAGAATTG CAAGCCAGGT TTGTCCAACC ATACGTGGAC AATCCACAAC AAGCCATGGT CAAGAAAGGC CTCTTACTAA TGCTCCGGGA TGAAAACAAC AAGGCGCTAG AATTCATGGT GACACATATC GATACGGAAA ATGATGCTAG TGAAAGTAAA GCCTCTGAAG GTACGTGTCA AACAACACTG GATTGTATCA GTATTTATGT ATGTTGACTG CACGATAATT TCTAATAGTT CCATGCTTTG ATTGTTGCGA TTTACAGACG ACGACGAAGA CGTTGCGGTG GCGGGAGAAA TGACTTCTGA GACGGAAGTC ATCATGGGAA GCTCGACACC ACGTCTCGAA GTTGGACTAG GCTATGACTC CGTCGGAGGC CTCGATTCAG CGATTCAGCT CATGCGTGAG CTGATCGAAC TTCCGTTGCG TTTTCCCGAA TTGTGGACGA CTGCTGGTGT ACCTACGCCG AAGGGAGTTT TGCTGCATGG CCCTCCTGGG TGTGGGAAAA CGCTCATTGC GAATGCTTTG GTGGAAGAGA CGGGAGCGCA TGTCGTCGTT ATCAACGGCC CGGAAATTAT GGCACGCAAA GGAGGAGAGA GCGAAGCAAA TCTTCGCCAA GCCTTCGAAG AAGCCATCGA AAAGGCTCCG TCGATCATAT TCATGGATGA GCTTGACTCA ATTGCACCGA AGCGAGACCA GGCGCAGGGT GAAACGGAAA AACGCGTCGT GTCACAATTG CTAACCTTGA TGGACTCGCT GAAACCCAGT TCAAATGTCA TGGTCATCGG TGCCACAAAC CGCCCCAACG TAATCGAGTC GGCTCTCCGT CGTCCCGGTC GTTTTGATCG TGAACTAGAG ATTGTCATTC CCGATGAGGA TGGCCGCCAT ACCATTTTGA AGATTAAGAC GAAGGACATG AAAATTAGCG CTGACGTTGA CCTATTCCAA ATCGCTCGTG ACACACACGG ATACGTGGGT GCGGACTTGC AGCAACTTAC AATGGAAGCT GCTTTGCAAT GTATTCGTTC CAACATTGCA AATATGGATG TGGACAGCGA GGAACCTATT CCTGAAGAGA TTCTCGATAC GTTGGAAGTC ACTAACGATC ATTTTATTTA CGCGCTAAGT GTGTGCGATC CCAGTACCCT TCGCGACAAC AAGGTGGAGA TTCCAAACGT GAAATGGGAA GATATTGGTG GTTTGGAGGA GACCAAACGT GAACTACAAG AAATGGTTCG GTATCCGATC GAGCATCGGC ATCTTTTTGA GCGCTTCGGA ATGCAAGCCT CTCGTGGGGT TTTATTTTAC GGCCCACCTG GTTGCGGAAA GACGTTGATG GCCAAGGCTA TCGCTAACGA ATGTGGCGCC AACTTCATTT CCGTGAAAGG CCCCGAACTT TTGAATGCTT GGTTTGGAGG ATCCGAAGCC AACGTTCGTA ACCTTTTCGA CAAGGCCCGT GCCGCCAGTC CGTGCATTCT TTTCTTTGAC GAGATGGATT CAATCGCGCG TGCTCGCGGA GCGGGTGGTA GTGGCGGTTC CGAAACTAGT GATCGTGTCA TTAACCAAAT CCTCTCCGAA ATCGACGGCA TGGGATCGGG CAAAACGCTT TTCATTATTG GAGCGACGAA TCGTCCCGAT ATTCTGGATC CCGGTATCAT GCGTCCTGGG CGATTGGATC AACTGATTCA CATTCCGCTA CCGGACCATG ATTCGCGTGT TTCAATCTTT AAGGCCAATC TACGAAAGAG TCCTATCGAC GAAGAGGTCA ATATGAAACA GCTGGCAGAC GCTACTGAAG GGTTTTCGGG AGCTGACATA ACTGAGATTT GTCAACGAGC CGCCAAGAAT GCTATTCGAG ACAGCATAAC AGCCGGTATT GAGCGACAAA AGCGTGTCGA AGCAGGGGAG CTTTCGCAAG AAGAAGCCGA TGCTCTTCCA GACCCCGTAC CGTTTATCAC CAAAGCACAC TTTGAAGCTT CCATGAGCAA GGCGCGACGT TCGGTAGGCC CCGAAATTGT AAAACAGTAC GAAGATTTTA CTGCCAAGAT AAAGCAACAA TGGAGTAGCT CCGGTGCCGA AGGTGCAGAA AACGTTTACG ATATCGACGC GGCAGCAGCC GAACAGGCAC GCGAGGACTC AATGGTAGAG GGGGATGAAG AAACACTAGT CCCAGTCGTT GGTTCAGATT CCGATAGCAA TGAATAGACC AAGCTAATGA CTGATTATGC TGACAGTGAA TTTGAATTCT AAATAACCCC TTTGAGCGTA GTATACTGAA CTTTTTCCAT GGAAATGGAT AGTAATTGTG TGAACAAATG TGGTAGTGGA TTGGGTCCTG TGACATAGTT CCTATGTCCG GGTACAACGA AATATACTCG CCGTTCATTT TGCGTGTCGA AACGATATCG GTATCCTTCG TTCCGAACAC TCCGAGATTC ACTCATGTAA GCCGAGTGTT GTAAATTTTT GTAGACACCC AGGTGATTCT TTTCCTACGA CTTTGCAGTA AAGGATTTAT GTTAGATTAG AGACTCGCTG GCTACCGTTA TAAATCCCAA CCCGACTGTT GAACCTTGTG GAATATGCGT TAGACCACAT GATAGTCTAT TATTGGTAGA GCCCTCTAGT CCCACAAAGT CCAGTGTATG TAGTCTTGTT TAACTGGAAA TCGGTGTCCG AATTGTCGAA CCGTCCTATT CCCACACACA TATAAATATG TATGGATACA ATACCGGTAG TAATTTGATC AGCGATTTTT TCAACTTTCT AAATATATCG CATTTACTAA AAAGCCGTAC CACTACGACA AAAACATCTC TTCCCAAAAG AGGTCTAA
|
Protein sequence | MVKVVPLHAA DHEESRSGDL VLLQAKEPPK VKAVTFAPVE DSLKQLESAE GGDEIAEEEL QARFVQPYVD NPQQAMVKKG LLLMLRDENN KALEFMVTHI DTENDASESK ASEVIMGSST PRLEVGLGYD SVGGLDSAIQ LMRELIELPL RFPELWTTAG VPTPKGVLLH GPPGCGKTLI ANALVEETGA HVVVINGPEI MARKGGESEA NLRQAFEEAI EKAPSIIFMD ELDSIAPKRD QAQGETEKRV VSQLLTLMDS LKPSSNVMVI GATNRPNVIE SALRRPGRFD RELEIVIPDE DGRHTILKIK TKDMKISADV DLFQIARDTH GYVGADLQQL TMEAALQCIR SNIANMDVDS EEPIPEEILD TLEVTNDHFI YALSVCDPST LRDNKVEIPN VKWEDIGGLE ETKRELQEMV RYPIEHRHLF ERFGMQASRG VLFYGPPGCG KTLMAKAIAN ECGANFISVK GPELLNAWFG GSEANVRNLF DKARAASPCI LFFDEMDSIA RARGAGGSGG SETSDRVINQ ILSEIDGMGS GKTLFIIGAT NRPDILDPGI MRPGRLDQLI HIPLPDHDSR VSIFKANLRK SPIDEEVNMK QLADATEGFS GADITEICQR AAKNAIRDSI TAAHFEASMS KARRSVGPEI VKQYEDFTAK IKQQWSSSGA EAVPLRQKHL FPKEV
|
| |