Gene Information |
Locus tag | PHATRDRAFT_44394 |
Symbol | |
ID | 7197859 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 418119 |
End bp | 420217 |
Gene Length | 2099 bp |
Protein Length | 668 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178505 |
Protein GI | 219115419 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000898741 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTGGCACAA TTCCATCGTC CAATTATCCC TTCATTTCCC GAGCAGATCC TTCCACACCA CATTATAGAC AGCATGGCCA AGCTCGTCGT CGCTGATGCC GTTGAGCGTA CGCCGCCGGT GGAAGCCGAC GCTGCCGGGG GTGCTGTCCA CGTGACCAGG GCGGCGCCTC TCACGAAACG TCAAAAACGC AAACAACCGA ATGACAAGCA CGCGTGGCCC CAGCAGCCCA AGATTTCGTC CCCCGCCGCT CGTGCCAACG CATACTTGCG CGTCGCCGGA CTCAACCAAA AGGAAACCCC GAAGGACGCC TTGGATCCCG ACCTTGATTT GCATTCTCCC CAGGTTAAGT TTGGGCGATT GCTAGGCAGT ACCGATCAAC GGGTACGTCA CCGGGCTATT CTGCAATTGG AGCAATACCT CAAGGCTCGG TGCGATATTA ACAACGAAAC GGGCGGAATT TCGGAGCTTG ATCTACTAAA ATTGTGGAAG GGAATGTGGT ACACTCTGTA TATGGCTGAT AAGGCACCAG TACAGGAGGA ACTCGGCAAA AAGATCGCCC GTCTGATCTG GTGCTTGGCG GGCACCGAAG AGGAGGACGA ATATGCTGGG CAGGCTTATC TGGAAACGGT TGGTGATGAT GGACCCATTG GCTTTGAAAA TGACGAGGAA TCCGATGACG AGGAGGTCAC CATGGAGGAG ATTGAGAATA CCTTGGAAAT GAATGGTAGC GAGGACGAGG AGTCGGACGA TACATTAGAG ACAAAAGACA CTACGAACCA TCACATGCAT GAACTAGACT CTAATCACGA GGACGATGGT GATATCGAGG ATCTGGAAGA TTCCGAAATT CCCCATTGTC GCGGAGCACA TTTGGCCACT CTCTTTGTGA AAACATTCTT TCACACTGTT CGTCGCGAAT GGGGCAAAAT GGACAAGTAC CGGGTTGACA AATTTTATAC ACTCATGCGC TTAATGATGC ACGAAGTGTA CGAATACATG GCCGTCCGTC ATTGGAATGT GGGCATTATT CGACTTTTCA ATGACGCCAT TTACGAAGAA GTTTTGACAC AAACACCCAA TGGGTTGCGC CTTCATTTGA TTGACCTAGT TTTGGATGAG GTAGTAGCTG TCAACGCCAA AGCTCCGATG CCTCTAACGG AGGCGACTTT CTTGGATTGT TTAGAGCCAT TCTTTGCCAT GGCGCAGACG GGAGCAGGCG AAGATTTGAT TCAGCAACGT GTACTAGAAA ACATTTTTGT CAGGTTCCTG AACAAATACA GTGTGGTGAA CGAGCACGCA CTGGATGAAG GCACCAAATC AGACTCCTTC ATCCTCGAGC AGGTACATGT TGCAACTGTA GCTCAATATA TCTTTGAGCT GGCTAGTGAC GGGGCAACAA AAGATCGCTT TCGCAAGTCA ATGTACAGTC TGCACAAGCA GTACATCCGA AGACTGAAAA CCGTTGGAAA AGATGTTGTG CTGCAGGACG AAGAAAGCGA AGAAGAAAAA GAGGAGCACA ACATCACCGT CGCATCTATT CAACCGATGG AAGAAAATTC GGGTGCACTA AAGGAAACCG AGAAAGCAAG TGAAAACGAC GAGGTCGGAT CGGCCTCCAA CAAAATTATC GAGTCGAAGA CATTGGATAA AAAGAAACGT AAACGAAAGA AGAATAAGAA AAGCTCGACT GGATCTGATG CTGTAGAACA GGACAGCACT ACTCCAAAAG AAGAAGAGGT TACTATATCT GTAGAGGAGC AAAAAGCTGC GAAGGATGCC ATGATTCCAA ATGAGAGGAA 
AGTTGACGAT ATCAACCCAA AAGCATCATT GCAGAAAAGG CGCAAAACCG CGACCAAGCG AGAATCTGAG CAAAATGATC GTAAGCGCGT AAAGTTTGGC TCGAAGAACC AAGCTATATC CTGGAAGGAA TCGATGAAAA ATTTGAGAAC GAGAGATCCA CCTGAGCCAA GAACAGCAAC CCCTGAAAAG GGTATTCTTC TAAACAAAAA TGCAAAATCG GCAATAATTC AAGGGTCCAA AGGGAAGAAG CGACGAAATA AGGCAGTAGA TTTCTTTTGA GTATCATTGA TAGTAAGCA
|
Protein sequence | MAKLVVADAV ERTPPVEADA AGGAVHVTRA APLTKRQKRK QPNDKHAWPQ QPKISSPAAR ANAYLRVAGL NQKETPKDAL DPDLDLHSPQ VKFGRLLGST DQRVRHRAIL QLEQYLKARC DINNETGGIS ELDLLKLWKG MWYTLYMADK APVQEELGKK IARLIWCLAG TEEEDEYAGQ AYLETVGDDG PIGFENDEES DDEEVTMEEI ENTLEMNGSE DEESDDTLET KDTTNHHMHE LDSNHEDDGD IEDLEDSEIP HCRGAHLATL FVKTFFHTVR REWGKMDKYR VDKFYTLMRL MMHEVYEYMA VRHWNVGIIR LFNDAIYEEV LTQTPNGLRL HLIDLVLDEV VAVNAKAPMP LTEATFLDCL EPFFAMAQTG AGEDLIQQRV LENIFVRFLN KYSVVNEHAL DEGTKSDSFI LEQVHVATVA QYIFELASDG ATKDRFRKSM YSLHKQYIRR LKTVGKDVVL QDEESEEEKE EHNITVASIQ PMEENSGALK ETEKASENDE VGSASNKIIE SKTLDKKKRK RKKNKKSSTG SDAVEQDSTT PKEEEVTISV EEQKAAKDAM IPNERKVDDI NPKASLQKRR KTATKRESEQ NDRKRVKFGS KNQAISWKES MKNLRTRDPP EPRTATPEKG ILLNKNAKSA IIQGSKGKKR RNKAVDFF
|
| |
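The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only and not part of the source database. It assumes the Start bp and End bp values are 1-based and inclusive, and that the protein is followed by a single stop codon. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 418119
end_bp = 420217
protein_length_aa = 668

# Assumption: coordinates are 1-based and inclusive, so the span
# includes both endpoints.
gene_length_bp = end_bp - start_bp + 1

# Assumption: one codon per residue plus one stop codon.
coding_bp = 3 * (protein_length_aa + 1)

# Bases in the gene span that the coding sequence does not account for.
noncoding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, noncoding_bp)
```

Under these assumptions the computed span matches the listed Gene Length of 2099 bp, and the remainder is the portion of the listed gene sequence lying outside the open reading frame.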