Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_32057 |
Symbol | |
ID | 7196190 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1574985 |
End bp | 1577664 |
Gene Length | 2680 bp |
Protein Length | 840 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177323 |
Protein GI | 219111143 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGATCA TCCTTTCAAC GCTGGCCTTG TTTGCCTTAT TTGTACCCAC TGAATCTCAG GAGAGGTCCC GGGATGTGCA ACTTGATCCA CGTCCATTTT GGCTGATTGA TCAGATGCGT CCATCTACTT TAAAGGATCA ACTTGGTAAG TTTTCGTGGT CCGCGAGCTA GTCAGCCTTC TCGTTGTGCT TCTAACTTTC TTGTGCCTCG CAATCTCCTC TAAACATCAG CGATGTGTGC TGAAGACACG AAGCACTACG CCGTTTCGGA TTTCTCCATT GGACACCGTG GAGCGTGCTT GCAGTTCCCT GAACACACGC TCGAGTCCTA TGCTGCGGCC TTGGAAATGG GTGCTGGGAT TGTCGAATGC GATGTCACTT TCACCAAAGA CCGGCAGCTC ATCTGCCGCC ATTCCCAGTG CGACTTGCAC ACTACCACCA ATGTGGTTAC CATCCCCGAC CTCAACGCCA AATGTACAAC GCCATGGTCG GCTGGTGTCT CGCCAAACTG CTGCGCATCG GACTTTACGC TTGATGAAAT CAAAACACTC TGTGCCAAGA TGGACTCCAG CGGCCCGTCC GAAGCCGCTA CGGCTGAAGA GTACGCGTAT GGCGGCACTG CGGATTGGCG TACCGATTTG TACCAGCTCG AGTCCTGTCC CGAGGTCCCC ACGCACAAGG AAAGCATTGC TTTCATCAAG GCCGGTAACG GGTACTTCAC CCCCGAGCTG AAGTCTCCCA GCGTGGAGAT GCCATACGAA GGCAACTACA CTCAGGAGAT GTATGCCCAG CAAATGATTG ACGAGTACGT TGAGGCGGGC GTCCCCCCCG AACAAGTGTG GCCTCAGAGC TTTAGTCCGG ACGACGTCTT TTACTGGATT GACAATACCG AGTTTGGTGC CCAGGCGGTT GCCCTGGATG ACCAGTACGA CCTGAATACT ACGGACGTGG AAGCCTTCCT TGACTTGCTG GTTTCTCGCA ACGTAAAAAT TGTCGCTCCA CCAATGCAAC GTCTGGTCGA TCCCGCGCCG GACAGCGAGT ACTTGATGAC TGCCTCCCAT TACGCCAATG CTGCGAAGAA CCGCAGTCTT GGGGTGATTG CTTGGACTTT GGAGCGTTCT GGGCCTGGCC TTACTGGCTT TTACTGGGAA ACGTTGGAGG ATCAGGTTGA GCTGAACGAA GGCGACCGCT ATAACTTACT GCATGTACTG TTGAACGAGG TGGAAGTACT TGGTGTTTTC TCGGACTGGC CGGCGACGAC GACTTTCTTT GCCAACTGTA TGGGCGCCAC GTTGCGGTCG AGTCCCGTCA TGGCTGCAGC CAACGACGAT AGAGTTGGCG GAATCGGCTC GGTACAAGTG GGGCCACGGC CGTACTGGCT TGTTGAGCAA ATGCGCCCAT CCGCATTGAA AAATGAACTC GGTAAGTATG AAGGGATGCT TTCTCTTTCT TCAATAAGTT TTCTGTACTG ATGCAAACAT TCTTACTTTG CAGTTGCTTG TGCGGATAGA AGCAACGAGT TTCTTGTGTC TGACTTCTCC ATTGGACACC GTGGAGCGTG CTTGCAGTTC CCCGAACACA CGCTCGAGTC CTATGCTGCG GCCTTGGAAA TGGGTGCTGG GATTGTCGAA TGCGATGTCA CTTTCACCAA AGACCGGCAG CTCATCTGCC GCCATTCCCA GTGCGACTTG CACACTACCA CCAATGTGGT TACCATCCCC GACCTCAACG CCAAATGTAC AACGCCATGG TCGGCTGGTG TCTCGCCAAA CTGCTGCGCA TCGGACTTTA CGCTTGATGA AATCAAAACA CTCTGTGCCA AGATGGACTC CAGTGGCCCG TCCGAAGCCG CTACGGCTGA AGAGTACGCG TATGGCGGCA CTGCGGATTG GCGTACCGAT TTGTACCAGC TCGAGTCCTG TCCCGAGGTC CCCACGCACA AGGAAAGCAT TGCTTTCATC AAGGCCGGTA ACGGGTACTT CACCCCCGAG CTGAAGTCTC CCAGCGTGGA GATGCCATAC GAAGGCAACT ACACTCAGGA GATGTATGCC CAGCAAATGA TTGACGAGTA CGTTGAGGCG GGCGTCCCCC CCGAACAAGT GTGGCCTCAG AGCTTTAGTC CGGACGACGT CTTTTACTGG ATTGACAATA CCGAGTTTGG TGCCCAGGCG GTTGCCCTGG ATGACCAGTA CGACCTGAAT ACTACGGACG TGGAAGCCTT CCTTGACTTG CTGGTTTCTC GCAACGTAAA AATTGTCGCT CCACCAATGC AACGTCTGGT CGATCCCGCG CCGGACAGCG AGTACTTGAT GACTGCCTCC CATTACGCCA ATGCTGCGAA GAACCGCAGT CTTGGGGTGA TTGCTTGGAC TTTGGAGCGT TCTGGGCCTG GCCTTACTGG CTTTTACTGG GAAACGTTGG AGGATCAGGT TGAGCTGAAC GAAGGCGACC GCTATAACTT ACTGCATGTA CTGTTGAACG AGGTGGAAGT ACTTGGTGTT TTCTCGGACT GGCCGGCGAC GACGACTTTC TTTGCCAACT GTATGGGTGC ATCATTGCGA ACGGCCGAAG AAATTGATGT TGCCGTTGAA GGCAGTAACA CGAGCGCCTC GGCACAGGTT GGCTCCATTG TAGCCACCAT CACAATCTCA GCGCTTGTCC ATTTGCTAAT TTTTACGTGA
|
Protein sequence | MEIILSTLAL FALFVPTESQ ERSRDVQLDP RPFWLIDQMR PSTLKDQLAM CAEDTKHYAV SDFSIGHRGA CLQFPEHTLE SYAAALEMGA GIVECDVTFT KDRQLICRHS QCDLHTTTNV VTIPDLNAKC TTPWSAGVSP NCCASDFTLD EIKTLCAKMD SSGPSEAATA EEYAYGGTAD WRTDLYQLES CPEVPTHKES IAFIKAGNGY FTPELKSPSV EMPYEGNYTQ EMYAQQMIDE YVEAGVPPEQ VWPQSFSPDD VFYWIDNTEF GAQAVALDDQ YDLNTTDVEA FLDLLVSRNV KIVAPPMQRL VDPAPDSEYL MTASHYANAA KNRSLGVIAW TLERSGPGLT GFYWETLEDQ VELNEGDRYN LLHVLLNEVE VLGVFSDWPA TTTFFANCMG ATLRSSPVMA AANDDRVGGI GSVQVGPRPY WLVEQMRPSA LKNELVACAD RSNEFLVSDF SIGHRGACLQ FPEHTLESYA AALEMGAGIV ECDVTFTKDR QLICRHSQCD LHTTTNVVTI PDLNAKCTTP WSAGVSPNCC ASDFTLDEIK TLCAKMDSSG PSEAATAEEY AYGGTADWRT DLYQLESCPE VPTHKESIAF IKAGNGYFTP ELKSPSVEMP YEGNYTQEMY AQQMIDEYVE AGVPPEQVWP QSFSPDDVFY WIDNTEFGAQ AVALDDQYDL NTTDVEAFLD LLVSRNVKIV APPMQRLVDP APDSEYLMTA SHYANAAKNR SLGVIAWTLE RSGPGLTGFY WETLEDQVEL NEGDRYNLLH VLLNEVEVLG VFSDWPATTT FFANCMGASL RTAEEIDVAV EGSNTSASAQ VGSIVATITI SALVHLLIFT
|
| |