Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47861 |
Symbol | |
ID | 7203084 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 265564 |
End bp | 270456 |
Gene Length | 4893 bp |
Protein Length | 1234 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182190 |
Protein GI | 219123768 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.090651 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGTTGC GTCGTCGTCT GACCAAGCCC GGTCGTGAAA GCCTGAAGGG GTGGGAAGGT TGGGGTTCCG CTCCGCACAG CGACCTCCTC GATACAGCAC GAGAGCCTCT TTCCCAAAAA GGGCCTGTTT ACTATCGTAC CTCGCTACCG TGTCGACTAG TCGCTCTACT TGGCGCTGGT GGGTCTCCCT TCTTACGTTT ACACGTCCCG GCACGGTAAA AATAGTAGGT ATACGCCCCC ATCGCTGCCT GCAGAGTTGG TGGTACACAA GTTGTGCGTC TATTGATCTA TCTATCTATA TCTCTATCCA CCCCAGGGTC CTTTGTGGGC GTGTTGGTAG AAGAGAATTA CGAAACTTCT TTGGGATAGC TTGGTGTCCC TTTTGAGGAC TCAAAAGACC GTTCAGGAAC CTGCCTACCA ATCCCCATTC GCGTCACTCT GTGTACGTTG ATTACCATCT TCGCGATTGT GCAGACAGTG TATCGCACTT GCAGTCTTGT TGGGGTTTCT GTTCTTGATT GATTCGGTCC TTACGGTGTT ATCGTAGAGT CCACCGACGA AGACAGGGAC ACACACGTTG GTTGTTAGAG ACTGGACGTC CGACAATACT CGGTACGGCG ACGACTTGTG GTCTACTTAT TCGTGCCGAG ACTGGAAGAT CCCGCCCGTA CACTTGGAAA GACACGCACA CTCGCTCAAA AAAGCACAGA CATATACATA CACAGTATGC CTGTTCGACC CAAACGCTCG TTGGCCAAGT ACTTTCACGG TGGGGATCCT TCGGATGTCG TCGAATGCCC AGGCTACCGT GCCGCCAACA ACGGCAATCA ACACCGTAGC GTACACAACG CAACCATGGA GGCACTCGTA CGTGGTTCCC CCGTCGCCGA CACTGCCAAT GGCGGGGGCG TGCTCGTACC GTCGGTGCAA CCGCAGCTCT CGGCTGTACG GCATAGCAAC GGTTGTTTGC ACTGCCACAC GGAAGACACC TCCGAAAACA ACGTGCAAGT CCGCTGCGAC GAATGCGGCG CCTATATTTG CGACGAATGT CACTGGTGTC ACGAATATCA GGCCAATCAC GAGATTCGAG TTTGCGATCG ATGTGACGGC TTTTACTGCA AGGGCTGCGA CGAAATGGAT CAGTGCGAAG ATTGCGGCGA AGTTGTATGT GCAAGCTGTA GTACTCTCCT CAGTTGCAAA TTTTGTGGTG GAGGGCTTTG TGAAGAATGC GCGACGGCCT GTGGACGGTA CGTAGGCTCT GGAGTATTGT GCAACCAAAC ATGGACCCTT TGAACACTAT ACGAGAGGAT ACGTTTGTAT TGCATACGAA TTGTTCTTCT CACACGATTT TTCTCTCACT TTTGGCAGTT GCGGTATTGT GTTATGCAGC CGGGACGCCA AGTTTGCGGT AGAGTGCGAT ACTTGTCGAC TGAGTTACTG CCTGGTATGC TTGGCTAGTG GGGTCAAGGA TCCTTGCGTC CGTTGTGGAC ATCGCCCTTC GAAACGCATG GAGCAACTCG TACACCTGCG ACTGAAATCC ATTTACAAAG CATTCAAACA AAACAGCGGT TCCAACGGTC TGGAAGTCCG TGCTCCGCAC ACGGCTACCA AAACGTATGC GCGTTTACGC GAAAACGACG AACCTTGCGA CAATCCGGAT TCACTATTAC AGGCTGCCGC GTCAGTGGTA GCGGCCAAAC ATCCGGAACT ATTGACTGCA CCGGAAGATG TCGAATCCCA GCTAGTCCGA CTCGACTTGG AACAAGAAAA AGCCGATGCA GCCGCGGCGG CGTTATTGGC GGAACTTGAA GAGGAAGAGC ACGCCGAACA AGTCAAGAAA AATAAGAAAA AGAAACGCAA AGGTCGCAAC GGCAATAAGA AAAGCGAGGA AGAGGTTGAC AAAAAGCTTC CAGCAAAAGA AGATCTACTT CCGCAGCCGT CCAGCCCGGA ACCAGTATCA GTAGCTACGG ATTTGCATGT AGTAGCTACC TCACCGTCAA TACCAAACTC GCCTGATCCT CCCAAAGCTT CGGCCGTCGA TTCGATGCAA CAAAAGCTTT GCGACCTTGT GATGAATGAG GATATGAAAG GGTTAGAAGA TTTGATGGCC TCTTTAAAGG GTGTTCCGGG ACAAGCAGCC TTACGCAAAA ATGCAAAAAA AGCGCTGAAA CGACTACAAA CACCAGAGGT AGACGCCCAT ATTATTGAAG CCAGAGAAAT TACCACTGCT ACTCTGACGA CACCTTTAGA CGAAGCAACC GTCCCTGATG GCGCTGCCAC GCCTCCGCCT CCCACCAGCG ACTTGCTTCA CGTTATTTCT TATACACACA ACAAATTGCC GACTCAACCT TCCAACGTCT CACATCGTGC ACGCAATGCC TCCGCTACAC CCAAGACGGA ATGTGTCCTT CACATGGCAT CCTCGATTGT CGGTTGGGTA ATTGGGAAAG GCGGTCAACG TATTCGCGAT CTTATGGAGG AATCCGGAGC CCGAGTTTGG ATCGACCAAG AGAATCTAGG CAAAATGGAT CCCCGTATTG TTTACGTTAG CGGTCATCGG AAAAACGTGG ACTCGGCGGT ATATTTGCTG CAAAAACTCG TTGCGCAAAC ACCTACCGAT CCTTCAGCGT CGAATCAAAA CACACTCTTG GGGCTTAAAA GCGACAGCTC GTTAGATCCT GGGATTGTGC CATCGCGGCC GTCGGATTCG CACGAAGGCA CGACTAGTGT GCGTGTGCAA ACTTACGGTG TGCATGCTGA TCGAGCCAGC GATACTACGG GCAAAGGCAA TCATATTCTC ACCTGCGATA AACGTTTCGT TCCTCTTTTG ATTGGGAGGC GAGGATGGAC AATTAAAAAC ATCCAGGATT CATCCGGGGC GCGTGTTGAT ATAGATCAGA ACGTAGCACC TCCCCGTATT ACAATCTCAG GTGCGGAAGA ACAGGTTTCG ATCGCGGTAG AAATGGTGCG GGATGTGTTG AGCTATCCAC ATTCGCAACT TCAAGGACGC GCTGGCAGGG ATGAATGCGA TCACGCGGCA GATCATGAAA GGAACACTCC CGGCGTTGAA TTGCAGATGA GACTCTACTT CCACCGAATG TATCACCAGT GGCCGACAGG AACAGGAATT CTCCGCCGTC TTCGTTGATT ATGCCAGACG ATGTTCAGAG CACGATTTCG GCTTCTTCTT CTTTGTCTTT GACTCCAGAG CCATCGACAG CGTCTTCGAA CAGAACCAAT TTGCACGTTC CTTCTGGACC TATGTTACCG CCCGCGTACA ACGCCGGACT CTATACTTCC GGTGTCAATG CTAGATCTAC TTCGTTTCAG CCAGGTTTTA ATGCGTCAAA CGGGCCACTT TTCACTGGCC ACAGTCCGAG TATGATTCTA CCGGCGGAGC AATTGTTTCG TGTGCAATCG GGTCAGTACG CAGAATTTCA GCAATCGACA AGGGATATAG GTGGCGCATC TTTATTTCCA GATCAGAATA TAGGCTTTGC TGCTCCTGCT AACGTTCAGC CTCCCAACTA CCTTCAGAGT CAATCTTCCT CTCCCTTTGA GTCAAATCCA CTGGGATCGT ACGGAAACAT TCCCTCCAAT CAGCAGCAAG CAGGGCTGTT TCCTCTTTCA CAACCCGTAT TTTCTTCCCG GAATGAACGA ATTATAGGCC AGTCAACAGC TGTCGATGCG CTAAGGTCGA ACTCATTGGA TCCGAAAGAA AGCGCGAGTA TGTGGGAACA ACTGGGGGGC GCAGCGGTTT CTCAACCGGC TGTTGCTAGC GGAGGCAGTG CCGGGTTCCA CTTGGATGCC GCCGTCGAAT TTCTGCAGAA CAGCAATTTG GGGCCACACT ATTCGCCAAT TTCCAGTGAT GCCGATCAAA ACATTGGGCA GTCGACTGGT GGATCTGTGA ACCCTCAAAG ATTTGGACCG TCGGCTAGGG GAAAGCGTAC CCCAGCACAC GGGAAGGCTG AGTCTCAGAT GGTCGACAGC TTTTTCGGAC CGAACAAGCA AGACATCAGG GACAACAGGG TTTTGAATGG CTTGTCAGGT CTCTCCGTTG CAGACAAAGA TGTATCTACT GGTCTTTGGG GTGTCCCTAT ACAAGCACTG CCATCACTTG GCGAGGTGGA AGGAGCGACT ATAAACAAAT CCTCTGCTTT GTATGCAGCA ATTCAACCGA ATCTTGCGAA TAAAAAAGAA CAACGCCCCG AGCATTCACG CTTTAATTGG GGGCCATGAA GAATTGCAAA ATGTTTTGCT TGATTTTACT TTCTTGCTGT TTCTCTACCC TTTAGACGTA CTGCCATTGT TTTGGTGCGG GTTGGATTGC AAGTGGAATA TTATAGACTT CTGGATTGGC AAGTAAGCGA GGCAAAAATA CACACGAATA GCATGGAGAA AGCTGGCTAA AACAGGGAGA GATGCGGGTC GACATACTGC TTCCGTGTCG ATCTAGACTT GGTTTCTACA TACTGCTTCT TTGTACAAAC CTTGATCATC TCTCACCAAT AATTAGCTCC TGCTTACTAG CTCAAAGTTT GTCTTTTACA CCACATCAGA CACATCTTGG GAGTATTCCT GCACTGTCGT AGATGGCTCG TCCGTGGCGA CATGAACTTC GGAGAGCATC ACTTCTTCCA AAAGTGGGAA GACTTGAATT CCGTTCTTGT TCCATTCTAA TGATCCTGAT GTGATAACAG AAACAACTAC TTCCCACACG TACGGAACGA AAAAGATTTC AGGATCCGCT TCCTCTTCGT ACGTAAAGGT GAAGTCTTCG TTTTCGCTTC TTTTTCGACG TTCGGTCACA ACGGCTTTGA TGCGGTCGAT GTTCTCCCCT AGCACTTTCA AAGCCTTAGT TGCCGTCCGG GCGTCACCAT TTTCTTCAAT AAT
|
Protein sequence | MSLRRRLTKP GRESLKGWEG WGSAPHSDLL DTAREPLSQK GPVYYRTSLP CRLVALLGAG GSPFLRLHVP ARLVSLLRTQ KTVQEPAYQS PFASLCSPPT KTGTHTLVVR DWTSDNTRMP VRPKRSLAKY FHGGDPSDVV ECPGYRAANN GNQHRSVHNA TMEALVRGSP VADTANGGGV LVPSVQPQLS AVRHSNGCLH CHTEDTSENN VQVRCDECGA YICDECHWCH EYQANHEIRV CDRCDGFYCK GCDEMDQCED CGEVVCASCS TLLSCKFCGG GLCEECATAC GRRDAKFAVE CDTCRLSYCL VCLASGVKDP CVRCGHRPSK RMEQLVHLRL KSIYKAFKQN SGSNGLEVRA PHTATKTYAR LRENDEPCDN PDSLLQAAAS VVAAKHPELL TAPEDVESQL VRLDLEQEKA DAAAAALLAE LEEEEHAEQV KKNKKKKRKG RNGNKKSEEE VDKKLPAKED LLPQPSSPEP VSVATDLHVV ATSPSIPNSP DPPKASAVDS MQQKLCDLVM NEDMKGLEDL MASLKGVPGQ AALRKNAKKA LKRLQTPEVD AHIIEAREIT TATLTTPLDE ATVPDGAATP PPPTSDLLHV ISYTHNKLPT QPSNVSHRAR NASATPKTEC VLHMASSIVG WVIGKGGQRI RDLMEESGAR VWIDQENLGK MDPRIVYVSG HRKNVDSAVY LLQKLVAQTP TDPSASNQNT LLGLKSDSSL DPGIVPSRPS DSHEGTTSVR VQTYGVHADR ASDTTGKGNH ILTCDKRFVP LLIGRRGWTI KNIQDSSGAR VDIDQNVAPP RITISGAEEQ VSIAVEMVRD VLSYPHSQLQ GRAGRDECDH AADHERNTPG VELQMRLYFH RMNRNSPPSS LIMPDDVQST ISASSSLSLT PEPSTASSNR TNLHVPSGPM LPPAYNAGLY TSGVNARSTS FQPGFNASNG PLFTGHSPSM ILPAEQLFRV QSGQYAEFQQ STRDIGGASL FPDQNIGFAA PANVQPPNYL QSQSSSPFES NPLGSYGNIP SNQQQAGLFP LSQPVFSSRN ERIIGQSTAV DALRSNSLDP KESASMWEQL GGAAVSQPAV ASGGSAGFHL DAAVEFLQNS NLGPHYSPIS SDADQNIGQS TGGSVNPQRF GPSARGKRTP AHGKAESQMV DSFFGPNKQD IRDNRVLNGL SGLSVADKDV STGLWGVPIQ ALPSLGEVEG ATINKSSALY AAIQPNLANK KEQRPEHSRF NWGP
|
| |