Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_40174 |
Symbol | |
ID | 7195819 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011690 |
Strand | + |
Start bp | 317447 |
End bp | 320196 |
Gene Length | 2750 bp |
Protein Length | 897 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184111 |
Protein GI | 219127790 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTCCCA AACCTCAAGA ACACCAGCAA GGGGAACCCG AACAAGACGC AGACGAGCAC CACTCGAGAA CTTCCAGCAA GAGTAGTCTG TCAGGGCGCG AACCACTCCT GTTTGACAAT GCAACTCTGT TGTCCGCGTC TGGGCATTTA CCATCGGAGA CTTCTCGTAG TACTTTAGAC GGTGAATTCG CTGTCAATAC TGCTACGGAG ATGGACTGTG ATATCTCACT TTCAATCATG AATCGTCCAT CACGGGATGT TGTTCTTCAA CGATTATCGG AAGCCCTGCT CCGGAGATCT CTGACAAAGG TATGTGGTAG CGACATATAT GATCCTTGGT GCCAAGACTG GTGCTATCTA CAAAGTTCGC TCTGACACAC TTTCTTTCTC TTGCTGCTTT CATTAAACTT TCGTCACTCG CCGTAATTTC AAAGATTGAC TTGTCCCAAC GAGGAATTCG AACCTCAGAT GCTCGACTCA TCAACATGGC ACTGGCCCAG AACGCATCCC TGACAACCTT GAAACTGGGA TACAATGATC TCGGGGACGA CGGCTTGCGA ACATTGGCAA ACGGTATCGC TCGCCACGGG GCTTTAGAAA GCCTGGATCT AGGGTTCAAC AACATCGGCG ATAATGGTTG CCGCGCTTTG GCCGAAGCAA TTACTGCGCA ACCCATGTCA CTTTCCAGAT TGCGGACTCT GTATCTTGCA GGAAACGCAT TGGGCGAAGA TGGAGCATTG GCCATTGCCA AAATCGTCCA GCACGGATCT CTGGAAAAGC TGTATCTCAC TGGAAATCGC CTAGGACCAG ATGGAGTTAG AGCCATTGCT GAGGCAGCGT TGGAGCTACA GCTGGAGAAA ATTCACAAAG TCAACATCAA TTGCGTAAAC AGTCTTCAAG CAAGCAGACG TGGGATCAAG GAACTGTTTC TCGGAGGCAC TGGATTGGGC GGTGTTGGTT GTCAGGCTAT CGCGGACTTG TTAGGGCAAT CGTCAACTCT GCAAGTTTTG TCTCTGGCGA ACTGCGACCT AGACAATGAT TCTTTGTCAG TCTTGGCATC CAGTATCAAG TCAAATAGGG AGCAATTACC CCTGGAGTCT CTTCAATTGT CATTTAACCA GATATCTTGC AAAGGAGTCG AGAGTCTTTC AAATGCTATA TGGGGATGCC GTTCACTTCG AGAGCTGCTT CTCGACAACA ATCAGATCGG AGACCGCGGG GCGGGGCAAA TCGCTGCCGT CTTAGCCTCT GCGAATCGCT TGGAGACCCT AAACGTTGGC TTCAATCGAA TCAAAGCAGT AGGCATTAAG GCGATCATGA AGACTGTTCC CGAAAGCGAG AGTTTACATT CCCTTTCTCT GTCGGGGAAC ACCGTTGACG CCAGTGCCGC GAGAAGTATT GCCTATGCTC TGGCATTCAA TCATTCTCTT CTTTCGCTCT CGCTAGTGAA CACCTCAATT CAACATGAAG GACAGCGACA TATTACTGCA GGAATTGTTT CCAATAGTCA CATTAAACTC TTGCAACTGA ACGGCTTCCG AATTGGCCCG ATCGTTGTCA CGCTCGGCTT TCCAGCTGCC TTGGAACATT GGAGCAATGA TCAGATTCTC AACTTTATTC ATTTAATGTG GGACAAATCC GCTGAGTTGG TGGCACAACA GGAGCACGAG GCAAAACCAG TCTTTGACAC ATCACGCTTT TTCTCGAAGG CGAATCCTCG AGATCGAGCG GCCCCGTTGG ACGCCGCGGT CGTGGTTGAC GTGGCAAAGA AGGCCTATGT GGAACTTGTT ACGGAGGGGG TTGATATTTT TTCAAAGCGA CCTGGCAATA TGCACGAGCT GTCGCCGCTA CCAGGTGATA ACTTCATAGT AGAGTCGACG AGGAAGGTCG GAGAGAACAG CTATGCCGAA TCGTCGCTTG AAAGCCATGT TCAAGCTCGT TCTTTCGTGA CATCCCCAGA ATTAGCCGGC TCTGAAACCT ATGTTCCAGA TCCTCAGCGA AAGAAACGCG TCATTGAGTG GCTTTGTTCT AATATTCAGA ACCTGAACAA AATGGCCCAG CAGCCCTTTA ACTCGAAAGA GTTATGGGCG CTTCACCAGC GATACTTTAC GCCAGTCGTC AACGAGTGTG GCGGAAGCGT CAACCCTACT TCAGAAACGT CCAACAATCA AAACGGGAAT CCAAAATTAC ACGCTTCCAG GGTTTCTCGA TCTAACTCGA CTGAGAACCC AGCTGATATG ATGAATGATT CAACGGATGA CACACTGATG ACCCAATCGA GTGACCCTTT CATCTTGGAT TCGCCTCAAG GCATCGTTTC CTTGCCCGTT CTGAAAAGAA AAGTGTCTTA CCGATTTCTT GGTGATGCAA TGGTAAATTC AGCCCCTCGA ATGTCAAATT GTGTGGAGAT GCGCGGGCCT GAGACAGAAC AGCCAATTTC GAACGGAATG GTGTCCATGA TGATTGAAGG AGGCCCGGTT GGCCACTCGA TGCCTCGCAA AACCAAACGT GCACGGAGGA ATCGCACTCG CATTTCATTT CTACCTCGTG TGAAGGTAAA GCTGGATTCG TATTTGGACG TCTGTCACGA GAAAGCGTTG ACGATGATGA GGCAACTGTA CTTTGTTGAA CGAGCAATCT TGCTGGGTCA GTTGAATTCA GATGTGAACT CGATGCCGTA CAGTGCCCGC ATGCACTTGC ACGGCGTCCT TGCTATGGAC GCTGAAATGA TTTTAGTTGA CATGATATAG
|
Protein sequence | MPPKPQEHQQ GEPEQDADEH HSRTSSKSSL SGREPLLFDN ATLLSASGHL PSETSRSTLD GEFAVNTATE MDCDISLSIM NRPSRDVVLQ RLSEALLRRS LTKFALTHFL SLAAFIKLSS LAVISKIDLS QRGIRTSDAR LINMALAQNA SLTTLKLGYN DLGDDGLRTL ANGIARHGAL ESLDLGFNNI GDNGCRALAE AITAQPMSLS RLRTLYLAGN ALGEDGALAI AKIVQHGSLE KLYLTGNRLG PDGVRAIAEA ALELQLEKIH KVNINCVNSL QASRRGIKEL FLGGTGLGGV GCQAIADLLG QSSTLQVLSL ANCDLDNDSL SVLASSIKSN REQLPLESLQ LSFNQISCKG VESLSNAIWG CRSLRELLLD NNQIGDRGAG QIAAVLASAN RLETLNVGFN RIKAVGIKAI MKTVPESESL HSLSLSGNTV DASAARSIAY ALAFNHSLLS LSLVNTSIQH EGQRHITAGI VSNSHIKLLQ LNGFRIGPIV VTLGFPAALE HWSNDQILNF IHLMWDKSAE LVAQQEHEAK PVFDTSRFFS KANPRDRAAP LDAAVVVDVA KKAYVELVTE GVDIFSKRPG NMHELSPLPG DNFIVESTRK VGENSYAESS LESHVQARSF VTSPELAGSE TYVPDPQRKK RVIEWLCSNI QNLNKMAQQP FNSKELWALH QRYFTPVVNE CGGSVNPTSE TSNNQNGNPK LHASRVSRSN STENPADMMN DSTDDTLMTQ SSDPFILDSP QGIVSLPVLK RKVSYRFLGD AMVNSAPRMS NCVEMRGPET EQPISNGMVS MMIEGGPVGH SMPRKTKRAR RNRTRISFLP RVKVKLDSYL DVCHEKALTM MRQLYFVERA ILLGQLNSDV NSMPYSARMH LHGVLAMDAE MILVDMI
|
| |