Gene Information |
Locus tag | PHATRDRAFT_46438 |
Symbol | |
ID | 7201793 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | + |
Start bp | 340064 |
End bp | 342003 |
Gene Length | 1940 bp |
Protein Length | 565 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180806 |
Protein GI | 219120121 |
COG category | |
COG ID | |
TIGRFAM ID | |
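
The length fields in the table above can be cross-checked against the coordinates. The sketch below assumes that `Start bp` and `End bp` are 1-based, inclusive positions on the replicon, which is consistent with the reported gene length of 1940 bp. The function name `feature_length` is hypothetical and introduced only for illustration.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp, End bp, and Protein Length rows.
gene_length = feature_length(340064, 342003)
print(gene_length)  # 1940

# A 565 aa protein needs 565 codons plus one stop codon.
coding_bp = (565 + 1) * 3
print(coding_bp)  # 1698
print(gene_length - coding_bp)  # 242
```

If the gene span runs from the start codon to the stop codon, the 242 bp difference would be non-coding sequence inside the gene, which fits the "Is gene spliced | Yes" row.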
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CAATGCGAAT CTTACTCTCG GGGACGACGA CGGTTGTCTG CTGAATGAGC AGTACCTGCT GGTTTGAAAC GCTTGGCGTG CTGATCATCT GCTGCTTTGA GCAGACAGAC AGTTGAAAAG ATGAACATCC TTTTTTCAAA GCCTGATTGG TTCCGTGTGC TGTTGCTGTC GGCACTAGCG CTGAGGTGTC GACCCGTGGA AGCCGTTCCC GGCCAAACCT TGGTGCACCT GGTGCAAGAT ACGCTCTACA ACTTACGGAA CGAAGACAAT TCCAACCTGG TGTATTCGTC TATCGTTGCA TTCTTGTTGA TGGTGCTACT GCAGAGATTC TATCGGCACC GGGCTATGCT TTGTTACGGC ATGCCGGTTA TGACGCACCC TCCGCAGGGA CGACCATGGC CATTCCTGGG ACACGCTCTA AATTTCCTGA GTTACAGACC ATGGGATTTG CTGATGAGCT GGCACAATCT CTACGGCCCA ATCGTCTGCT TCGACCTTTT GGGATCAACC ATGTTCAGTC TAGCATCGCC CAGTCTCCTC AAGCTAGTGT TGCAGTCCAA AATCCAGTCG GTCAAGAAGG ATATAAGCAA CACCATGAAG CACTTTATAG TTATTCTTGG CACCGGCATT GTTACTTCCG AAAATCAGTC CTGGATTAAA CAACGACTCA AGATGAGCCA TCCTCTCCGC GTCGAAGTTC TCGAAATGAT CCCCCGTCAA ACATTGCTTG CGGTCCAACG CTGGATGACA AAATTGGATG CAGCTTGCGA GACACAGGAA TCGGTTGAAG TGGGATCGTC CTTGAGACAT TTGACGCTAC AGGTTATTTC TGGAACTTTC TTGTCCCTGT CGGCAGAAGA ATCCGACTCG ACGTTTGCAA AGATGTACCT ACCCATCGTT GACGAATCTA ATCTGCGGGT TTGGCACCCT TACAGAGCTT ACCTCTTTAT GCTACCGGTT TTTTGGAAGT ACCTGTGGAA TGTACATAAC TTGAACAGGT ACGTGTCGCA CTTGATTCGG GTTCGATGGC TGGTTCGTCA ACAGGAACGT ATCAACGGCG GATCGGTCCG AACCCAGGAT ATTTTGGACG GCGTCTTGAA GGCACACGAG AAAGAATTTC CGCACCAGAT GAACCTACCA GAAATGGCGG TGCGTCAATT TAGAGATGAA ATGAAAACTT TTATGTTGGC AGGTCACGAA ACTTCCGCAG CTATGATGAC ATGGACACTG TATGAGCTGC TGGCCAACAC GGCATTGATG CAACGGGTTT CTGAGGAAGG CGCGTCCTTG TTTGCGCGGA ATGTGGACTG GAGTAGGGCG GGAGCCGATG AATTACCTTC GAACGACCAG TTGAAGCATT TGATACTATC AGAAGCCTGT TTAAGGGTAA GTCTCACTGC ATTCGCTACG GACTTGGAGA ATAGAGTTTC GCTTTATGAT GTGCTCACAT GTCCTATTCT TAAAGGAATC TTTGAGAAAA TACTCTGTCG TTCCAATCGT TGCGCGACGA ACGGTTGAAG ACTTATACCT AGAGGACGGC AAGTACTTTA TTCCGAAGGG CAGCTCGTTT TTGATCAATA TTCAAGCTAT TCACCATGAT CCAAACCTGT GGCCCAATCC AATGAGATTC GATCCTGATC GATTTGTGGA TGGGGAAATT GTCCCATACA CTTTTCTTCC TTTTATCGCG GGCCCTCGGA ATTGTTTGGG GCAGCATTTG GCACTGCTTG AGAGCAAAAT GGTAATTTCG TTGCTCGCGC AACGCTATAT TTTCTCGCTC GGAGAAGGCG CTACGTTAGA 
AGTGGACGAC TGGGAAAACG ATAAAGACCC TAGGCATCGG TTTATGGTCC CTGTTGTTCC CAAAGAGGAG CTCAAAGTTA CCGTTCAAAG GAAGTAAAAT GATTACGCTA ATTCTTAGGC TAACGGACGA GAAATCTTTT
|
Protein sequence | MNILFSKPDW FRVLLLSALA LRCRPVEAVP GQTLVHLVQD TLYNLRNEDN SNLVYSSIVA FLLMVLLQRF YRHRAMLCYG MPVMTHPPQG RPWPFLGHAL NFLSYRPWDL LMSWHNLYGP IVCFDLLGST MFSLASPSLL KLVLQSKIQS VKKDISNTMK HFIVILGTGI VTSENQSWIK QRLKMSHPLR VEVLEMIPRQ TLLAVQRWMT KLDAACETQE SVEVGSSLRH LTLQVISGTF LSLSAEESDS TFAKMYLPIV DESNLRVWHP YRAYLFMLPV FWKYLWNVHN LNRYVSHLIR VRWLVRQQER INGGSVRTQD ILDGVLKAHE KEFPHQMNLP EMAVRQFRDE MKTFMLAGHE TSAAMMTWTL YELLANTALM QRVSEEGASL FARNVDWSRA GADELPSNDQ LKHLILSEAC LRESLRKYSV VPIVARRTVE DLYLEDGKYF IPKGSSFLIN IQAIHHDPNL WPNPMRFDPD RFVDGEIVPY TFLPFIAGPR NCLGQHLALL ESKMVISLLA QRYIFSLGEG ATLEVDDWEN DKDPRHRFMV PVVPKEELKV TVQRK
|
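
The sequences above are displayed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that clean-up; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the sample strings are only the first few blocks of each sequence, not the full records.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a (possibly grouped) DNA string."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First three blocks of the protein sequence shown above.
protein_start = "MNILFSKPDW FRVLLLSALA LRCRPVEAVP"
print(len(clean_sequence(protein_start)))  # 30

# First two blocks of the gene sequence shown above.
dna_start = "CAATGCGAAT CTTACTCTCG"
print(gc_percent(dna_start))  # 45.0
```

Applied to the full protein sequence, `len(clean_sequence(...))` gives 565, which matches the Protein Length row. The displayed gene sequence appears to include flanking bases on either side of the coding region, so a GC value computed from it need not match the 48% reported for the gene exactly.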