Gene Information |
Locus tag | PHATRDRAFT_43491 |
Symbol | |
ID | 7197543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | + |
Start bp | 603675 |
End bp | 605681 |
Gene Length | 2007 bp |
Protein Length | 563 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177650 |
Protein GI | 219111797 |
COG category | |
COG ID | |
TIGRFAM ID | |
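The Start bp, End bp, and Gene Length fields above are related by simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive (the record does not state the convention, but the reported length of 2007 bp is consistent with it). The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(603675, 605681)
print(gene_length)  # 2007

# Nucleotides needed to encode the 563 aa protein plus one stop codon.
cds_with_stop = 563 * 3 + 3
print(cds_with_stop)  # 1692
```

The coding region (1692 nt including the stop codon) is shorter than the 2007 bp gene span, which is consistent with the listed gene sequence carrying untranslated sequence on either side of the coding region.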
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00664754 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATAGATAGGC TTGGAGCCAC TGGCCCAGCT TAGCTTTACA TTAACTATCT CGCAATCACC GTCCAAAATA GTTCCGAACT TTGAGTAATT TGACGACTCT CACCATCGAT TCTTTCATCC CGGGGCTCTC GGACTTTACG TTCGTAAAAT ACATCTCCTA TCAGGAGGCT ACTGCTTCAA CACCTCCGTT CTACTCAAAT TTACAGTTAA TCGCAAATCT CTCCTTGGAA GTACTTCCTT CAATTTTTCT GAAGTGAAGC TTGCGACCTC AGCCATGTCG ACAGCGTCTG ACGATGCGGC GGAAAAGGCA GCAGGATCCA ATGTGGAAGA CGGGTCCCTC TCTCTTGCTT CTCACCATTT TGGCAACGAA ACGGAAATGC CCAAGATTGA GGAAGGCCAA CTCCGCCGCG CTCAGTCCAT TTCTGAAGCA GTCGAAAACA ATGATGACAA GCTCTGTAAC GCCCCTTGGT TTCTTAAGTT TATCGAGTAC CCTTTCCATG TAAAGGATCC AACTCGGGAC GTCTACTTAC CTGAAGCGGC CGGCTGGGCG ATGGATTCTG CGGGTCGGGG CCCGCTCAAT CAAGTCGGAT CTTACGTTGG ACAGGCGATT CTTCGTTTGG CTACTCAAGA TGCCGGCTGC TTGAACCCGC GAACTTGTAC GAATACTGTA TATGGTTTGA AGCCAAGTTC TCTCCTTACC GCAACTACTT CTATCGTTGG AGTAGTGGCT GCTTTTCTGA TGCCCATCGT CGGGGCAGTG GTCGATCACA CCACGCATCG ACGCTTACTC GGGCTCGTTT CTGGAATGGC GGCCGTCGTT CTCGCTGGCA TTCAGATAAG TGTGAACGCG AACAACTGGT TCTTCATCCT CTGTGTAGAC GGAGCGTTGT CTTTCTCGCT ACTTGTACAC ACAACGGCAG TATTTGCCTA TTTGCCCGAC TTGAGTTTGG ACGAAAATGT TCTCTCCCAC TACACTTCAC ATTTCAACAT TCGGCAATAC TCGGTTCAGG TTGTCTATCT AGGCTTAGTC ATTATTACAG GAGAAGTCCG CAATTTACCC TCTCAAGCAA TTGCCACTTC GGTGCAAACT GCGAAAGACG CAGCGGGCAT CAGCTTTGGC GTTGCCGCTC TTTTTATTGG ATACGCTTGG ATCTTTCTAT TCCGCCCTCG GCCAGCCTTG TCCAAAGTTC CCGAAGGACA AACATTGCTG ACCACCGGTT TCGTACAAGT TCATCGAACT GGCAAGAAAA TTTGGAAGGA TTATTGGGCG TTGAAGTGGT TCATGTTCAG TTTACTGTGG TCTCCCGAAG CTGGTGCAGG TGTCATCCAA TCGATTGCCG TCACATTCTT GACTGTGGTG ATGAAGTTTA CCGGTCTGGA TTTGGCCAAG GCCATGCTGG TTCTGATGGT TGGAAACATT TGCGGCTCTT TATTTTCAAA ATGGGTGTGT CAAAAGATTA ATCCGCTCAA TTCGTATCGT TGCGGTCTCA TGTCATTGGC TGTTTCGATT TTCGTGTCGG CGTGGACATT AAATGGACCA GAAAGGCGCG CAGCCGTTTT TGGTTTTACG TTTTTTTGGG GGGTCTCTAT GGGATGGGTT AACCCGTCGC AACGTGTGCT GTTATGCACA CTTATACCGA AAGGTCAAGA GACCGAAATG ATGGGGCTAT TCGTTTTCAC TGGCCAGATT CTAGGCTGGC TACCACCTCT CATTTTCACT CTAATGAACG AGAATGGCGC CGACATACGC TGGGGATTCG GGCTGGTCAC TTTCTTTTGT GGTTTTGCGG CAATCTGCAC 
ACTCCCAATG GGCAATTATT ACGAGGCTGT TGCATGGGCC GCTCGCCAGT CGGAAGAGAA GCTGGGAGAA GTTCTCGTAA ATGCTCAATC TCGGAGTGAA AAGCTGCATG AAAGTGCACA ATCGATTGGG AGCCAAGATT GTGCAACAGA GAAAATTGAA AAATAGACTT GCTAGGCAAC TAATCTTAGC AGCAGTATAT TTTGCTC
|
Protein sequence | MSTASDDAAE KAAGSNVEDG SLSLASHHFG NETEMPKIEE GQLRRAQSIS EAVENNDDKL CNAPWFLKFI EYPFHVKDPT RDVYLPEAAG WAMDSAGRGP LNQVGSYVGQ AILRLATQDA GCLNPRTCTN TVYGLKPSSL LTATTSIVGV VAAFLMPIVG AVVDHTTHRR LLGLVSGMAA VVLAGIQISV NANNWFFILC VDGALSFSLL VHTTAVFAYL PDLSLDENVL SHYTSHFNIR QYSVQVVYLG LVIITGEVRN LPSQAIATSV QTAKDAAGIS FGVAALFIGY AWIFLFRPRP ALSKVPEGQT LLTTGFVQVH RTGKKIWKDY WALKWFMFSL LWSPEAGAGV IQSIAVTFLT VVMKFTGLDL AKAMLVLMVG NICGSLFSKW VCQKINPLNS YRCGLMSLAV SIFVSAWTLN GPERRAAVFG FTFFWGVSMG WVNPSQRVLL CTLIPKGQET EMMGLFVFTG QILGWLPPLI FTLMNENGAD IRWGFGLVTF FCGFAAICTL PMGNYYEAVA WAARQSEEKL GEVLVNAQSR SEKLHESAQS IGSQDCATEK IEK
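As a rough cross-check between the two sequences above, the start of the protein can be reproduced from the gene sequence. The sketch below is in Python and assumes the standard genetic code, since the Translation table field in this record is empty. The function name `translate_from_first_atg` is hypothetical, and the excerpt is a short stretch copied from the gene sequence that begins just upstream of the start codon. On the full gene sequence, the first ATG is not guaranteed to be the annotated start, so this is an illustration rather than a general ORF finder.

```python
from itertools import product

# Standard genetic code (assumed), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    seq = dna.replace(" ", "").upper()
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# Excerpt copied from the gene sequence in this record (spaces are ignored).
excerpt = "AGCCATGTCG ACAGCGTCTG ACGATGCGGC GGAAAAGGCA"
print(translate_from_first_atg(excerpt))  # MSTASDDAAEKA
```

The output matches the first twelve residues of the listed protein sequence (MSTASDDAAE KA).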