Gene Information |
Locus tag | PHATRDRAFT_43233 |
Symbol | |
ID | 7196954 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 2400287 |
End bp | 2402333 |
Gene Length | 2047 bp |
Protein Length | 517 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177507 |
Protein GI | 219111511 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.479677 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGAGAGAGGG AAGCTTTGGA CAACGAGTTT TCTCCCGTGT CGTGCGTGAT ATATAGTTTG GTTCGTTGGG GTATTGTATA TCTGTGCTCT CGAGTGTCCT TGGTGTCACT TATTCACACA TACACATCGT ATTTTATCGT AGTCTACGGC GTTCACAATG AGGATTTGCA TGCGCAGTCA GAGTTACTCG TCGCTCGTGG TGGCTAGTAT GGTAGTGTGT ACGGCGGCGT TCCAGACGTC CGTCGTTTCG GTCCGCCGTC ATTCGCCGCT GTCTTCGACG ATTGACGGTG TGGATCGGAG CGAAGCGGTA GCCAAGTTGG AACGCGCCGC GGAGCTATCG GACTTTCTCG CGCAAGCCTA CGAAGACAAG GTCAAGGCCA TCGAGACGGT GGAGAAGCAG AATCAAGCCG AGATCGACGC CTTGAAAGCG CAGATCGAAT CGCTCCAGTC GCAGTCCAGC GGAGCAGCGT CCAAGCCCAG CGCTCCGCCC AAGATCACCG GAGATCTCGC CAAACTATCC AACGATGACT TGAAGAAGAA ACTTCAGGAG TACGAAGAGT TCATGGCCAA AACTATGGCC GATGCCGATC CCTCCATCAA GGCCGCGATA CAGAAGAAGC AAGCCGAAGC CGCACAACAG CAGCAACAAC AACAACAACA GTCGTCCACC ACTACCACAG CGACTGCTGG AGCGACTACC AGTGGCACTG GTGGTGTCTT GGGTACCGAG GATCTGCTTT CGGTGGGTGC CGTTTCGGCA CTCACCGCCG TCGCGACTTC GGTCATTCTC GACAACACGC GTCGCCAGAG TTTTGGTAGT ATCGTGGCCG GTGTCAGTGC GCCTCTCGTC GCCGGAAATT CGGCACCCGA ACCCACAGAA ACTCCAGCTC CAGCTCCCGC AACCGTCGCT CCCCCGGCAC CCGTGGTGGT CGATTTGTCC CAGTTTGAGG TACGTACTGT GGCGCAGTTG CGTGACTTTC CCTGTGTTAC TTTTACTGTT GTTGTTGGTC CTACCCGACA AAAGTAGCGT ACGGTAGTAA CGGTAAAACG GAATGGACGT ATCTGGACTC ACCCCGCCGG TTGCTCTGTC TCTTGTTACG CCACAGACCA CCATGGCCAA AGTACAACAA GCCTTTCCCG GAGCCGCCAC CAACGACCAG CTCGTGGCCA AGACCAAAAG TGCACTCTCT CGATTTGGAT TCGGTTCCAA TTCACTAGTC GCTACTTCCT TTTGCAGCGA CGAAGTCAAC CGTCCCCTCG AGACAGACTT TGCCAAGGAA TTCAAGGACA CCTTCAGTCT CGGAGGCCTC GCCGGGTTCC CCTTTTCGGG AGTCACCGGG TTCGGTGCCA TGGCCAAACA CATACCCGAC GGTGGCTCCT GCCTCGTCGT CTACGGACCC CACGTCGGAG TCGATCTAGA CGGCAACGTC GGAACCGTCA ACCGACGAGG CCGCGAAAAG GGAGGAACTT GCTGCGGCAG CGCCGTCGCC GCCGCCGGGT ACATCAGCAA AGTTTTCAAT GGCGAAGCCG ATCCGGCTCC CGCCGTCCCG GAAAGTTCCA TGGACGCCCA ACAACTCTAC GTCGGAAACA TGCTCCTGCC CTACGCCGAA CGCATCGGTA ACGCCCAGGA CGCCATGGTC GAACTACCCT ACGCTACCTA CGAACCCCTC GACGATCTTA TGCAGAAGAT TGTGGCCAAG GGATGCGGCA AGGTCGGCGG CGACGGTAAA ATTGCCCTGC TCGGAGGCTT GCAAATTAAT ACCCCTGCCG GGTGCCCCGA CTACTTTTTG CCCTTGCGCT 
TCGAAGTCCG GGACAATCAA AACAACGTTC TGGACAATCT ACTGTACGAA AAGCGAGCGT TGTTCTAACT GTGAGCAAAC GAGCCCAGTG TGTTTATCGT TTACGATTGG ACCGCCCTTG GTCCATAGTC CGTAGTTCCC GGCGCGTGGG ATGGGCGATT GACGACTCTG GATGGTCCCC TCGCTCAATC CCACGGCCGG AGTATAACAG TTCCCTCTAT TTCACTCGTA GTGTGCTTTG CTAGAGT
|
Protein sequence | MRICMRSQSY SSLVVASMVV CTAAFQTSVV SVRRHSPLSS TIDGVDRSEA VAKLERAAEL SDFLAQAYED KVKAIETVEK QNQAEIDALK AQIESLQSQS SGAASKPSAP PKITGDLAKL SNDDLKKKLQ EYEEFMAKTM ADADPSIKAA IQKKQAEAAQ QQQQQQQQSS TTTTATAGAT TSGTGGVLGT EDLLSVGAVS ALTAVATSVI LDNTRRQSFG SIVAGVSAPL VAGNSAPEPT ETPAPAPATV APPAPVVVDL SQFETTMAKV QQAFPGAATN DQLVAKTKSA LSRFGFGSNS LVATSFCSDE VNRPLETDFA KEFKDTFSLG GLAGFPFSGV TGFGAMAKHI PDGGSCLVVY GPHVGVDLDG NVGTVNRRGR EKGGTCCGSA VAAAGYISKV FNGEADPAPA VPESSMDAQQ LYVGNMLLPY AERIGNAQDA MVELPYATYE PLDDLMQKIV AKGCGKVGGD GKIALLGGLQ INTPAGCPDY FLPLRFEVRD NQNNVLDNLL YEKRALF