Gene Information |
Locus tag | PHATRDRAFT_21996 |
Symbol | |
ID | 7203106 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 375336 |
End bp | 376677 |
Gene Length | 1342 bp |
Protein Length | 417 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182210 |
Protein GI | 219123810 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CACGAACAAA GCAGTATCCC ATATTATTCG TACCGAAATG CAGGCTGTGG CTCCTCTGAC TGCTGTACAG ATTCCGTGTT GCCTTTGTGG CGTGCTTACT GCGCCGAACG CAGCGAATCA GTGCGCTTCG TGTCTGGCAC AGGAATTTGA TCTGAAAGGC CGCTTGCAGC GGGGTCCATC GGGGGCACCA TTTGCGACAA CTTACCAGTG CCGGGAATGC CGTCGGTTCC GACGGACTGA AAAGCACCAC GAACACGCCG GACCAGAATC CCCGGAGCTA CTAGCGATTT GCTTAAAAGC AATTCCGGCG CTCCAATCGA CGGCTGAACC CCGCTTACAT TTGATCGATG CTGGTTGGGT ATGGACGGAA CCACATTCCA TGCGGTGGAA GGTACGATTG ACGGTGAGGA CCGAAATTCA GGCCGTGACG GTCCAACAAC GTGTAGTCGT TGAACTCCAC AATGCCTTTC GGCAGTGCAA TGATTGCAAT CGTGAGTTCA CCAATCGTAC CTGGCAAGCG CTCGTCCAGC TGCGTCAAAA ACGCTCAGAC GATGCGCCCA AGAAGGGACT GACGGCCTTG GAAATGGCGC TGGCGAAGAA TAAGGAAATT CGAAAGCATG TACTCAAGAT CGATGCCGTA CGGAATGGTT TTGATTTCTA CTTTCTGTCG CTGTCCTACG CCCAAGCGTT CAGCGCTTAC CTACAACGAG TTGGTCCGAT GCGCGTCAAG ACCAGTAAGA AGCTCGTTTC ACAAGATTTT ACGAACAACA CAGCGAATAT GAAGTACACG GTAGTTTGTG ATTTGGTACC ATTTTGCAAA GACGACTTGG TTTTGATTAA GAAGGGCGCT AAAGGAAAAT TGTCGGGACG CTTAGCACTC GTGACGAAGG TGTCGAGTGT CGTACATTTG ATGGATTCAT CTCCGAAACG GGAAGCGTTG CTCGATAGTC AAATGGAGCT GTCGCCAGAC GCGTATTACA AACAAGAGAA GCTGTACACA ATTCTACAAG CGTCGAACCG AACTATCCCA TTTGTGGTTT TAGATGTTGA CTTGTGTCAG CACGATGGGG GCGCAATGGA TGAAAGTGGC CAGCCACTAT ATGCTGGAGT AGAAAATAGC GTGGAGAAGT ACTGTTTGGC CGATGTTCAG GTTGCTCGTC AGTCGGATTT TGGGGTCAAT GATGAAGTTT TCAATTGTGT CACGCATCTA GGCCATTTGA TCAGACCAGG TGATGTCGTC ATGGGATACG ATTTAGTTGC AACGGTGGGT GGTGACTGGG AGGTCGAAGA GTCCCTCCAC AACAGTTTTG TACTGCCGGA TGTTGTTTTA GTCAAAAAGA TC
|
Protein sequence | MQAVAPLTAV QIPCCLCGVL TAPNAANQCA SCLAQEFDLK GRLQRGPSGA PFATTYQCRE CRRFRRTEKH HEHAGPESPE LLAICLKAIP ALQSTAEPRL HLIDAGWVWT EPHSMRWKVR LTVRTEIQAV TVQQRVVVEL HNAFRQCNDC NREFTNRTWQ ALVQLRQKRS DDAPKKGLTA LEMALAKNKE IRKHVLKIDA VRNGFDFYFL SLSYAQAFSA YLQRVGPMRV KTSKKLVSQD FTNNTANMKY TVVCDLVPFC KDDLVLIKKG AKGKLSGRLA LVTKVSSVVH LMDSSPKREA LLDSQMELSP DAYYKQEKLY TILQASNRTI PFVVLDVDLL ENSVEKYCLA DVQVARQSDF GVNDEVFNCV THLGHLIRPG DVVMGYDLVA TVGGDWEVEE SLHNSFVLPD VVLVKKI
|
| |
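The fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: the helper names (`inclusive_length`, `strip_grouping`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the reported Gene Length of 1342 bp. Because the gene is spliced, the 1342 bp genomic span is not expected to equal three times the protein length.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end are both inclusive (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def strip_grouping(seq: str) -> str:
    """Remove the whitespace that splits a displayed sequence into 10-letter blocks."""
    return "".join(seq.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence, ignoring block spacing."""
    bases = strip_grouping(seq).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Coordinates taken from the Gene Information table.
    print(inclusive_length(375336, 376677))
    # First two blocks of the protein sequence, as an illustration only.
    print(len(strip_grouping("MQAVAPLTAV QIPCCLCGVL")))
```

Passing the full Gene sequence field to `gc_percent` gives a value to compare against the rounded GC content field, and `len(strip_grouping(...))` on the Protein sequence field can be compared against the Protein Length field.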