Gene Information |
Locus tag | PHATRDRAFT_37067 |
Symbol | |
ID | 7202223 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 96761 |
End bp | 97925 |
Gene Length | 1165 bp |
Protein Length | 363 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181298 |
Protein GI | 219121906 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACTGC AGACGGCAAT TGTGGGTCTT CCCAATGTCG GAAAATCTAC TTTATTCAAC GCACTGACAG AAACTCAAGG TGCTGAAGCT GCCAATTATC CGTTCTGCAC CATAGAACCC AATGTGAGTT TTGTTAGTCA AATTGAAACC TTGCCTGGCC CATTATGAAC CATCTGACTT ATGTTTTGTG ACACAGGTGG GAATCGTAAG TGTACCTGAT CCCAAACTGG AAATCTTGAA AGACATCAAC AAATCGGTAA AAGTTGTCCC AGCCGCATTG GAATTCGTTG ACGTCGCTGG ACTAATCAAG GGTGCCTCTA CAGGTGAAGG CCTTGGCAAC CAGTTTCTGG CATCTATTAG ACAGTGTGAC GCAGTTGTAC ACGTCGTTCG GTGCTTTGAA GACGAAGATG TTGTCCACGT CGATGGAAGC ATTGATCCCG TTCGTGACGC CGAACTCATC AATCTTGAGT TGGCTCTAGC CGACTTGTCT CAGGTCGAAA AGCGCTTAGA GCGCGTTCGA AAAGATCGAA ATGCCAGCCC TGTTGAGAAG TCCGCATTAG AGAAGGTTGC CGCTGTCTTG CTGAAAGATC AACCTGCTCG CAATGTTGTG TTGGACGAAG AGGAAGAAAA CGCTATTAAG TCTCTCGGTC TATTGACAGC GAAAAAGATG GTGTACGCTG CCAATGTTGG CGATGCGGAT TTGGCAAACG GGAATGAGAT GGTTGACAGA CTTCGTGAAA CTGCTGAGAA AGAGGGTGCA AAACTAGTTG TCGTATCGGC TCAAGTCGAA GCGGAGCTTG TTGAACTAAG CGTCGGAGAC CGTACTGATT TTCTTGAATC GCTGGGTGTC AAGCTAGAAG ACACTGGGCT TCGAAAACTT GTTCGTGAAG CTTACGATAT CCTGGGATTA CAAACGTACT ACACCAGCGG CCCGACTGAG ACAAGAGCTT GGACTATCCG CAAGGGATGG ACAGCACCGA AAGCTGCCGG CGTGATTCAT AACGATTTCG AGCGAGGTTT CATCCGTGCA GAGACCGTTT CCTATGACGA TTTAGTTGCT TGTGGCAGCG AGATTGAAGC GAAGAACAAA GGGAAGCTAC GCAGCGAGGG GAAAGACTAC ATTGTCCAAG AAGGAGATGT GATCTTGTTT CGCTTCAACG TGTAA
Protein sequence | MKLQTAIVGL PNVGKSTLFN ALTETQGAEA ANYPFCTIEP NVGIVSVPDP KLEILKDINK SVKVVPAALE FVDVAGLIKG ASTGEGLGNQ FLASIRQCDA VVHVVRCFED EDVVHVDGSI DPVRDAELIN LELALADLSQ VEKRLERVRK DRNASPVEKS ALEKVAAVLL KDQPARNVVL DEEEENAIKS LGLLTAKKMV YAANVGDADL ANGNEMVDRL RETAEKEGAK LVVVSAQVEA ELVELSVGDR TDFLESLGVK LEDTGLRKLV REAYDILGLQ TYYTSGPTET RAWTIRKGWT APKAAGVIHN DFERGFIRAE TVSYDDLVAC GSEIEAKNKG KLRSEGKDYI VQEGDVILFR FNV
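
The coordinates and lengths listed under Gene Information can be cross-checked with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the coding sequence is one codon per residue plus a single stop codon, and that the gene span contains no untranslated regions. The variable names are hypothetical.

```python
# Values copied from the Gene Information rows of this record.
start_bp = 96761
end_bp = 97925
protein_length_aa = 363

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: coding length is three bases per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Assumption: no UTRs in the span, so the remainder would be intronic sequence.
noncoding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, noncoding_bp)
```

Under these assumptions the span matches the listed Gene Length of 1165 bp, and the 73 bp not accounted for by the 363 aa protein is consistent with the record marking the gene as spliced.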