Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43728 |
Symbol | |
ID | 7197018 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 1301603 |
End bp | 1302773 |
Gene Length | 1171 bp |
Protein Length | 329 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | short-chain dehydrogenase/reductase |
Protein accession | XP_002178115 |
Protein GI | 219112727 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAACGA GCGATGCTGG AATAGGAACA TGCGAAGCTT CGCACGGATC GATGCCGCCG TTGACAATCT TAGCAATGAT CTTCGCCTTT CCCACCGTCT TTTGGTTTGT GGCGTATCTC ATTCCTCAAT TGTATATGGT AGTGCGCCCT GTTCCTGATT TAAAGAAAAG ATACAATGCT GAATGGGCTC TGGTGACGGG AGGTGGAAGT GGAATTGGCA AAGCGCTCGC ATTCAAGCTC GCAAGTCAGG GCTTGAACAT TGTTATTGTC TCTCTCGACG ACAATTTCCT TAAGTCGACA ATGAGAGAAC TCGCCGAAAT GTACCCGAAG CAGTCGTTTC GCGCAGCCGG TGTAACTTTC TCGCCTGGCG TCGACTACAT GGCCAAGATC AACGCAGCAA CCAAGGACAT TGACGTTCCA ATCGTTTTTA ACAATGCTGG GTTCATGGTT ACCGGATTTC TCGATCAAAC TCCGATTGGA AAGCTCCTGG CAAACATTGA ATGCAATGCT ACAGCTTCAG TGAATATTTC GCATTTCTTT GTGCAGAAGC TTGTCGCAAA AAAAGCAAAA GGTTGCATTG TTTTTACAAG TAGTGTTGCT GGTTTTATAC CGACTCCATT TGCAGCAATG TATGCAAGTA CCAAAGCCTT TGTCTCGCAG TTTGCTGCGT GCTTACATAT CGAAGTGCAG TCTCTTGGTA TTGATGTATG TGCAATCCAT CCGTCTCCTG TCAAGAGCCA GTTTTACACG AACTTGGACC ACAAAGTCGA TATGATTGAG AAGGCGGCAA CCAGTGCAGT GTCACCAGAC GAGAAATTAG CAGACACGAT GCTAAAGAGT ATTGGTGTAT GCGCGTTGCG TGACATGGGA GGTCGGTACC AGTGCTTTGC CAGTCATAGT ATTGGATGTT CAACAACTGA CATTCTTCTC ACCACTTTGC AGGATTGGCG TGGGGAACGC GGATGGGCAC CTTTTTCTTG CCGTACAACT TTTTTGCCGT AGCCTTTGCA AATGCTGCAC CGTTTATGCC CGACTGGAAG ACTCACAACA AGCATCGTTA AAAAGTATTA CTACATATCC TCTACTTGTA ATTGCCGGAC GGACTTGTCC AACAGCTGCC TCCAACTAGA TAATGTTTGT TTCTATTTGC AAGACAGAAC CACACGCCTT C
|
Protein sequence | MSTSDAGIGT CEASHGSMPP LTILAMIFAF PTVFWFVAYL IPQLYMVVRP VPDLKKRYNA EWALVTGGGS GIGKALAFKL ASQGLNIVIV SLDDNFLKST MRELAEMYPK QSFRAAGVTF SPGVDYMAKI NAATKDIDVP IVFNNAGFMV TGFLDQTPIG KLLANIECNA TASVNISHFF VQKLVAKKAK GCIVFTSSVA GFIPTPFAAM YASTKAFVSQ FAACLHIEVQ SLGIDVCAIH PSPVKSQFYT NLDHKVDMIE KAATSAVSPD EKLADTMLKS IGVCALRDMG GLAWGTRMGT FFLPYNFFAV AFANAAPFMP DWKTHNKHR
|
| |