Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44766 |
Symbol | |
ID | 7199883 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 178915 |
End bp | 180736 |
Gene Length | 1822 bp |
Protein Length | 452 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178720 |
Protein GI | 219115850 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TACCGACTCG CAACGGATCT TCCAAGACGT TCGATACCCA TTATTTTTTC TCTCTTTCAC TCGACTCACT CACCTTGGGA ACAAGAACTA CACACGCGAC ACCATGAGAT TCCTTGCGCC ACGATCGGAC GTTGGCCCGG GGACGCGGAT ATCGCGTCGA TTGCTGTTGC TGTCCGTCAC GCTCGGGCTG CAATCCGCCG TGCCGGTCAC CAACGCGTCC GTTCAATCCA TCGACTTGCG GGCCGAGCAT CTCTTGTTCT GGTCCGGACT CGACTGCGCC GTCGATGCGG CACTGCACAA CATGGGGCCC TTGTTGGATG ACCTCTTACT CCGCGGCGGT GATCAACCAA CGGAAACGAC GAGCGAAACA ACGTGTGTTT CGGAACGACC AATGGGTTCC TCCCCCGTAT ACAATAACCA TACCTTGGAT GACCCGCCGT CGTCGATGTC CACAACGATG GAGACTCTGG ACGACAACCG CGAGACCAGC GTGGCAACGG TCAATGCCCC GGAGAACACA ACCACGCTTT CGTTGAAAGA ACGCGTTCGT AACATTCCGG TGAGCGACAA ATCCCGCATG CAGTTTAGCC TCTTTCAACC AGGAGATGGT AGTAGAGAAG ATCCGGATGG AATTCCCACA CGGTATCTCA AAATGCAAAA GGGTGATCGG GAGTTGGCAG CCAAGGCCTT GGAGGCTACG CTGGACTGGC GGGACGAGCA CGCCATCGAC ACCATTCTCA AGATTCCTCA CCGTAAATTC GAAATTTGCA AACAAGTATT TCCGCATTAT TTTGTGGGAC GTGATAAAGA CGATCACGTT GTTTTTGTCC AGAGACCGGC CATGTTGGAT CTGGAAAAAG CCAAGGCCAA CGGACTCACC AACGAAGAAC TATTGCTCCA CTACGTCTAC GTCAACGAAT TTTTGTGGCA ATACCTTGAA GCCGATTCCC CTCTCGGGAC CATGACCAGT GTCATTGATT TGCAAGGTCT GCATTTAGGG GTTCTGCGAC AATCGGACAT TATCTCCTTT TTGAAAAAGT TTGTCATGAC CATGGATGCG CACTTTCCGC AGAGGTCCCA CAAGACGTTA ATTTTGAACG CCCCCAAATG GTTTCACATG CTCTACAAAC TCATTTCACC CCTCTTGCGC GATACCACGA AAGCCAAGAT TGAAATACAC TCGCGCAGCA AAAAGCAAGA TGCCGTCTTG AAGGATTACT TGGGCGAAGA CGCGGCCAAA AAACTACCTC CTTCGTTCTG GAGTAAGAAG CATACTAAAC GACAATCCAG GCACCGTCGT GGTCACAACG AAGAGCATAG TCTCGATGCG GATGACGATG GGAGTGGGAG TGCAGTTCCT AGTGATGATC CGACCGAAGT GTCGGAAATG GAAGAAGCCT TACGGTCTTT TGTAAGTATA ATGTTACTTT GAAAGAACAT TAGTGCGATC CGCTGAGCTA ACATGAACTG CAATGCGCTG TGTCGCCTCT CTTGATCTTT CAGACGCTTG CTCGCATTCA AGAAAATGGC CAGGAATTGG CGGCGATCGT ATAGAAACAA TTGGTCACGT CTTTTTCGAA CGCAGACGTG GGCTCCTCGG GGGACCATTT CGGGTTGGGG AGATCCATTG TGCCCGTCCT CCCTCTTCAT ATTGCTACTA ACGGCACACT TTATAGCGGC GCATTTTTGT GCAAATGCTT GAGTGTCGAG CATCCGCTTC CGCAAGGTAG AAAGTCAAAA TTTTATGGAC GACCGCTTCT ACGACCATAG CTACAGTCCC TGTGGGTGCT ATAATGCGTA GTTGTTGCAC AAATTTTATT GG
|
Protein sequence | MRFLAPRSDV GPGTRISRRL LLLSVTLGLQ SAVPVTNASV QSIDLRAEHL LFWSGLDCAV DAALHNMGPL LDDLLLRGGD QPTETTSETT CVSERPMGSS PVYNNHTLDD PPSSMSTTME TLDDNRETSV ATVNAPENTT TLSLKERVRN IPVSDKSRMQ FSLFQPGDGS REDPDGIPTR YLKMQKGDRE LAAKALEATL DWRDEHAIDT ILKIPHRKFE ICKQVFPHYF VGRDKDDHVV FVQRPAMLDL EKAKANGLTN EELLLHYVYV NEFLWQYLEA DSPLGTMTSV IDLQGLHLGV LRQSDIISFL KKFVMTMDAH FPQRSHKTLI LNAPKWFHML YKLISPLLRD TTKAKIEIHS RSKKQDAVLK DYLGEDAAKK LPPSFWSKKH TKRQSRHRRG HNEEHSLDAD DDGSGSAVPS DDPTEVSEME EALRSFTLAR IQENGQELAA IV
|
| |