Gene Information |
Locus tag | PHATRDRAFT_47965 |
Symbol | |
ID | 7203206 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | - |
Start bp | 567638 |
End bp | 569208 |
Gene Length | 1571 bp |
Protein Length | 465 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182422 |
Protein GI | 219124252 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.91941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTTGTGA ATGCCGTTAA AGATTGTCGT TTGAAAAAAA AAACGATTTT TTCTGTCAAT TGCTATGGAG ACCTGAAAAG TTGACGGGAG TACATTGAAC GAGGTTGTCG ATTCAAACAC AAAAGCGTGT TATGAGATCA ACGACCATGT CGCAAGGATT TCGCTTAGCT CTGGTGTGCT ACACTGCCGT CTTCACACCC TCTTTCCAGT TCCAAGGAAA GGGACAGCTC GCATTCCGGC GACGAGGCCC TCAATCTCTC CCTACCCCCA TCGGCAACAG CCTGCGTGCG GCAGGAAGCG ACGACAACGA TGCTCCCTCT TCAGACAAAA ACTTGGACAA GAACCAAGGT ATGTCGTGGA CCGAGAGCTT TAATGCTCGG AAAGAGGCAT TGCGAGAGGA AAAACTAGCT CAATTGAAGA AATGGTGCAA AGCGGATTGC AGTTCTGCCG TTTCGGTGAC TTTACCCGAT TGGATCCGTC GCCTAGACGT CGCCGAATGG CCTTTCGCCG CTTGCGGCAG CTCATCGGGA TCTGTCTATA TAGCCAATCT AGAAACAGGC AACTTGATTG CGAGTAACGT TGTGCAAAAG GAAAACGATT CGGCGCACCA GAAGGGTGAT GCTGTTACTC CAATCGGTTT GGAAGAAACT CTACGACTCT TGTACGGAGA TCACGATGGG GGTGGTACTT TTGCTATGAC TTTCTCGGGA AACTTGATTT GCGAAGCAGG GCGGAGTGGT GGCGTAAATC TTTGGCGTCT AGATTCTTCG TCCCAACATC TTGTTTCGCA AGGCAGCATG ATGGCTGTAC AAAACAAGTT GGTGACTTGC TTGGAGTTGG ACGACGATTA TTTATGGGTC GGTACCTCCG ATGGTCTGGT CCAAGCCTTC GCGTTGGACC ACGAACTCCC GTTGGCTCTC CAGTCCAGTC CAGAATTGAA ATGGGACTTT GGATCACCCG TTCTCTCGCT CTCTCTACTT CCTGATATTG GTTGTGGCGT GGTATCGACA GTCAGCGGTG TTCGGCTCTT TTCTATGGAA GATGACGAAG AGGCCACACC AATTATGCAG CCACCATTCG ATATTAATAG ACAAGAATCT TCCGTCACTT TTGCGCTCTG CTCCACCATC GTATGCAGTA CCGAAAACGA TGAACGGACA TTTTCTGTAG CCTGCGGAGG CAATGACGGC AGCTTATTTT TGCAACCGCT CAGTATGCAG AGCCACGATG AAGTCGACTG GAAGAAACCT TTTGTCCAGC CGGTGTGGCA ATTGAAACCC CGTCATTCTG GAGCAGTCCA ATGCATGACG AGCCCGGCAC CGGGGCTTCT CGTCACGGCT AGCCAAGACG GAACAATGCG AGTTTGGGAC ATTGCGCAGA GGAACTGCAT GTACCAGTTT ATTGGCTACA AGGTATGGCT AGGTAGTGTA TGGAGTGACG GCGTGCGTCT CATCAGTGAC GGTAGTGACA ATACTGTCAT TGCGCACAAC TTTGACCCCG CAGTGTCCGA AAAGACAGAG CTGGAATAAT CCAACGGTTG TAACATAACC ATAGATACTG TTATATATAC G
|
Protein sequence | MRSTTMSQGF RLALVCYTAV FTPSFQFQGK GQLAFRRRGP QSLPTPIGNS LRAAGSDDND APSSDKNLDK NQGMSWTESF NARKEALREE KLAQLKKWCK ADCSSAVSVT LPDWIRRLDV AEWPFAACGS SSGSVYIANL ETGNLIASNV VQKENDSAHQ KGDAVTPIGL EETLRLLYGD HDGGGTFAMT FSGNLICEAG RSGGVNLWRL DSSSQHLVSQ GSMMAVQNKL VTCLELDDDY LWVGTSDGLV QAFALDHELP LALQSSPELK WDFGSPVLSL SLLPDIGCGV VSTVSGVRLF SMEDDEEATP IMQPPFDINR QESSVTFALC STIVCSTEND ERTFSVACGG NDGSLFLQPL SMQSHDEVDW KKPFVQPVWQ LKPRHSGAVQ CMTSPAPGLL VTASQDGTMR VWDIAQRNCM YQFIGYKVWL GSVWSDGVRL ISDGSDNTVI AHNFDPAVSE KTELE