Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39117 |
Symbol | |
ID | 7194877 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 370582 |
End bp | 372582 |
Gene Length | 2001 bp |
Protein Length | 587 aa |
Translation table | |
GC content | 45% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183083 |
Protein GI | 219125639 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.294419 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGATTTC CTTGTTTGCT CCTTGTTGTT TCGGCTGTAT TGTCTCTAAG CAAGGCGGAA GAACAACAGA AAAGTAACGG TACAAGGCGG ACCCGTCAGT TAAAACTCAG CGAAGCAGAA TCATTTCTTC ATGCTGAACA GTTACTGTGG TCGGGATCGG GACGACATAC CCAAGAGGAT GCTTTTCTCA GCTTTGAAAC CTCTTCCCCG ACTCCAATCA CTTCAGTCTC GTTTACTGAT TTTCCATCCA TAGCTCCGGT CGCTAATTCT GCGCCGACTG TTTCATCATC TAAATATCCC ACTCAGGTTC CGATCCGATC GACTGATCCA TATCCCGGAA CGTCTGCCAC AGAGCAACCG ACAAACTTTA TCTGCGACGG TTTTAATCGT TCAGATATTC TTCTCGGACT TTTGCAATCA GTAACTTCAG AGGATCTTTT GCTAAACAGA TCCCTGCCTC AAGGCCAAGC CTTTTTGTGG CTACTCGACA CTGATATTAC AACAGACCCT TGCATATACC CATCTGTTAA GCAACGATAC TCTGTTGCAG TGATCTATTT TGCATTAGGA GGCAACAATG GGGCGAGTTG GATTGAAAGT ACGGGCTGGT TGTCGTCCAT GGAGGAATGC GACTGGGCAC GCGTGAACTG TGACGAAAAA GGCGGAGTCA CTGGCCTCCA ACTAGGTAGG TACTCAAACT AGAAGGAATG CCGTCTTAGC AGATTCAAAC TGATGCTTGG AAACTGTACT GAACAGGTAG AAACAACCTC ACGGGGATGA TTCCGAAAGA AATTTCAGAA TTGACTGCGT TGCAGGCGTT AGTTATGAAC GACAACACCA TAGAGGGTCC GCTTCCAGAG ACCATTGGCA GTTTACGTAA TCTGACAGAT CTAGACCTGG AAGACAATTT TCTAGAAGGT AACCCTTTGC TCACGCTATC TTCACTCAAG AAGCTACGCA GTCTTCGATT ATCATTCAAC TCTTTCGATG GTTCGGTACC TTCATCAATT GGTGGCTGGA ATGAGCTGCA GGAGATGTGG ATGGCTGGCA ACTTTTTCAG AGGAGAACTT CCTACGGAAA TCGGCTTGAT TGGGAATCAA TTATGTAGGT CGTTGCTTCG TCCGGTTTTA AAAATTATCC TTCCCTTGGA AACCTCACAT TTTGTCTTGA TGTCGATTTA GCATCCCTCT TTATTTACGA AAACGAATTG GAGGGTACGC TCCCGTCCGA GCTCGGCAAC TTGGGACTTT CTGAATTCCT GGGTCAGGCA AATATGTTCG AAGGCAGTAT TCCCCAGGAG CTGTTTAGAA ACTTTGATCT TGTGGTACTA CGACTTGACC AAAACAAGCT TACGGGAACT GTGAGCAACG CGATTGGTGG TTTGAACAAC CTCCAAGATC TACGTTTAAA CCTTAACTCA TTGTCTGGAA ATCTTCCAAT ATTGCTCTAT GGGCTGAGCA ATATACGTAA GTTCTTCCAG GCCGCCGTCA AAGGCGGCCC GGATTATTCC AAATCTTCTC TCCTTTTGTA ATCCTCACTC TCCTACCTAC CCCAGAAAAT CTGCTTTTGA GCAACAATCG CTTCGATGGA CAGATTCGCA ATGCGTTCGG AAACTGGAAT GCTTTGGATT TCGCAGATTT TGCACAGAAT AGGTTCACAG GGTTTATCCC ACCAAGCTTA TTCGAAGCCG AATCGCTTCG AATTCTATAC CTTAACAACA ACCTGCTGCA AGGACCAATC CCCTTAAATT TCGGCAAACC TCGAAAGCTC CGTGATCTTT ATTTGAACAG CAATATTTTG ACCGGAGAGA TTCCTTCAAT CCCTACAGGA AGTTTGTTGA ACTTATCTGA GTTTCTTTTG CAAGACAACC AACTCCAAGG CATCACAATG CCTCCTTCTG TTTGTTCTTT GATCGAGCAA GATGGCGAAC TAGAGGATCT GTGGGCTGAC TGCCTTGATA CTGATGATGT TGATTGTCAA TGTTGTACCC AATGCTTTTA A
|
Protein sequence | MRFPCLLLVV SAVLSLSKAE EQQKSNGTRR TRQLKLSEAE SFLHAEQLLW SGSGRHTQED AFLSFETSSP TPITSVSFTD FPSIAPVANS APTVSSSKYP TQVPIRSTDP YPGTSATEQP TNFICDGFNR SDILLGLLQS VTSEDLLLNR SLPQGQAFLW LLDTDITTDP CIYPSVKQRY SVAVIYFALG GNNGASWIES TGWLSSMEEC DWARVNCDEK GGVTGLQLGR NNLTGMIPKE ISELTALQAL VMNDNTIEGP LPETIGSLRN LTDLDLEDNF LEGNPLLTLS SLKKLRSLRL SFNSFDGSVP SSIGGWNELQ EMWMAGNFFR GELPTEIGLI GNQLSSLFIY ENELEGTLPS ELGNLGLSEF LGQANMFEGS IPQELFRNFD LVVLRLDQNK LTGTVSNAIG GLNNLQDLRL NLNSLSGNLP ILLYGLSNIQ NLLLSNNRFD GQIRNAFGNW NALDFADFAQ NRFTGFIPPS LFEAESLRIL YLNNNLLQGP IPLNFGKPRK LRDLYLNSNI LTGEIPSIPT GSLLNLSEFL LQDNQLQGIT MPPSVCSLIE QDGELEDLWA DCLDTDDVDC QCCTQCF
|
| |