Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48566 |
Symbol | |
ID | 7194792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 229192 |
End bp | 230582 |
Gene Length | 1391 bp |
Protein Length | 460 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183183 |
Protein GI | 219125847 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00129954 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCTGGACCAT GAAAAGGCGG AATGTTTGCT TCTCCTTTTC TTTGCTTGGT CTAGTCCTGG TTTCAATGTT GATTGTTAGA CAACACCGCC ATTACGTCAA ACAAGCTGCT TCCGGTAAGC TCTTGTCGAA ACGAGGAGGC GACAGGCCTC ATCACTTCAA TGTCTCTCAT TTGCCAAATG CAAATCGATC GGGTGCTAGA CAATCAAGGA CGAACACGAT TCAGCTTTAC CATGACCACG CTCCTAACTC GCGAACTGTT TTACTTCCGG CTGTACCGAG CTCCACATCA GAGGAGAGGG TGCGCTTCGC CATCGGAAAT TATTCGGATG CCTCGAATCC ACTCATCAAC TGTAATATCA CATCACAGGT CCTTTTGCAC ACGGACCAAA TGCCCTGGAT AATGGAAAGT CTCGATAGCA CAGGATCGAA AAAGACGGTC GGAGGTGACG AATTCTACGT TGTCTATAAA GATTCTAATG AAGAGTGGGA TGCAACCGCT GTGGCATGGA TTCAAGACCT GCACAATGGT TCATATGGGC TTGATTTCGT AACCGTACCT ATCAAAGAAA CCTTCAGTAA TTTGACGGGC CATGGAAATT TGACCATATA CTTCCAGTAT TCGTGTGGTA TCGGGTTTAC CCCACAACCT CTAAAAGACA ACTGGAAAGA CAACGGATCC ACCTGTTTGG AAACAACCGC GCATGTTAAG GCCCCTAGGA TTCGAGTTTT CCAGGCTCCA ACTATATCTC CTAATTTCTC CGAGTTTTCT AGAATGGTAT CCTTCGGAGA CTCACTTTTA CAGCAGCTTG TATTCGGATA TACAAGATTC TTTCGAAAAG ATGTGTTTTG GAAGGCCAAT GTCAATTCGG AGCTTACAAG CGAAACACTC GACAAGCGTT TCATACCTAA GCTTCGTGAG TGGCACGGCG GCGATCTGCA AGACCCCACC GTGGGCCTTG TAATTGGGTC TTCTGTTTGG GACATTCTTC AAAATCAGAA TATTCAGAAG GCTGATTTTC GGGATCACTT ACAAGCTTGT CGGCTGTATA TTGCCCGTGT TCAAAAAGAA TTTCCCAATA CGACAGTTTT TTGGAAATCG CCTTCTACCT TGCATGTACA TAGAATGCTT CCAGGATGTA ACAAAGATCG GAGGTGCGCA GAGAGAGTGC GTTACATGAG TGCTTCCCGA TCTGAATTTC TGTATCGTGG ACAAAAACAA ATAATGGCCG AGCTGCGCAT CCCCTTTCTG GATCTCTATG AAGGATATTT CCTGTCGGCG CCTTACACGG TGCGCGGTGA CGGACGTCAC TATCAACCCA AATATGACCG TCGAATGTTG CGCTGGCTGT ACAGCAATAA GACTGTGACT CCCAGCATAT TTCTGGAGTG A
|
Protein sequence | MKRRNVCFSF SLLGLVLVSM LIVRQHRHYV KQAASGKLLS KRGGDRPHHF NVSHLPNANR SGARQSRTNT IQLYHDHAPN SRTVLLPAVP SSTSEERVRF AIGNYSDASN PLINCNITSQ VLLHTDQMPW IMESLDSTGS KKTVGGDEFY VVYKDSNEEW DATAVAWIQD LHNGSYGLDF VTVPIKETFS NLTGHGNLTI YFQYSCGIGF TPQPLKDNWK DNGSTCLETT AHVKAPRIRV FQAPTISPNF SEFSRMVSFG DSLLQQLVFG YTRFFRKDVF WKANVNSELT SETLDKRFIP KLREWHGGDL QDPTVGLVIG SSVWDILQNQ NIQKADFRDH LQACRLYIAR VQKEFPNTTV FWKSPSTLHV HRMLPGCNKD RRCAERVRYM SASRSEFLYR GQKQIMAELR IPFLDLYEGY FLSAPYTVRG DGRHYQPKYD RRMLRWLYSN KTVTPSIFLE
|
| |