Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39660 |
Symbol | |
ID | 7195288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011688 |
Strand | - |
Start bp | 367968 |
End bp | 369158 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183735 |
Protein GI | 219127005 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGTCT TTCGAATTTG TTTCTTGCTC GCCTTGAGCA GCTCATCCTT ACTGTGTCAA GGCACAGATT GTGCGGCTAC TTCTTTCCAG TCAGTTCGCC GAGACTACCA CGATGACCAA ATGTGCGATC GCGGTACCTT CAACAAGTTC AAGGAAAGTT TAAACCGCAA TTTTCAGTAC CTTCAACTCA AGCAACTTGG GAACGCCTTG GTTCCCGACC AGAAGACAGT TGTTTGCCCT AGCCACGTGC ACAATCTTCA CAACGAGAAC GACATCATCG GCTTTGATTC CGTCTGGATA GTGGAAAACA CCGCCTCGTC CCCAATTGTG CTGAGCTTCG TGCACGGCGA TGGCCGAGAA AGCTCAGCAC TTAACCCCAA GATCAGTCCA GCGCAAGTAG ACCCCCGCGC AGTCGTCATG CCGGGGGACT ACCGCGCTGT CAATACATTT GAGGGTCACG TTTTTCATGC CCGCGAAATG CTTCCAGATG GTGGTGCCGG TAGAGTGTTG TTGCAACATC GTGTGGGGTA TATCCCTATA GGATTGAATC AATCAAACGC GGCGTGCTCC GGCCAAGATC TAGAACCCGT GATCACCGAC GCATTAACCG ATGAGACCAG AATAGCTCCT GAATATGCCC GAACACCTCC TAAGCCTTTC CTTGACTGCA ACGCCCTCCA TGTCGGTTTC CGGAACAAGG TAGGTTGTCC GGTGCACGGC TTTTTCGTAG AAGCCACAGA AAATGACGAC TGTCATGAAA ATTTTAAGTT TCATTTGGGA GTCAATCCGA TGACGGATGA TTTCATGTGG AGCTGGGATT CTCCTACCAA GTTTGAAACT TCTTACATTG GACACACGTT TGCCTTTCGC CTCGCCGATC GTCCTGGTGT TCTGGTCGAC AAGGTGACGC TCGGACCCAC ACAAATTTCC GATTGTCCGG GCCTGGCCCA AAGCTTCGCC ATCCCAATTG GAGCAGATGG TCAGCTCCTG CCAGTTGCTC GCATGCTTTG GGATGCCAGC CATAAGACCT CTGTCTACCA CGTGAACAAT CCTGGGCTTT ACCGCCACTC TAACTCCACA GCGAGTTCTG TACTCGTCAA CACGAACGCA TCGCTACCCT CGGCTGCTCG GTGCGCCGGT GCCAGCTCGG TTGCGAGGGA ACGTAGTCCT CTATTCACGC TCACAATCTA G
|
Protein sequence | MMVFRICFLL ALSSSSLLCQ GTDCAATSFQ SVRRDYHDDQ MCDRGTFNKF KESLNRNFQY LQLKQLGNAL VPDQKTVVCP SHVHNLHNEN DIIGFDSVWI VENTASSPIV LSFVHGDGRE SSALNPKISP AQVDPRAVVM PGDYRAVNTF EGHVFHAREM LPDGGAGRVL LQHRVGYIPI GLNQSNAACS GQDLEPVITD ALTDETRIAP EYARTPPKPF LDCNALHVGF RNKVGCPVHG FFVEATENDD CHENFKFHLG VNPMTDDFMW SWDSPTKFET SYIGHTFAFR LADRPGVLVD KVTLGPTQIS DCPGLAQSFA IPIGADGQLL PVARMLWDAS HKTSVYHVNN PGLYRHSNST ASSVLVNTNA SLPSAARCAG ASSVARERSP LFTLTI
|
| |