Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43594 |
Symbol | |
ID | 7197318 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011670 |
Strand | - |
Start bp | 907679 |
End bp | 909649 |
Gene Length | 1971 bp |
Protein Length | 553 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178030 |
Protein GI | 219112557 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATCGC AGCTCAGGTA CCATGACTGG ACCGTACACG GCAGTCGATC GAGTTTGGCG GCGGGTCCCC CCTGGAGCGT GACCGGACTG ATTCGGAGTC GCGGAAAGCA CCCCACGTTC ACGCCCCCCT CTCCAATTTC TCCACTTCAC TGTGCGAACG CGGTAGAAAG GGTAAGCCAC AGAATAGGCT ACGTATTTGT GACTAACAAA ACAGCAGTAC CATCCATGGT ACTACCATGT GGCACGCCGT AAACGCGAAT GCAAAGAATT TCGAACGCGA GTCACTCAGA TCGTCGAGTC GTTCTCACCA CCGCATCCGC ACTGGCATGT CCGTCGAAAC TAGTCTCTGA TTCCTCAGCT CTTGCCTCCA CACTACATTC ATAGACACAC GTATTGTTGC ATCCTTGCAC TCTTTACTGG TGACAGCATC CATTCATTCT TTCCGTCGTC GTCGTCGCTG CACCTCATTG CCATGCCACG TTCGTACGCT TCGCTCCCGG GCGTTTCCGG GATGTTGCTA GTCATTCTGA TCGCGTCGGA TCGCTCGTTG CAATCCACAG CGTGGCAATT AAATACAAAT CTGGAACCTA GCGCCACCGG GTCGACCCTG CACCGGCGTC ATGCGATTGA ACGCATGGTG TCGGGTGCTG CGGTGCTAGG TGGATTGGGC GGGGGACTCG GGACGGTACA CGCCGCCGTA CCGTCAAACG GACTCACCTT TGACTCGTAC CGGGTCGTTC CCGATTCCAC TGCCGCACTT AATCCTAGTC TGTTGCCCAT TCAGGTTCGT ACCTGTCCTT CGTTATTGAG TAGAGGAGAC CCGGTGTCAG TGTTGTTGTT GTTGTTGTTC GGCGAAGCAC CATCTCACGC TTTTCGCTCG GCCATCGTCG TCGTTCTATT GTTGAATTCC TTGTACAGAA GGCGGATTTT CTGCAAACAA TATCGTCGCG CAACGGCGGA GCCTTGTGGT TGGGCGAGCA CCACAACTCC GTCAAGGACC ACAATTTGCA AGTCGACATT CTCCGCCAAG TGCATCAACT CCGCCAAGCC ACCGGGTCCC CCACAGCGGT AGGACTGGAA CAGGTACAGA TTAAGTTTCA GCCTGTTCTG AACGACTACC TGGCCGGGAA GATATCCGCC GCCGAAATGC GTCAACGCGT TGAATGGGAC ACGCGCTGGA TGTGGCCGTT CGAAGTGTAC GAGCCCGTTT TTGCCACGGC CAAGGAATTG CGTATGCCTC TAGTGGCACT CAACGTCAAT TCAGAAGATT TGGTACTCGT CGAAAAAGGA GGTCTACCGG GGTTGCCGAG TGAACGACTC CGGCAGTATA TTAGTGACGC GTACGTTGAT AGCGTGGTCG GAGATTAAAA TCCGTGTACG ATTGGGTTTT TTCTCGTTCC CATCCATTGC TAATTTTGTT ATTCTCTGCT ACAACACTAC AGACCTGGTT TTGCAGCCTT TGCCAAGCCT CGTGAATTCG GAACCTATGT CGACTACGTT ATCCGACCCT CCTACGATCT ACATGAAGCA ATGGGTCTGC TCAAGTACAG CATGTCGGGG GAAAAGTTGG ATGAGCCCAT GCCCTTTCGC AATTTCTTCA GCGGGAGAAT TTTGTGGGAC GAAGCTATGG CGAACGCCGC CTACTCCTGG ACCAAGGCGA ATCCCGGTGG ACTCCTCGTG GGTTTGGTAG GGGCGGATCA CGTCAAGTTT CGCAACGGAA TTCCGGGGCG ATACGCCCGG CTTGCGCCGA ATGACGCCGC GTGTGTTTCA GTTCTGCTGA ACCCGACATT GATTGATACG CGACCGTCGG GCACGGTAGG CATGGAGGGT GCCGTTTCGG ATCGTCCGGA AACCATTACT CTGCAAATCC GTTATTTGAA AGATGACGTA CAATTTGATT CCCCGGAACG AACCTTGCCA TCGTCAACGG GTGGTGTCCT GGCTCTCGCC GATTACTTGG TGGTAGGTTG A
|
Protein sequence | MISQLRYHDW TVHGSRSSLA AGPPWSVTGL IRSRGKHPTF TPPSPISPLH CANAVERQYH PWYYHVARRK RECKEFRTRV TQIVESFSPP HPHWHLLPPH YIHRHTYCCI LALFTGDSIH SFFPSSSSLH LIAMPRSYAS LPGVSGMLLV ILIASDRSLQ STAWQLNTNL EPSATGSTLH RRHAIERMVS GAAVLGGLGG GLGTVHAAVP SNGLTFDSYR VVPDSTAALN PSLLPIQKAD FLQTISSRNG GALWLGEHHN SVKDHNLQVD ILRQVHQLRQ ATGSPTAVGL EQVQIKFQPV LNDYLAGKIS AAEMRQRVEW DTRWMWPFEV YEPVFATAKE LRMPLVALNV NSEDLVLVEK GGLPGLPSER LRQYISDAPG FAAFAKPREF GTYVDYVIRP SYDLHEAMGL LKYSMSGEKL DEPMPFRNFF SGRILWDEAM ANAAYSWTKA NPGGLLVGLV GADHVKFRNG IPGRYARLAP NDAACVSVLL NPTLIDTRPS GTVGMEGAVS DRPETITLQI RYLKDDVQFD SPERTLPSST GGVLALADYL VVG
|
| |