Gene Information |
Locus tag | PHATRDRAFT_44390 |
Symbol | |
ID | 7197856 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 405594 |
End bp | 407668 |
Gene Length | 2075 bp |
Protein Length | 627 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178221 |
Protein GI | 219114851 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.68947 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCGTGGCAA CCAGCACAAT GGCGGCGACT CGACTGACTG GCATTTACCG ACGTTCAGTG ATTGCTTTGC TTCCAGCTCA TCCGTCGTGT AACATTCGTG GCAGGAATTG TATCTATGCT GGTCTCCGCG TGTTGCTTCT CGTATCTTGA CTAGAGCCAT TGACGACGCT CACAGTCAGT GCGATACTAC AATGCGACGA CAGTCGCTTG TCGTGGGTGT CGTCGTTGCG CTGCAACCCA ATCCAATCTT GGCCTTTTCT ACCAGCAGAT CGCCGACTCT GAAGCGTGAA CACCCGCGTT GGGTCAATAA CCAGAAACGA CCCCCGTCAG TTTTGTCCGC CGTCTCGCCG ATAGATACCT TGGTGGATTC CTGGAAGGCA CTTTGGGAAA AGCCAACGGC CGCCGTTTCC TTGCCCAAGA AACACCCGGC ACAGACAGTG CAAAATTTTA TTAGCGCCTA CAATGAAAGA GATTACAACA CGCTCGAGAA TATGGTCGAC CCAGACATTG AATTTGACGA CACGGCTTAC CCAAAACCGT GTCGAGGGCT GCCCGAACTT GAGCGTCGGT GGCGCTTGAC CCGGAATGCT CAAGGAGACA AGCGCATACA AGTCGCGGTC GATGACATTG CCTCTTCTAC GACCACTGTA GGTATTCGTT TTCATTTGGA AAATGTCGAA GGCGAGATTC CCAACGGTCG GGGAGCTGCC TTTTTCCAAC TTTCCAACGA CGGTCTACTG GTTAAAAAGG TGTTTTGGGT ACAAGAATCG GCTCAGAAAG GGGGAGAGGC CAGTCTGCAA ACCCTCAATC GCGCCAGCAA AGTTATGAAA TTGACTGGCT ATAACAAGGC CACCGCCACA TCTACCGCCA CGAAGGAGGC GTCCGAAATG ATCGAAACCT CATTCTTGTC TCTACCAGAG AAGTATTTTG CGGCTTGGAA TCGTCGAGAC ATGAGCAGTG CGGTTGCCTT GTTTGCGGAT ACCGTCACGT ATGATGATAC GGCCTTTCCG GAACCATTTA GCGGCAAAAC AAATTTATCA TCGCATCTTT ATAAATGCTC CAACGCGTTC CCTTCCACTT TTACCTTTCA AGTCGACAAG GTGGCCGACG CTGGAGACCG GATCTCTGTG TTATGGCACG TAGAAAACGA CGGGGATGAT TTACCTTTTA CGAGGGGATG CTCCTTTTAC AACGTTGATA CTAAACGAAA CGAAATACTG GACGGCATTG ATTTTGTTGA ACCCGGGCCG ATTAAACTCG GAGGATTCCG TTTGCTGGTG TCGACGGTCA GAACTAACCT TGAACGTGAA CCGGCTCGAT ACGTACCACT GATATCCTGG ATTGCCTACA TCTATATTGT ATTCTTCTCA AACGGGATCC TGCCGGGTGC GAATGCGCTA GAGCTTGAAC AACGAACTTG GGAAGAGGTT CGTGATTTGT CGCTCAACTT TTTCCTCGTT TCACCACTTC TCCAATTATC CTTCTCTCCA ACGGTGCATC CTATGCTAGA AGGCGTTTTC AATTTGCTCC TGTCTTGGGC GGCCATGTTT GCCGGTTTCT TGAGCGACGA CCGTCGAGAG AAGCCCAATG TCGCCCCAAT GCTTCCAATC GTCATCGGTA TGCAGTTTCT GACATCGGCG TTTCTAATGC CATATCTAGT GCTCCGGTCG ACCGAAGAAA GCACACTCGT GGCAAAGGAC GCCCTTCCGA AAGTGGCTCA ATTGGCAGAG GCGCGGGCGC TGGCGCCCTT CTTGGCGTCG GTAGGAACTG GTTCCATCGT ATGGGGACTA GTAGGACGAA 
TGGCAGATTT TGGGGACTTG TCGACTCGCT GGTCCAGTTT TATCGATCTG CTGAGTATTG ACCGCGTAGG CTCGAGCTTT GTCGTTGATC TAGCATTGTT TTGGCTTTTT CAATCATTTT TGATTGACGA TGACCTTAAG CGGAGGGGCA TAGATCCAGA AACTAGCGAA CTTCCGGTCT TTCTGGGAAA GTATGTTCCG TTCTTTGGCA TGGCACTCTA CCTTGCGATG CGACCACCAC TTCCTCTGAA TATACAGAAC TCGCAAAATG ACTAG
|
Protein sequence | MRRQSLVVGV VVALQPNPIL AFSTSRSPTL KREHPRWVNN QKRPPSVLSA VSPIDTLVDS WKALWEKPTA AVSLPKKHPA QTVQNFISAY NERDYNTLEN MVDPDIEFDD TAYPKPCRGL PELERRWRLT RNAQGDKRIQ VAVDDIASST TTVGIRFHLE NVEGEIPNGR GAAFFQLSND GLLVKKVFWV QESAQKGGEA SLQTLNRASK VMKLTGYNKA TATSTATKEA SEMIETSFLS LPEKYFAAWN RRDMSSAVAL FADTVTYDDT AFPEPFSGKT NLSSHLYKCS NAFPSTFTFQ VDKVADAGDR ISVLWHVEND GDDLPFTRGC SFYNVDTKRN EILDGIDFVE PGPIKLGGFR LLVSTVRTNL EREPARYVPL ISWIAYIYIV FFSNGILPGA NALELEQRTW EEVRDLSLNF FLVSPLLQLS FSPTVHPMLE GVFNLLLSWA AMFAGFLSDD RREKPNVAPM LPIVIGMQFL TSAFLMPYLV LRSTEESTLV AKDALPKVAQ LAEARALAPF LASVGTGSIV WGLVGRMADF GDLSTRWSSF IDLLSIDRVG SSFVVDLALF WLFQSFLIDD DLKRRGIDPE TSELPVFLGK YVPFFGMALY LAMRPPLPLN IQNSQND