Gene Information |
Locus tag | PHATRDRAFT_56608 |
Symbol | |
ID | 7200644 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 658813 |
End bp | 660627 |
Gene Length | 1815 bp |
Protein Length | 433 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179689 |
Protein GI | 219117802 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
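The coordinate fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, although it reproduces the 1815 bp listed in the record); `feature_length` is a hypothetical helper name, not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record: Start bp, End bp, Protein Length.
gene_length = feature_length(658813, 660627)
min_coding_bases = 3 * 433  # three bases per residue, stop codon not counted

print(gene_length)       # 1815
print(min_coding_bases)  # 1299
```

The 433-residue protein needs at least 1299 coding bases, fewer than the 1815 bp gene span, which is consistent with the record marking the gene as spliced.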
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | CGACGTGAAA CAAATCTCGT AAACAGCTCA TTCACAGTTA CTGTTGATTT GCATCCAGTT CTATCCATAT CATTCGTTGC TCCTTTGAAA ACAAGAAAGA TTCGGATTTA GTCATGATGC GTTTCCTAGT GATTTTGCTG ACTGCTTCCG CGGTTTCGGC CTTTTCGCCT TCTTTGAATA CCAACAAGGT CCATGTTCCG GTTCCATCCG TGTTAACGAC CTCCTTGTCG TCCCCCTTAT TTCGTAGTCA AGCCTCGAGC TCTTCCTTAA CTCAATGCGA AGCTATGCTT CAAGAAACAG AGCTGCCTGA TAAACTGTAT TTTGAAAAAG AAAAGGAAAT GCCCAAGGTA TTGGGAGGTC TCAAGATTGG CCTCCGAAAA CTGGTAGTCA TTACCGGCGC GTCGTCGGGG TTGGGTTTGA ACTGCGCCAC CACCCTCGCC AAGACGGGGC GATATTTTGT CGTCATGGCG TGTCGCGATG TCGAAAAGGG CAAGCAAGGT ACGCACTCGA TGCTTTAAAC GCAGCTTACT TCCTTTGGTC GTTGGTGCGT GTGAAGAATT CTAACGAAAT GTGTGCTGCT TGGCGATTTC TTTCTTCCCT ACAGTCGCCA AAGAAAAGGG ACTCCCAGAC AACTCGTACG TTGTTATGAA GCTTGAACTT GGTAATCTGC AATCGGTCCG CGATTTTGTG AGCAACCTCA AAGCGTTCAA AGCTGCCCGT CCTTTGACAC ACTTGATCTG CAATGCTGCC GTGTACAAGC CCAAAGATCC TGAACCGGCC TGGACCGACG ATGGGTTTGA AATGTCCATG GGTGTCAATC ACCTCGGCCA TTTTTTGTTG GTTCATCTCT TGATGGATGA CATGTCTCGT GCCAAGGGTG CTCGTGTCTG CATCGTGGGG TCCATCACTG GAAATACCAA TACCGTCGGT GGAGGTCTCG TCTACCCACA GGCTGATTTG GGTAAACTCC AAGGATTTGA GAAAGGCGCC AAGAAGCCGG TTGCCATGGC GGATGGCAAG CCATTCTTTG GTGCCAAGGC TTACAAAGAT TCCAAAGTAC GTTTCAATCT GTTTCACAGA TAAGACGATC ATGGATGGTA AGAATTTCAT CCCTGACATT ATTCCTTTCT TTCACTCTGC AAACCAGGTA TGCAATATGA TGACCGTGTC CGAGCTTAAC CGTCTTTACC ACAAAGATAC CGGTATCGTG TTTTCCTCCA TGTACCCGGG CTGTATCGCT GAGACCGCTC TCTTCCGCGA AAAACGTCCC TGGTTTCGCA AGGCGTTCCC TTGGTTCATG AAGTACGTCA CGGGTGGCTA CGTTGGTATG GAGGAAGCTG GGGAACGTCT TGCTCAGGTT ATAGACGACC CACAATGCAC TAAGTCGGGT GTTTATTGGA GCTGGAATGG CGGTGCTCAG ACAGTGGGAC GTTGGAGCCC CGACGGAAAG CCTCGAGGTG CCGGCGGATC GGGTGGTGAG ATTTTTGAGA ATCAGCAGTC CGACGCAGTA CGGGATCTTC CTACCGCCAA GAAAATGTGG AAACTGAGTA GGGAAGCAGT TGGTCTTTCC AAGAAGGAAA TGTTTAAGGG TGGAAAGCTT GAAGAGGAAT AATAAGCAAA GAAGATTGCT TCTACGGAAA AGAGGGCCCA TTGTCGATTC GCTTGGTGCT AGTCACATGC ACTCATGTTG CTTTCACATT TGATCATCTG CACAATTCAA TATATCGATA TGTGGCTACA TCACCGAAAA AATTCATTGC TGATTTGACA AGCTCGAGCA TAGTTCAACG AAACAATACT 
AGAAACACTG GAAGC
|
Protein sequence | MMRFLVILLT ASAVSAFSPS LNTNKVHVPV PSVLTTSLSS PLFRSQASSS SLTQCEAMLQ ETELPDKLYF EKEKEMPKVL GGLKIGLRKL VVITGASSGL GLNCATTLAK TGRYFVVMAC RDVEKGKQVA KEKGLPDNSY VVMKLELGNL QSVRDFVSNL KAFKAARPLT HLICNAAVYK PKDPEPAWTD DGFEMSMGVN HLGHFLLVHL LMDDMSRAKG ARVCIVGSIT GNTNTVGGGL VYPQADLGKL QGFEKGAKKP VAMADGKPFF GAKAYKDSKV CNMMTVSELN RLYHKDTGIV FSSMYPGCIA ETALFREKRP WFRKAFPWFM KYVTGGYVGM EEAGERLAQV IDDPQCTKSG VYWSWNGGAQ TVGRWSPDGK PRGAGGSGGE IFENQQSDAV RDLPTAKKMW KLSREAVGLS KKEMFKGGKL EEE
|
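The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch that assumes GC content means the share of G and C among the A, C, G and T bases, with the spaces between 10-base blocks ignored; the record does not state how its 48% value was computed or rounded, and `gc_percent` is a hypothetical helper name.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence."""
    # Keep only unambiguous bases; spaces and other symbols are skipped.
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First two 10-base blocks of the gene sequence above, used as a short sample.
sample = "CGACGTGAAA CAAATCTCGT"
print(gc_percent(sample))  # 45.0
```

Passing the full gene sequence string instead of the sample gives the figure to compare against the listed GC content.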