Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47845 |
Symbol | |
ID | 7202979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 221358 |
End bp | 223475 |
Gene Length | 2118 bp |
Protein Length | 620 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182181 |
Protein GI | 219123749 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0593792 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAATCGCGTC CACGCTCGTA GCATGGAGTC ACCGTAAAGA AGTAAGGTTT ATGATACCAT CTTTCAATTG TCTTCCGTGC GTGTTAGAGC TCAGCTTAAT TTCCTTGGTA CATAATAGTC CTCTCGCGTG CGCTGCATTG GCAGGCAGAA ACTTCGTTGA CGATAGAGTC TGCCTGCCCT GGATCTAAAC CACAGCCAAA AATATGACAT TGACGTCAAG TATTTCAACC GATTCAACAA ACGGTAAAGT TGGACAACTC GAAACGCCAA ACCAGCGTAG TATGGCAACA GAAAGAAGTT TTCTTTTCGG TTCTCGTCGG GTGTTTACGT TCGCCATGCT TGCTGGAGTT TTAATTTTTC TGATGGGGAG TCTCTCCGTG TCGATGCAAT CGCAGCAATA CATCGAAGCG CTCTATTCGG GGTTGTCCGC GGTCGCTAGT CTGAATAACA GCACCACCCT AAAAAGCGTT GAGGAATTGA CGTATGTTGC ACAGCAACAT TCTCCTGGAC TTACGGGACG AACATTACCG GAGCATCCCC ACGAAACACG AATACGACAA GCGATACATT CCCACTATCC TCATTTTGAA CATCATTCGG TTCGAGGTAA AGTTGTGGAC GTGACGCAGA ATGTTAACCG GATCGAAGAG CTGTCCTTAA CGAGGAAGAA AAATGTCACC TACATGGAAG TGAAGGATGA GGACCACGGA CCGTTGAATG TCGTCCTGTT TTACGCTGAC GACTGGACTT TGAAAGTGCT TGGAGCCTTA AACCCTCACG TAAAAACACC AAATATTGAT CAAATGGCGA AGAATGGAAT GCTCTTTCCC TATAATTGTG TCACCACGAG TATTTGCTGG ATTTCACGTG CCACGTTGGT AACGGGTGTT TATGCGGCGG TCCACCAACA ACTAAAAATT GCGCACAACA GTCTATTTAA TAACCTGACC ATTCCGTGGA CAGAAACTCT ATTTCCACAG CTGAAGAAGC ACGGCTATTA CACGGGATTA GTTGGAAAAT GGCACGCCCC GTCGCCAGGG AAAGAGATGA AATTGGCCTT TGATGTGATG AATATCTATT ACGGGCGGCA TTGGGAACTG CGAAACGGTC AACGTCGTCA CGTGACCGAT CTAAACGGCG AAGATGCGCT CAATTTTCTG AGAAGTCGCC CCAAAGACCA AAAGTTTGCA TTGAAGGTCT CCTTCTTCGC CACACACGCT CAGGACTACA CAATTCCAGC CTACTCGCCC ATGAACGAGA GTATGTCTTT GTACGAAGAC GACGACATTC CTTGGGTGCA AACGAATACA GAGCAGCACT GGAAAGATCT GCCTTGGTTT TTTGACAACC GCAACGAAGG TCGGCGGCGG TATATTGGTC GCTTCGATAC TCCCGATAAT TATCAATACA ACATCAAGTG CTTGTACCGT ATGGCGACCG AAGTTGATTC GGTTGTTGGC GAAGTGATTG ATGAACTCAA AAGGCAAGGT GTTTACGACA AAACGCTTTT GATCTTTACA ACAGACAACG GAAATTTGCA TGGCGAGCAC GGTCTTGCGG AAAAGTGGTA TCCTTGGGAG GAATCAATTC GAGTCCCACT GGTCATCCAA GATCCACGCA TGCCAGCAAC AGAACGTGGC AAAGTCAATG ATGAATTCAC GTTGTCGGTG GACCTTGCAC CGACGATTTT GTCGGCGGCA AAGATTCCGA TACCATCTCA TATGCAAGGT CGGGATATTG CCGAACTGTA CTTTGATCCA CACCAGGCAA CGGTATCATG GCGTAAGGAT TTCTTTTACG AATGGAGTCA AGGCGAGCCG GTAGAAGCCG TAGGCCATAA CGAGTACTAC CATATTCCAG CGGTCTTTGC GCTGATTCGC AAGGACTGGA AGTATTTTTA CTGGCCGCAG GTCAAAGTTG AGCAGCTATT CCAGATTGAG AACGATCCGT ACGAGCAGCG TGATGTGCTG AACTCGACGG CTCAAACAAC ACAAGAAGCA CTGGATTTTA TGAGGGCAAG ATATTTTTTT CTAAAGAACT ACTCCCAAAT GGGCAACCCA GTCTGATACT TCTCAGAAAA ATTCTTTTTC TTCTAATAGT ATGAAGCATT TTCTCTTA
|
Protein sequence | MTLTSSISTD STNGKVGQLE TPNQRSMATE RSFLFGSRRV FTFAMLAGVL IFLMGSLSVS MQSQQYIEAL YSGLSAVASL NNSTTLKSVE ELTYVAQQHS PGLTGRTLPE HPHETRIRQA IHSHYPHFEH HSVRGKVVDV TQNVNRIEEL SLTRKKNVTY MEVKDEDHGP LNVVLFYADD WTLKVLGALN PHVKTPNIDQ MAKNGMLFPY NCVTTSICWI SRATLVTGVY AAVHQQLKIA HNSLFNNLTI PWTETLFPQL KKHGYYTGLV GKWHAPSPGK EMKLAFDVMN IYYGRHWELR NGQRRHVTDL NGEDALNFLR SRPKDQKFAL KVSFFATHAQ DYTIPAYSPM NESMSLYEDD DIPWVQTNTE QHWKDLPWFF DNRNEGRRRY IGRFDTPDNY QYNIKCLYRM ATEVDSVVGE VIDELKRQGV YDKTLLIFTT DNGNLHGEHG LAEKWYPWEE SIRVPLVIQD PRMPATERGK VNDEFTLSVD LAPTILSAAK IPIPSHMQGR DIAELYFDPH QATVSWRKDF FYEWSQGEPV EAVGHNEYYH IPAVFALIRK DWKYFYWPQV KVEQLFQIEN DPYEQRDVLN STAQTTQEAL DFMRARYFFL KNYSQMGNPV
|
| |