Gene Information |
Locus tag | PHATRDRAFT_47781 |
Symbol | |
ID | 7203033 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 14170 |
End bp | 15973 |
Gene Length | 1804 bp |
Protein Length | 437 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182143 |
Protein GI | 219123669 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.938739 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGTGC GCAAGCCCGT CTTAGCGCTC TACCAAGCGG TTGCGGCGAC CGATCATCAT GATGTTCAAG TCGGCGAAGC CGCCAAATCA GGTTCCGGTA CCGCCACAAT TCCCACCGGA GTCTTCAAGT CCGTTTTTGC AAAGTAGTGC CTCACTGTGA GCTTTTTGTA TACTGTCGGC CTTAACCTAA ATTATATTCT CACGAAACCA ATGCAACTCT CCCTCATTTG ATAACACAGC CTGGTCAAGA GCATTGCTGG TGCGGGTGTT CTTAGTCTTC CCTACGGTGT CGCCGCCTTT GGCAATGCCC CGTCGGCATT AATTCCGGCA ATTGGAGTGA TTATGTTGAT GGGAGCTGCA TCCGCCTATA CCTTTGGATT GATTGGGCGC GTCTGTCAAT GCACCGACAC AAATTCATAC GCCGCCGCCT GGGACGTTGC CGTGGGCCGG AAGTCTAGTT GGATCGTCGC CTTTTCCTGC TTTATCGACT GCTTTAGTGG CAATCTCTTT TACAGCATGA TTCTAGCCGA CACGTCGGTT GACCTCTTGG CTTCGGTGGG TGTGACAGTT ACTCGCACCC AGTCTCTCTT GTATGTCACG AATTTGGTCC TCGCCCCACT CTGTTTCCTC AAGAATCTCA GTAGTCTAGC ACCGTTTTCA CTCGTGGGCA TCATCGGAAT GCTCTACACA ACTCTGGCAA TGGGTCTCCG TTACTTCCAG GGTTCCTACG CCCCCGGTGG AGAGTACTTT AGCAGTCAAC TCACCGAACC CGTCTTTGGG GTTGACGGAG CCTCGGCGGC ATTCTCACCC AAGGCCCTCA TCCTCACCTG TATGCTTTCC AATGCCTACA TTGCCCACTT CTCGGCTCCA TTGTACTGGA GCGATCTCAG AGACAATACC ATGGAACGCT TTCATCAAAT GATTGGGTAC AGCTTTACTG CGGTGGTGGT CATTTATAGT CTGGTTACCA CCGCGGGTTT TCTCACCTTT GGTGCCGCCT CGAACGGATT CATTCTGAAC AATTACTCCA CGAACGATAC GATAGCCAGT CTTTCGCGGT TTGCCGTCGC CATTAGCATC ATCTTCTCCT ACCCGCTGAT CTTTGTCGGA ACCCGTGACG GTCTGATGGA TCTTTTTCGC GTCGAGGAGG CGAAACGAAA CACAAAACTC ATCAACAAAT TGACGGTGAC CCTCATGCTT TTGGTGACGG CATTGGCTAG TCAATTGACT GATTTAGGAG TTGTGGCGTC GATTGGTGGA GCCACCTTTG GAACGGCCCT CGTCTTTGTG TATCCTGCTG TCATGTTCCT CAAGACGCAG ACAAAGAGAA CCAAGGAAAC TGTTCCAGTA TTTTTTATTG GTGTTTTGGG TGTTGTTGTG GGGGTGATTG GCACGACCAT GTGCTTTTGA AATATTCGCT TATCGCAGGT ATTTCACCCC GTGAAATAAG TTTGGCAACA ACACTAGAGG TAGTCTCGAT ATGGCGTTGC GTCTATGGCA ACTGCTCTTT GTATAATCCG CGCCCTCGAT GGGGCAGGTC AGTGCTTCGG ATGCTCTCGA TGGGTTCTTG GCGCTAGATA AACCTCGATG TCGCTGACTG TGGAGTGGTC TATACTTATA CTGCTACCGC CGAGGACCAA TCCGTCGGTC TCTTTATCAT TTTAGGCACA AAGGGCATTT CATCCAATGC GCATTAGGGC AAGTCCGAGG TCGTAAAGGC AGCCGAAGAT GTTTTCACTT TTGGAGGAAT CAAAGGACTC GGTGACCAAG GTAGTAAAAC TAGTCTTTCG GTTGGATGCA AAAA
|
Protein sequence | MAVRKPVLAL YQAVAATDHH DVQVGEAAKS GSGTATIPTG VFKSVFANLV KSIAGAGVLS LPYGVAAFGN APSALIPAIG VIMLMGAASA YTFGLIGRVC QCTDTNSYAA AWDVAVGRKS SWIVAFSCFI DCFSGNLFYS MILADTSVDL LASVGVTVTR TQSLLYVTNL VLAPLCFLKN LSSLAPFSLV GIIGMLYTTL AMGLRYFQGS YAPGGEYFSS QLTEPVFGVD GASAAFSPKA LILTCMLSNA YIAHFSAPLY WSDLRDNTME RFHQMIGYSF TAVVVIYSLV TTAGFLTFGA ASNGFILNNY STNDTIASLS RFAVAISIIF SYPLIFVGTR DGLMDLFRVE EAKRNTKLIN KLTVTLMLLV TALASQLTDL GVVASIGGAT FGTALVFVYP AVMFLKTQTK RTKETVPVFF IGVLGVVVGV IGTTMCF
|
| |
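The coordinate and length fields in the Gene Information table are related by simple arithmetic, which the sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length does not count the stop codon. Neither convention is stated in the record, and the function names are illustrative only. The GC helper ignores the spaces that split the sequence into blocks of ten.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein (assumption: the stop codon
    is not counted in the protein length)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among A/C/G/T bases; spaces and other
    characters are skipped."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return sum(b in "GC" for b in bases) / len(bases)


span = gene_length(14170, 15973)   # matches the Gene Length field (1804 bp)
cds = coding_length(437)           # codons for 437 aa plus one stop codon
untranslated = span - cds          # bases in the span not accounted for by codons
print(span, cds, untranslated)
```

Because the record lists the gene as spliced, the span is expected to be longer than the coding length. The difference is the sequence inside the span that is not translated.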