Gene Information |
Locus tag | PHATR_10628 |
Symbol | |
ID | 7204194 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 605563 |
End bp | 607759 |
Gene Length | 2197 bp |
Protein Length | 575 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186091 |
Protein GI | 219113015 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATGGCCACA CCATTCACCA GTATTTGCGC GGCCGCTTTC TCGGAAAAGG CGGGTTCGCC AAAGTATACT TGTGCACCGC TCTGGACACA TCCAAACAGT ACGCGGTCAA GATCGTACCA AAGGCGAATC TAGTCAAAGC GCGAGCGCGA CACAAGGTTT GTTACTGATT CTCAGTGCAC ATTATCTACT TTTCCCTAGT TTCGACGCTC ACTTACTTTC TATCCCTCGC AGTTGCAAAC CGAAATAAAA ATTCACCGCA CACTCAAGCA TCCCAACATT TGCGAGTACA AACATTTCTT TGAGGACCGC AACAACTGCT ACATCCTTTT GGAGCTCTGT CACAATCAGA CTCTAAATGA AATGATCAAA CGTCGGAAAA GATTGACCGA ACCAGAAGCG GCTCTGTTTA TGAATCATCT TCTCGATGCA GTCAAGTACA TGCACCTGAA GAATGTAATT CACCGAGACT TAAAACTCGG AAACTTGTTT TTGGACCGAC ATCTGAACGT CAAGGTTGGA GATTTGGGCT TAGCAACGAT TTTAGAACAT CCCGAAGAAA AGCGCAAGAC TATCTGCGGA ACCCCGAACT ACATTGCTCC CGAGATCATT CAGGGAGACA AGGCCACCAG GGGGTATTCG TTCGAAGTTG ACGTATGGTC CATGGGGGTT ATCCTGTTTA CAATTCTTGT TGGAAAGCCG CCGTATGAAG CGAAGGACGT CAAAGCCACT TACCAACGCA TTTTGGCCAA CGAATATTCG TTCCCCAACA ATGTAGAACT CTCGTTGGAT GCAAAAGACT TAATTCGGAG CATGCTACGC TCCACACCAT GCGAACGGTA AGGTCGTCAT TCGTTAGTTC TTGACTGATG CATTGTAAAT TTCAATACGT AGAGGCTAAT CTTTTCTTTC TACAGTCTGT CCCTTAAGGA GATTGGAAGT CACCGGTTCT TGTCCATTAG GAACACACCA CTAAACATCC CTTCAAACGC CACTCACTCT ACACCCAAAT GGTACTTGAA CGAGTATGGT AGATTCGTCT CCGACGGAGA CGCTGCGGCC ATTCACTGTC AAAAACCACG AAAATCAGTA CTCCCCCGAT TGAGTACTCG GCAGCCGTTC GGACTTCGTG ACCAGAACCA TGGAACGGCC CGTAAGACGA AAAATGAAAA ATCCGAGGGA GAACACATTG ATATACAACG CCTCGTCAAG AGCACTATAT CTCTACCGGC GTCGAAGCCC ATTAAAGGCG GAGGCATGTC TCCTACTTTC AGAATTTTTG ACGACTCCAA AAAAGCTACG CCATGTGAGT CCTTGGAAAA ACCTACACCA AAAACGAACG CAGAGGAAGA ACTTATTTCT CGAACTCGCG CCCTGTCAAT TCAAACTTCC TCTCGCCTTC AAGATTCGGG CCGATGCAGT CCTGCAAGAT CTCTGGCATC CTCCACCTAC ACCGCAATTA TCGATTCCGG TACAGAGATT CTGCAGAAAC TCGTTGTCCA TCTAGAAGCC GTTCTAGAGT TAACTGCCTC ACGTCGTGAT GCGTTTCGAC CTACATCCCC TCAATCCGTA GTCGTGTATG CAGGACCTAC CAGATGGGTG AGCCGCTATG TTGATTATAC AAGCAAGTAT GGTCTGGGCT TTCTTTTGAA CGATGGTAGC TCCGGAGTTT ACTTTAATGA CTCAACCAAG ACTGCTCTGG AGGCACAGGG GGAGACATTC TACTATATTG AACGTAGAAA GGTTGAAGAC GCTGCTTCTC GAAAAGTAGA AATTGCTGTT GAAACCCACA CGTTGAGTTC 
GTACCCAGAG CACTTGAAGA AAAAAGTCAC TCTTCTGAAG CATTTCCGCA ATTATCTCTT AGACCAGCAG AATAAGGATG AAGAAACAGA GCCCACCCGG CCTCTTTCAT GCCTGCCAGT TTCTGACACA GTACACGTCA AGAAATGGAT TCGCACGAAA CACGCGATAC TGTTCCGTTT GAGTGACCAG ACCATCCAGG TAGTCTTCTA TGATCAAACA GAAGTACTCT TGACACCTGA TGTACGATAT ATTACCTATG TGGATAAAAA TCATGTCCGT CGAACATACG ACTTCACAGA CGAACTAGTT GGGTCCCTTG TGGAATTGGA GAAACGTCTG AAGTACACTA AAGAAGTTTT GTTGCAGCTC ATTGGTTCAC ACTCTGGACG TCGCTAA
|
Protein sequence | DGHTIHQYLR GRFLGKGGFA KVYLCTALDT SKQYAVKIVP KANLVKARAR HKLQTEIKIH RTLKHPNICE YKHFFEDRNN CYILLELCHN QTLNEMIKRR KRLTEPEAAL FMNHLLDAVK YMHLKNVIHR DLKLGNLFLD RHLNVKVGDL GLATILEHPE EKRKTICGTP NYIAPEIIQG DKATRGYSFE VDVWSMGVIL FTILVGKPPY EAKDVKATYQ RILANEYSFP NNVELSLDAK DLIRSMLRST PCERLSLKEI GSHRFLSIRN TPLNIPSNAT HSTPKWYLNE YEEELISRTR ALSIQTSSRL QDSGRCSPAR SLASSTYTAI IDSGTEILQK LVVHLEAVLE LTASRRDAFR PTSPQSVVVY AGPTRWVSRY VDYTSKYGLG FLLNDGSSGV YFNDSTKTAL EAQGETFYYI ERRKVEDAAS RKVEIAVETH TLSSYPEHLK KKVTLLKHFR NYLLDQQNKD EETEPTRPLS CLPVSDTVHV KKWIRTKHAI LFRLSDQTIQ VVFYDQTEVL LTPDVRYITY VDKNHVRRTY DFTDELVGSL VELEKRLKYT KEVLLQLIGS HSGRR
|
| |
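The derived fields in the record above (Gene Length, GC content) can be cross-checked from the coordinates and the sequence text. The following is a minimal sketch, not the database's own method: it assumes that Start bp and End bp are 1-based inclusive coordinates and that GC content is the fraction of G and C bases in the gene sequence. The function names are hypothetical and chosen for illustration only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace from a sequence printed in 10-letter blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C bases (assumed definition of 'GC content')."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the record: Start bp 605563, End bp 607759.
    print(gene_length(605563, 607759))  # 2197, matching the Gene Length field

    # First two blocks of the gene sequence, used as a short demonstration.
    print(gc_fraction("GATGGCCACA CCATTCACCA"))  # 0.55
```

Because the gene is spliced, the Gene Length of 2197 bp spans introns as well as coding sequence, so it is larger than three times the 575 aa Protein Length (1725 bp).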