Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39914 |
Symbol | |
ID | 7195541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | + |
Start bp | 353134 |
End bp | 354390 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | |
GC content | 57% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183859 |
Protein GI | 219127265 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.0674804 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCCGAC CAAGTATCGA CTACTACCTC CCACAGATGC CGTCTTTGCT TCAAGATTGG TTCACGTCTC CCGACTCCGC TTCCCACACG GATCGAATTC TAGTGTTGGA TGGAGGCGTC AGTACGCATT TGGAGTCGAA CCTGTCGTCC ACCAACGCGG ACTCCGCTTC GTCCGACAAA CTGTCGTGCC GGACGTGTGC GTTCCCGCAT CGAGAATTGT GGTCCAGTAG TCTCTTACTC TCCGAATCGG GTCGTCGCCT CGTCCGGCAA GGTCACGATG ATTGGCTCCG AGCCGGAGCC AACGTTCTCT CGACAGTCAC GTACCAGTGT CACTACCAAG CTGCGTATTG GCCTAAAGGG AAGATGGCGA CGAACGATAA GGATAGCCGA GTAATGGACG ACGCCGTCGT GAATACTTTG TGGAACGACG GGGTAGAAAT TGCCCAACAA GCGGTGAAAG ACTATTGTCA CAATCAACAG CGACCGCACA CTCTCCGTCA GCCAGAGCTC GAGACGTCCT CCGTCCCGCG TTACGTCGTG GCCTCCTCGG GTTGTTACGG AGCCATTCTG GCCAATGGTG CCGAATACAC GGGAAATTAT GGACCCGTGA CGGTGGATGA TTTGGTACAC TTTCACCGAC GCAAGGTGCG GCGTGCCGTA CAACTGCATC CCGACGGGAT CGCCATTGAA ACCGTACCAA GTTTGCTCGA GTGCCACGCA CTCGTGCAAC TGTTTCAACC AACCAACGGA GCAGCGCCAA TGTTGTTGAA TAAGACTGCG TGCTGGATTT CCTTAAGTTG TCGGAACGAA CGCGAATTGA ACGACGGAAC GCCGTTGGTC GCAGCACTCA ACGTGCTGTC GCAGATTCCC TGTACCGCAG TCTCCGCTTG GGGTCTTAAT TGCTGTTCCG TAACACACCT TCCCGCCTTG GTCCGCATTC TCACGCAACA CGTGGCGCAA GAAGCGAGTG GCAAACACCG ACGCGGCTTG GTACTGTACC CCAACAGTGG CGAGCTGTGG GATGCGGTCA CGGGCACCTG GCACACGGGC ACGGGCTGTA CCGCGCCCGC CGCCATGGCG TCGGAAATCG TGGCTGTCCT ACGAACAGTC GAAGACGAGT GGCGACGCCA TCGTCCCACC GATCCGACGC CGTCCATCCT AATTGGCGGC TGCTGCCGGA CGAGTCCCGC TACCATTGCG GCACTTCGCG TGCTCGTGGA CGACCATTTG CAACAAGAGT ACAAGGGATG GTTTTGA
|
Protein sequence | MVRPSIDYYL PQMPSLLQDW FTSPDSASHT DRILVLDGGV STHLESNLSS TNADSASSDK LSCRTCAFPH RELWSSSLLL SESGRRLVRQ GHDDWLRAGA NVLSTVTYQC HYQAAYWPKG KMATNDKDSR VMDDAVVNTL WNDGVEIAQQ AVKDYCHNQQ RPHTLRQPEL ETSSVPRYVV ASSGCYGAIL ANGAEYTGNY GPVTVDDLVH FHRRKVRRAV QLHPDGIAIE TVPSLLECHA LVQLFQPTNG AAPMLLNKTA CWISLSCRNE RELNDGTPLV AALNVLSQIP CTAVSAWGLN CCSVTHLPAL VRILTQHVAQ EASGKHRRGL VLYPNSGELW DAVTGTWHTG TGCTAPAAMA SEIVAVLRTV EDEWRRHRPT DPTPSILIGG CCRTSPATIA ALRVLVDDHL QQEYKGWF
|
| |