Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_16499 |
Symbol | |
ID | 7198760 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 349889 |
End bp | 351025 |
Gene Length | 1137 bp |
Protein Length | 378 aa |
Translation table | |
GC content | 58% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184867 |
Protein GI | 219129378 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.570154 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATGACA GTGGACAAAC CTACACGTAT CTCAATTATC CGTTGGAGAA CGGACAAGTC CTCGCCGAAG CCCAACTCCG CTACCAAACC TACGGACAAC TCAACGAAAC ACGGGATAAC GTCATGGTTG TTTGCCACGC CTTGACCGGC AACGCCTCGC TACACGCCTG GTGGGGCGAC ATGCTCGGAC CCGGGAAAGT CTTCGACACG GACAAGTATC TCGTGGTCTG TTGCAACATT CTTGGTAGTT GCTACGGCAG TACGTCACCC GTATCGATCC GCCCCGGAAC GGACCAACCC TACGGACTCG ACTTTCCCGA CGTCAGTGTC AAGGACACGG TACGGTTGCA GCTCTGCATG CTCCGCGACG AACTCAAAGT CGCTTCCGTC CACGCCGTTG TGGGCGGTTC CTTTGGCGGC ATGCAGGCCG TCGAATTCGC CGTCCAGGCC GGATCCACCC GGGCCGCCTT TACCGACGCG CACGGACAAC CCTTTTGCAA ACACGTGGTG CCCATTGCCT GCGGCGCCCA GCATTCGGCC TGGCAAATCG CCATTTCTGA AGTCCAACGC CAGGCTATCT ACCAGGACCC GGCCTGGCCG ACGGATCCTT TCCGCGCCAC GCACGGATTG CGTGTCGCCC GACAGTTGGG TATGATTTCC TACCGTACGC CGCAAGGGTA CGGCAGCAAG TTTGGCCGGG AACGGCAACG TGGTCGGGGC GACGATGACA CGGACGGCCC CGCCTACGGT AGTCACGCGC GTTGGCAAGT TAAATCCTAT TTGGAATATC AAGGAGTCAA GTTCCTCCAA CGCTTCGATC CCGTCACGTA CGTCAAACTC ACGGAACAGA TGGACTCGCA CGACGTGACA CGGCAACCTG CCGGTAGTTG TCCCGGAACG GTCAGTAAGG AACAAGTGCT GGGCCATGTG ACGATTCCCG TGCTCGTACT AGGCATTGAC AGTGACGTGC TGTATCCGTT GGCGGAACAA CAGGAACTGG CCCGACTCTT GCCCAACGCC ACGTTGGAAG TGATTCATTC GGACGACGGA CACGACGGGT TTTTGTTGGA ACAGGAGCAA GTGGCGGCCC ACATTCAACA CTTTTTGACC CTCCACGAAC GACCCACTAC TATCTAA
|
Protein sequence | MDDSGQTYTY LNYPLENGQV LAEAQLRYQT YGQLNETRDN VMVVCHALTG NASLHAWWGD MLGPGKVFDT DKYLVVCCNI LGSCYGSTSP VSIRPGTDQP YGLDFPDVSV KDTVRLQLCM LRDELKVASV HAVVGGSFGG MQAVEFAVQA GSTRAAFTDA HGQPFCKHVV PIACGAQHSA WQIAISEVQR QAIYQDPAWP TDPFRATHGL RVARQLGMIS YRTPQGYGSK FGRERQRGRG DDDTDGPAYG SHARWQVKSY LEYQGVKFLQ RFDPVTYVKL TEQMDSHDVT RQPAGSCPGT VSKEQVLGHV TIPVLVLGID SDVLYPLAEQ QELARLLPNA TLEVIHSDDG HDGFLLEQEQ VAAHIQHFLT LHERPTTI
|
| |