Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_47347 |
Symbol | |
ID | 7202507 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 363118 |
End bp | 364617 |
Gene Length | 1500 bp |
Protein Length | 499 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181538 |
Protein GI | 219122409 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAGAC AAAAACTCAC AGACAGAGAA ACCTCTCTAG CCCGGCAACA CGAGAAGCGA ATTGGAACGT CAGGTGCAAA TGAAAGTGGA ATGAATAGTG CCTTGGAAGG GGCCATTATC TCGCCACAGA GCCGGGTGAT CGGCCATTTT ACGTCGTTGC TCTCTAGTGC CTTGAATGTA TCACGGAGCA CCGGCACCGT AAAGCAGCCG CTTCAGTCGG TATCTGAAAA GCTCCCTTTC ATTTTCTCCC GATCGATCGA CGCTGCTGCC CTATGTTGTA GTGTGCCATC GCTGCAACTG AAACAAAAAG ACCCGACCTT TTTCCCGCGT TCTTCAAAAC CGCTGGTAGA AGACGCCGCT GCTTTGAATC TTGGAACACC CTTACGTGTT CACCGATCCG ATTTGAAAAG TTTGCCAATC ACATTATTGT GGAATCTCAG TCGATCTTTC CTATCCTTGG TAGATTCACG ATTGCGGTCT TCACAAACCG CTTTGGTAAG GCAAAGTCGA AGTAGGCATC GGGAAGATGA CGCACATTCC CGCGTCCTCG TTGGTCTTTT GGCTGCGTCT TCTACTCCAA TCAATCCCAC AGCTGTCGTC ACAACTTTTC GTGCTCTGGC TTTCTCCGAG CGTGTCGACG AAGGTGACTA CATTTTACCT ATTGTTATGG AAGCAGTATT TGATCTCGAT GTTTTGGGCC ATTTCATGAC CGTCACGATT GAAGCTCCAG GAACCATTCA AGGAAGCTTT GTCGGCAACA ATCACATAGC AGGTCCGGTA GAGCTGTTAA AAATCGAAGT TCAACTAGAC ACGTCAGCCA TGCTCAAGTC GATGATGACT GAAGCGCGTT CTGTGGTGCG AAAAGCCCTT GTCGTAGCCA CCGAAATAGC CACTAACCTT CTTCACTCGA CCCCGTCGAG AGTCTCATAC CACGATACGA CGGATCTGTT GGTGCTGCAA GGTTCAGGCG AAAGATCTGT GAGAGAGATG CTTCCTAATA GTACCTCTTC AGATACTTCA GCGAACGGAT CTAGCGCAGC TAACAACAGC GACACCTGTT CCGAGCATCC ACAGATGCTT CCGCCTCCTG CGCGGGGAAA GAGTGAAGAG CTGTCTCACA AAAGAAGTAC TGAAACTTCG GACAAAACCA GTGATGCATT CTCCTTGGAG ACAAAGAACG CACCATGGGG CAAGAATAAC ACCTGTGGCG ACAAAGAAGA CATCGACTTT TTGGAAAATG AGAGTTTTGT GTCTGATTTG TCCGGAGTGA AACGGGAATG CCTTGACCAA CAATGCCTGG GCGTTGATGA TGACAGTTTA TCCATAGGAT CTCGAAAGCG GCTTAAGACG TCTCCGTCTG AAAAGAGTCG TACCACGGAA AATGAAGAGC AGCCGTCAAC AGTAGACCTT CCAGCTGAGA GCTCACGCGA TCAGGACGAT CGACCATCTC GTCCTAGCAT CGATGTGAAT GAGCTCTGTG TCCGGATTGC AAAGGTATAG
|
Protein sequence | MRRQKLTDRE TSLARQHEKR IGTSGANESG MNSALEGAII SPQSRVIGHF TSLLSSALNV SRSTGTVKQP LQSVSEKLPF IFSRSIDAAA LCCSVPSLQL KQKDPTFFPR SSKPLVEDAA ALNLGTPLRV HRSDLKSLPI TLLWNLSRSF LSLVDSRLRS SQTALVRQSR SRHREDDAHS RVLVGLLAAS STPINPTAVV TTFRALAFSE RVDEGDYILP IVMEAVFDLD VLGHFMTVTI EAPGTIQGSF VGNNHIAGPV ELLKIEVQLD TSAMLKSMMT EARSVVRKAL VVATEIATNL LHSTPSRVSY HDTTDLLVLQ GSGERSVREM LPNSTSSDTS ANGSSAANNS DTCSEHPQML PPPARGKSEE LSHKRSTETS DKTSDAFSLE TKNAPWGKNN TCGDKEDIDF LENESFVSDL SGVKRECLDQ QCLGVDDDSL SIGSRKRLKT SPSEKSRTTE NEEQPSTVDL PAESSRDQDD RPSRPSIDVN ELCVRIAKV
|
| |