Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42721 |
Symbol | |
ID | 7196118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 892321 |
End bp | 894669 |
Gene Length | 2349 bp |
Protein Length | 773 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176677 |
Protein GI | 219109848 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACAGTCAG TGGAATATCG AGCACCAATG GCCGCCGCCA ACGTCAAAGA GGTATGGAAA CCTGTGCGAA AATCCGTCCT GTACCTCATT GCGGCAGTCG CGCTTGCAAT CTCGGCATCG ACGGTTGGGG GTAACTCATT GCCGGCAGGT AGTAGAACGC AGGTCGAAGC CGTGGACGAT GTCGGTACAA GTGCGCAAGA TAATAGTCAC TCCCGCCGCT TGCTAGGTTA TCAACGAGTC ACGGTTTACA CAACAGCTCC GGAAGCCGAT CAGGACAATC CGATTGCACG CAAACAGTTC GCCGACTTGA TCGCCTCCCT GCCCACGGAT GTGACAACCC TCGATGTTGT TGCCTTAAAC TTAATGGAAC AACGCCATTT TCTGGAGCAG AATTGTTACG GGAAAGAGGA GGACAACATG CCCGTAAAAA TGAGCGCTCT GAAGCGTTTC GACGAACTGC GTACGCGTGG ACAAGATCAT CTCGCAACCG AAGTCTACAA GTGGTGCGCG TTGAAAACGA AACCCTACCT CTCTGACTCG GTGGCGTACA TTGATTCATC CAGTCCATTG CTGATGCGAC TGCAAGACTT TTTGTCGGAC GTACAAAATG TGGTAGTCTT AGGAGACGAT TATTTCCCCC ACACTGCTCA TGGCAGTCTG ATTGTGTTGC GCGATAATCA ACTCTACGTA GCGGAACAGA TGATGAAAGT TCTTCTGGAA ACTCCGGCGG AGCACTTGGA AGTAAACCCT CTTCTTTTGC CGCAGCGCTT GTACGAAGCG GTGGCGTTCG CTTCCGCCAA AAAGCATCCT AAACCGGGCA AGGTGGGCGA TTTCCTACTG TTGAAAGAGA CATGCCGAAT GGATCCCTTG AGGCGCCACA ATGATGATGC AAAAGGCTGG ACGGATACAC AAACTCGTCG CCACGTACAC CACTGCCCCG AACGTGCAGG CTTTTGCTGC TCAATTCACG ACATATCTCG TAACCGTGTC GTCCTCATGA GTCGTCATCC TCTGTTGCCA TTTCAAATTC TGTCCCAATC ATTTTCTCGG CCTTACAACG CCGAAGCGGA TCACTATGAA GAAGACGAGC TGCCCTACAT CACCACAATC CAGGCTACTT TTCACGAGCG GCCTGCGGAT ATGCCCGAGA CACCTAACTT TTACGCCTTG CTTGCCGCGA AAGATTGCTT GCCCGACAAA GAAGAATGCT CTAAGTGCTG CCGAAACCAG GCCGGTGCTA CAAGGGAACT ATGTGCCAAA GACTGCCCTT GCTATGCGAA GGCTCTCTGT TCGGATAAAC CTCCACCAAA GCATGTTGCT CATACGTGGA CTGTAACCCC ACCGGCCTAC ACGCGCGATC CCAACCGTGT TGTTCCCCGT ATCGTACACC AAACTTGGTT CGAAGACCTA GCCCAGGACA GGTACCCAAA TATGAGCCGC ATGGTCCAAT CGTTCCGTAA CTCTGGCTGG GAGTACAAAT TCTACAACGA CGATGACGCA GTTAATTTTT TGAGTACGCA CTTTCCGCCG GAAGTGCGAG AAGCATACGA AGCTCTTCGC CCTGGTGCCT TTAAGGCCGA TCTCTTTCGG TATTGTGTGC TGTTGATACA CGGTGGTCTC TATGCTGACG TTGACATCAT GTTGGAATCG GCTTTGGATG CGGCCATTGG ACCGGATGTA GGATTTATGG TTCCGACAGA CGAGCCCGGC ATGGCCACAA ATCACCGGAT GTGTCTATGG AATGGAATGA TCGCCGCCGC TCCAGGTCAT CCGTATCTGG CGAAAGCCAT CGAAACCGTC GTGAATCAAG TCCGAAACCG ATTTACATCT GTTGATATTG ATGCCACCCT CTGCCCCAAT CCGGAGCTCT CCATCTCGCA CGCTTATGAT ACTCTCTTCA CTGCAGGACC GTGCTTGCTG GGTGCCTCAA TCAATCGAGT TCTGGGACGG CATCCACAAA AATCCTTTAC AGCTGGGGAA ATTAACATTT TGGCTGATCG CCGACAATTA GAAGCCGGTA CATCGTTTAT TGTCGGAGAC GGTGTTGCAT TGGAGGCACG CGTGCCAGGA AGGAGTGTGA TTTTGAAGCA GGACAAGTGG GATATGGGCG CGCATCGATT TACCTACGTG GAGCGCAACT TGGTTGTCTC GGCCACAGAT CTGCAAGATT CAAATGATAG GGACACTCAC AAAAAGAACA AAAAAACAGA GCATTACAGC ACGACACACG CCAAGACGGG TATTTATGGA TTGGAAGGTC TGTACACAGA CACACGTATT GCAAATGAAG ATATTCGGAT TATACTGGAT GTTTCCAAGC AGGCTAGTGT CCCATCGTCT ACGTCTTAG
|
Protein sequence | MAAANVKEVW KPVRKSVLYL IAAVALAISA STVGGNSLPA GSRTQVEAVD DVGTSAQDNS HSRRLLGYQR VTVYTTAPEA DQDNPIARKQ FADLIASLPT DVTTLDVVAL NLMEQRHFLE QNCYGKEEDN MPVKMSALKR FDELRTRGQD HLATEVYKWC ALKTKPYLSD SVAYIDSSSP LLMRLQDFLS DVQNVVVLGD DYFPHTAHGS LIVLRDNQLY VAEQMMKVLL ETPAEHLEVN PLLLPQRLYE AVAFASAKKH PKPGKVGDFL LLKETCRMDP LRRHNDDAKG WTDTQTRRHV HHCPERAGFC CSIHDISRNR VVLMSRHPLL PFQILSQSFS RPYNAEADHY EEDELPYITT IQATFHERPA DMPETPNFYA LLAAKDCLPD KEECSKCCRN QAGATRELCA KDCPCYAKAL CSDKPPPKHV AHTWTVTPPA YTRDPNRVVP RIVHQTWFED LAQDRYPNMS RMVQSFRNSG WEYKFYNDDD AVNFLSTHFP PEVREAYEAL RPGAFKADLF RYCVLLIHGG LYADVDIMLE SALDAAIGPD VGFMVPTDEP GMATNHRMCL WNGMIAAAPG HPYLAKAIET VVNQVRNRFT SVDIDATLCP NPELSISHAY DTLFTAGPCL LGASINRVLG RHPQKSFTAG EINILADRRQ LEAGTSFIVG DGVALEARVP GRSVILKQDK WDMGAHRFTY VERNLVVSAT DLQDSNDRDT HKKNKKTEHY STTHAKTGIY GLEGLYTDTR IANEDIRIIL DVSKQASVPS STS
|
| |