Gene Information |
Locus tag | PHATRDRAFT_2097 |
Symbol | |
ID | 7201394 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | - |
Start bp | 2515 |
End bp | 4257 |
Gene Length | 1743 bp |
Protein Length | 552 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180550 |
Protein GI | 219119587 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.143499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACAAGTATG TGCCACCGCA TCTGAGAAAC TCTCAGGGCA GTGGTGGTCG CACCAGCGAC TCTGTATCGG ACGGCCGTGG CGGAGACCGT CGCGATTCCT ACTCGGACCG TCGTGGAGAT AGTGGCGGCC GTGGCGGTTA TGGAGGAGAC CGTCGCGGCT CCTACGGGGA CCGGCGCGGA GAACAAGGAC GTGGAGGGGA TGGTCCACCT CCTACTGGAA ACTCCCGTTG GTCGGAAGGC GGTGGAGGCG GCCGCGGCTC GTCTTCGTAC GGAGGAGGCC GCGGACAATC TCGTCGTAAC GCTCGTGGCT TTCACGGTGA CCTCAAGGAA GACCCGCGCA CACAAGCACG TCTCTTTGGT CGCGACGACC ACCAAACGAC AGGAATCAAC TTTGACAATT ATGACAAGAT TCCCATTGAA GTGTCGGGAG ACGATGTTCC CGATCCTATC GAAACCTACT CCCCCGAAAC TATCGGAGAC GATCTCTTTC GAAACACTCA GCTATGCGGC TACTCACGTC CTACTCCAGT CCAAAAGTAC AGTGTTCCTA TCTGCACTCA GGGACGCGAT CTCATGGCCT GCGCGCAGAC GGGTTCTGGA AAGACGGCAG GTTTCCTCTT TCCCATTATC ATGTCCATGA TAAAGCGAGG TGGAAGCGAC CCACCCGAGA ATGCTCGCCG TCGTATATAC CCCGAGGCGC TGGTATTGGC TCCTACACGC GAGTTGGCTC AGCAGATTCA CGAAGAGGCC AAGCGTTTTA CCTACGCTAC AGGCATTGCT TCGGTAGTGA TTTATGGAGG AGCAAACGTG GGCGACCAAC TGCGTGAAAT GGAGCGCGGC TGTGACTTAC TGGTCGCCAC CCCGGGTCGT CTGGTCGATC TGATTGAACG GGGACGTCTC GGCATGGAAA GCGTCTCGTT TCTTGTTCTG GATGAGGCCG ATCGCATGTT GGATATGGGT TTCGAGCCTC AAATTCGTAG GATCGTGGAA GAATCGGGCA TGCCCGGTGG TATTGATCGC CAGACAATGA TGTTTAGTGC CACCTTTCCC GCCAATATTC AGCGTTTGGC AAGCGATTTC ATGCGTGACT ACGTTTTTTT GACGGTTGGA CGCGTGGGCT CCGCCTCCAA GGATGTCACC CAAACTGTAG AGTTTGTGGA GGAACGCGAT AAGGTTGACG CCTTGATGAA GTTTCTTTTG ACCATTCAAG ATGGCCTCAT CCTAATTTTT GTTGAAACGA AGCGCTCGTG CGACTACGTT GAAGACGTTC TCTGCGGCCA AGGATTTCCT GCCTGCTCGA TCCACGGCGA TAAGTCACAG CGCGAACGGG AAGACGCACT TCGCTATTTT AAGAACGGAA ATACGCCAAT TCTTTGCGCA ACTTCTGTAG CCGCCCGAGG ATTAGATATT CCGAACGTTA CCCAGGTTGT CAACTACGAC CTTCCGTCCA ACATTGATGA CTATGTGCAT CGCATTGGAC GTACAGGTCG CGCAGGAAAC ACTGGGGCAG CGCTGTCTTT TATCAACGAG AGTAATTCGG GTGTTGTCCG CGAGCTGCGC GATCTTCTCG ACGAGAATGA GCAGGATGTT CCCCCTTGGC TCAATCAAAT GTGCCAGTTT AGTGGCGGCC GTAGTAGCGG CGGAGGTGGT CGAGGAGGAG GCGGCCGTCG TGGCGGCGGT GGCGGAGGTT TTGGCAGTCG TGATGTACGC AGCAAAGGTG GCAATGATCG CGGACAAGGC GGC
Protein sequence | NKYVPPHLRN SQGSGGRTSD SVSDGRGGDR RDSYSDRRGD SGGRGGYGGD RRGSYGDRRG EQGRRGQSRR NARGFHGDLK EDPRTQARLF GRDDHQTTGI NFDNYDKIPI EVSGDDVPDP IETYSPETIG DDLFRNTQLC GYSRPTPVQK YSVPICTQGR DLMACAQTGS GKTAGFLFPI IMSMIKRGGS DPPENARRRI YPEALVLAPT RELAQQIHEE AKRFTYATGI ASVVIYGGAN VGDQLREMER GCDLLVATPG RLVDLIERGR LGMESVSFLV LDEADRMLDM GFEPQIRRIV EESGMPGGID RQTMMFSATF PANIQRLASD FMRDYVFLTV GRVGSASKDV TQTVEFVEER DKVDALMKFL LTIQDGLILI FVETKRSCDY VEDVLCGQGF PACSIHGDKS QREREDALRY FKNGNTPILC ATSVAARGLD IPNVTQVVNY DLPSNIDDYV HRIGRTGRAG NTGAALSFIN ESNSGVVREL RDLLDENEQD VPPWLNQMCQ FSGGRSSGGG GRGGGGRRGG GGGGFGSRDV RSKGGNDRGQ GG