Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_43235 |
Symbol | |
ID | 7196956 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2404585 |
End bp | 2406700 |
Gene Length | 2116 bp |
Protein Length | 618 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176973 |
Protein GI | 219110443 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAGGTTTTTA CCTGTGTCGC ACAAGAAATG GCAATATGCG TCGAATCCTG AAACTACCAA CGCCCAAACA GGCGGCAATC TTCTTTCTCT TGGTCGCGGC GTTCCTTGAC CCAGCGATCT ACGCGGCGCA AGGTTTGACC GCGGACTCGG CGATCCTGTT AACCTCGTCC ACCTCGTCCT TGCGTTTCCA CTCCTCCCTA CCGCCCCGGC AGAAACGCAC GGATCGACCG GCACGCGTCC ACCGATCGTA CGGCGATCGC TTTCGCGATG ACGATGATGT CCAAGGCGAT AGATTGGTCC CCAACGACGA AATTTCTCGC CGATTGAGTA CAGATCGACC CACGCTCTCT CGCACAGAAG TTGTGGGACC GCCCATTGTA CCGAGCAAAC CCAAAATTGT TGTACTGGGG GCATCCGGAA AAGTGGGAAG ACTCGTCGTC CAACAACTAC TCGAATCTAA TGTTGACATG ACTGTGGTGG CCTTTGTAAG AGACTACGAT AAGGTACGTC TGATGCCTGC GAAAGTGTGT GAACGAGGCC CGACCCCGAC GGTTCGCGTT GTTCGAGTGG AGTACCTACG GATGTCTTAC CGCATTTTAT TGACCTGGAT TCGAACTATG TTTTCTACTG TTACTCTAGG CATGTCGTGT ATTGTATGAT GATATCCTGG TAGCCAAAAG AAGTAAAACC CGTCCGGGAG GAAAGGGGGA ATCACGGGGT CCCAACCTTC AAATCGTGGA AGCCGATTTG GTACCTCCGG AGGAACTTCC AGGGTACGAG GATGAAGAGG AAACATCTTG GAGAAAACGG GCAAAGTCGG CCGCCTCGTT CTACAGAAAT TCAATGGAGA ACTATGACGA TCGCGATCAA ATTTCGGAGG GTGTGCAAGT GAACGAAGCC TTGCAAGAAG CGGTGAAGGG ATGCACTGCT ATCATAAGTT GTGTTGGATC GGTGCGTCCA ACGAATCTGT GGACAGATAT TTTGGAACGA CCCATACTAC GCCTACTGCG CAAGGATGTC AGTCGTTGGT GCAAGGATGC CCGACACCCG TACTACGTCA ATTTCGCGAG CACGAGAAAG GCCCTAAACT ACGCGGAGCG GGAACAGCTA CGCAGGGAAG CCGCCTTGAC CGTTAGTGAT AGCGATGACA CCCAAGCCTC TACAAAAGAA AAGGAGTCGG TACCACGCAT TCGATTCATT CGGATTTCCG ACTTATGCTT AGCGTCACAG CCTTGGAGTT TTGTCCCGCT AGTGACCAAC ACGATGCATT CAATGGTCTT TCGGTATCAA GACATGGCGG AGCGCTTATT GGAGGCCAGC TCATTGATAG AGACAGTTGT GCTACGACCT GGAGATTTGG TAGACGAAGA TCGCGACGTC CAGACAACTT CCTTACAAGT CTGTCCCAGT GGTCGACTCC CATCACCATC GAGAGTGGGC CGAGATGACG TTGCTGCTTT GGCGGTGTCT GCAGCATTGT TCGATTCCAA AACCTCTTCC AAAGACGACA AAAGTTCTGA AGCAGGGGAG CATGAAATCG AGGAGCCCTT CCATTACACG TTGGCGGTGC GGTGGGCTGG CGAAGATCTT TCGCCTTTCC CGGCTCAGGG TCGATTGAAA GACGGAATGC CTGATGCCAA CTTGTGCCTA CAAGATGCGC TTCGGAAGCT GAGCAAAGAC CGAGCGAAAC CGAGTCTGCA GAGAACCAAA CGGCGTGCAG CTTACCCAGA AACCGTTTTG AGATTCGCGA CAACGTTGCA AGCTTCACGC CGGAGCAAAC CCTATGGCGT CTGTGTAGCC GTACCACTCT ATTTGATTTT GGCACTCTTT GCGCGTTCCC TATTTCACTC TGTGGCATCC TTCTTTCCGG GACAGCGATG GATACGGCCT GTATTTACTC CAGTGTGGGA TTTCATCGCG ATGAGCATTG CGGCGGTGAT GGCTCGCGTT GGTGTGCTGT TCCAGGGCCG ACTGCCAAGC TGGAAATGGG TCAGCTGGCG GGCCAACCCC AAATACATTT CGTTTTGAAA ATACATCAAT TTTTGCTACA TTTCATCCTA TCTCTATGGA ATTTCTGGCG TGCTAATCTA TTCGTAACGG AATTGTAGTG GTTGGC
|
Protein sequence | MRRILKLPTP KQAAIFFLLV AAFLDPAIYA AQGLTADSAI LLTSSTSSLR FHSSLPPRQK RTDRPARVHR SYGDRFRDDD DVQGDRLVPN DEISRRLSTD RPTLSRTEVV GPPIVPSKPK IVVLGASGKV GRLVVQQLLE SNVDMTVVAF VRDYDKACRV LYDDILVAKR SKTRPGGKGE SRGPNLQIVE ADLVPPEELP GYEDEEETSW RKRAKSAASF YRNSMENYDD RDQISEGVQV NEALQEAVKG CTAIISCVGS VRPTNLWTDI LERPILRLLR KDVSRWCKDA RHPYYVNFAS TRKALNYAER EQLRREAALT VSDSDDTQAS TKEKESVPRI RFIRISDLCL ASQPWSFVPL VTNTMHSMVF RYQDMAERLL EASSLIETVV LRPGDLVDED RDVQTTSLQV CPSGRLPSPS RVGRDDVAAL AVSAALFDSK TSSKDDKSSE AGEHEIEEPF HYTLAVRWAG EDLSPFPAQG RLKDGMPDAN LCLQDALRKL SKDRAKPSLQ RTKRRAAYPE TVLRFATTLQ ASRRSKPYGV CVAVPLYLIL ALFARSLFHS VASFFPGQRW IRPVFTPVWD FIAMSIAAVM ARVGVLFQGR LPSWKWVSWR ANPKYISF
|
| |