Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42495 |
Symbol | |
ID | 7196679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 214447 |
End bp | 218323 |
Gene Length | 3877 bp |
Protein Length | 1233 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177043 |
Protein GI | 219110583 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAATTATATA TACCTTACCG AACAGAAACA TATACCTTCC CGAAAACAAA CCTACCGTTG TCCTCCTACT CTACCATCGC ACCCACTCTG CGCCCAAAAG AGAACATTTT ACAAACCGTT GACATTGCCA ACAGCACGCA CTCTTGCGTC CGGAACGAAC GTACGTGTTG TTACAATGGT TCGGACCCGT TCCAATTCGA AATCAACACC CCCATCGCAA CCCAACGAAC CCCATGACGA CAACGACGAT GGTTCCTCTC GAAACGAATC GGACGATGAT ATTCCCAAAC CCGTCAGTTC CGACAGCGAA AGTGACGACA CGCAGCCTCC GACAACACTT CCGGGAAACA CGTCTGTGCT TCCGGGCACC GGCACGTCCC GTCTCTTTAG TCCCTACCGC AGTCTCGGCG TCGTCTCCAC CGGGGCCCCC TTTCACTTGA TACCGCACGA TCACTCCGGC AACGCCATGG TCTGTGTACC CATCGGTGAT CGTTTCCAAT TACTCCGCAC GGACCGACTC CATCCCGTCC TCGTCAGTCA AGCCGTGCCG CACGATGTAC AGCACGTCGT CACCGACGCC ACCCTCAGTA TTACCGTCGC CGCGCACGGC GACCGCCACG TCACGCTCCT ACACCGCACC CGACCCCTCG CGACCCGCGC GTTGGCCGCT AGTACACGCT GGAGGACGGT GCAGCTCTTG CCCCTGGGAC GGACACCGGT ACCGATGCGT GGCGAAAAAC AAGGTACCAT GGAGAACGCC GCCATTGTCG CCGCGATTCT GCAACGCGCA CCGACCGTAC GGGATGATGT TCCCCTCGTT GGACAAGACG ATGATGACGA TGACAACGAT GACAGTGAAT CGCTGGACAG TGACGACAAT GACGACGAAA CGACGTGCCT GGGGCAAGTG GTTGTTTTGC TGGCCACCCG AGACTCGATT GCGATTCAGA GACGGATTCG CTTGACCGCT TTGCCCAACT TTCGACCCAC GACCGCGTTG CATCCCTCCA CATATCTCAA CAAGATCGTT CTGGGCGGAA CCAATCAACC CAACCAGGAT ACTCCCTCTT CTCCCGCCAT GTGCTTGTTG AACGTGCGAT CGGCCCGAAT CATTCACGTA TTTGCTGGTC TTCCCTCGGA AACCGAAGAA TCGTCCGTCA CTACCTTGGA ACAGTCTCCC GCCGTCGATA CGATTGCCGT CGGAACCTCG TCAGGGAACG TTCACTTGAT CAATTTGCGT CACGATCAGA AACTGTTTAC GCTCCGTCAC AAAACCCACG ACGGAAATAA CGTCGGCATT TCTTCCATCT CCTTTCGGAC GGATCATTCC GCCTTGCAGT ACGATATCGC TCCAATGGCA GTGGGACGAG TGGATGGACA CATTACCATC TGGGACTTGA CGGCACCTAC CGAGATAGAG TCGGGTCGCA CCGTGCTGCA CGAAATGGAA CACGTGCACG TTGGCGGTGT TGCCAAAATC CAATACCTTC CCCAGGAACC CTTGCTGGTC TCCATTGGAC TCGCGTCCAA TCGTATCGCG ATGCATATTT TCGATAGTCC CGACCATTCC GCTCGCTTGT TGCGCTCCCG CCAAGGCCAT ACCGGTCCAC CCACACGGAT ACGTTACCTA CATCCCGGAT CAGGCGCCGG AGGCGGCGTC CTCGTGAACG CGTCTGACGG AACGGACGCG AGTGCCTGTC AAATTCTTTC CTCTGGTGGA CCCGATCGGA CGTTGCGCGT CTTTTCTACC GCCCGTACCG TGCTCGACAA AGAATACAGT CAGGGGGCAG GGCTGGAAAA GAAAGCCCGC AAATTCGGTA TGGACACGGT CGCGGAACTG CTCCTACCAC CCACCATTGG CTTGGCCACG GCGGAATCGC GCGCTCGTGA TTGGGGTGAC TTGGTGACAA TTCACCGGGA TCACGCTTTT GCCTACGTGT GGAGCACTAA GCGGGGCGCC CAGTCTGGGC CAATCTTGCG GCAGTCGACG TGGAACGTAA GTGCCATGAA GATTCCACCA CCGCCGCACA CGCACGCCAC GGCGGTCGCC ATGTCTGCCT GTGGAAACTT CGCTTTGGTG GGGACACGAG GTGGAACGAT CTACAAGTAT AACGTCCAGA GTGGGAATGC TCGGGGAATG TATCCGATTC AAGCAAAGGA AGATAGCAAG CCGAGAAGAC AGGTTGTAGC GGGAGATCTC GGGCGTACCA TGAAATCGTT GGAAAATAGC ATGAAAGTCA GCAATCGGGC TGCCAATGTG GACAAGAAAG AACTCGATGC CGAGCAAGAA GCGAAACGGG AAAGTCGTAT TTTGGCAAAG TTGCAAGCAG CATCCCACAC GGGTCATTCC GTAACAGGAT TAGCGGTAGA TTCGGTAAAC AAGGTCCTCA TATCGGTAGG GGCGGATGCA AAACTTATAT TGTGGAACTT TGCATCGCAT GCTCCGCACA AGAAAAGCCC GTACACTTTA CCATGTCCCG CCACTCGAAT GTGCCACGTG CGTGATTCAG ACTTGGCAGC CATTGCTTTG GAGGACTACT CTGTTGTGTT GTTCGATTGT GCAGCCCTGT CGATTGTTCG TCGTTTTGGA GCTACTGGTG GTCACGTTGG ACCAATTAGT GACCTAGGAT TTAGTCCAGA CGGCCGCAGT CTCTTTACTG CATCGCTGGA TTCTTCTTTG CGGGTATGGG ATGTACCTAC CAACACGTGC GTCGACTGGC TCAGCTTCTC GACGGCTCCG ACGTCTTTGA CGATCAGTCC AACGGGTGAG TTTCTAGCGA CAACACATAA GGGAAAACTC GGTATCAGTG TTTGGAGCGA CCGAACCTAC TATCAAACCG TAAACATTGA CGGCACACCA CTAAAGGAAC CTGCGCGCAT GGACGAACCG GTGCCCATGG CCGATGATGC TCCTTCACAA ACATACACAG CAAGTAAACC GAGCGAGGAA AGAGTGCCTA CCGGTAACGA AAGTGAAGTG GACGGGGTTG ACGATAAAGG CCCCGCCCTA CCCAAAGAAG CTGGACTCAT CACGCTCTCT GGACTGCCAC CTGCCCACTG GAAGAATCTG TTTCACCTGG AACTCGTTAA GGAGCGCAAC AAGCCCAAGG AAGCCCCCCA AAAGCCTCCT TCGGCACCGT TTTTCCTCCA ATGGCGTTCG GGTGAGTCGA TCAGCGAAAC TGTTGCCAAT CCTCAGTTGG CTTCCAAATC CAAACAAAAC GTAAGTGAGG AAGAGGAGTG GGTTGCGGCT TGGACTGATA ATGATGATGA CGAAGCAAAA GTCGATGTGC CTGAAGCAAG TGGACTAGTC AAACGAGACC ACGAGAAAGT TGAAAAGGAA GCTGCGTCCT CCAAAAGACG AAAAGTTACA CGCTATCGTT CAGCACTGGC GTCTCTTCTG GAACAATGTA ACAACAGGGC TTCTGGATCG AATCAGAAAC GCTTCCAGTT GGTTACCGAC CATATAGGGA AGCTTGGACC ATCCGCGATT GACGTCGAAC TTTCCACTCT TTGCAGTGGA TTGCATGATT TGGAAGAGGG CCTTCCATTG TTACAGCTCA CCTGTTACTG GTTGTTGGAA GCTTTACAGT CACGCGAACG GTACGATGCC GTTAATGCGT ACTTACATCG CTTTCTGCAC CTGCATGCAT CGGTTATTGT TGGTATCGAC GAATTTTACC GTGGGGATGA TCAACCGCTA CGCGTGAAAC ATTCCGAGCA AGAACGAATT GAGTTGGAGA CACAACGTGA CCAACGGATC CGCCTACTTG AGTCCATCAC AGAGCTACAC GATGCGCAAA AATCGGCTTC AGAAGCGCTC CAGAACAAGA TGCAAAATAC GCTCTGCCTC TTGCGGCACT TTTCTCGGAT GATCTAG
|
Protein sequence | MVRTRSNSKS TPPSQPNEPH DDNDDGSSRN ESDDDIPKPV SSDSESDDTQ PPTTLPGNTS VLPGTGTSRL FSPYRSLGVV STGAPFHLIP HDHSGNAMVC VPIGDRFQLL RTDRLHPVLV SQAVPHDVQH VVTDATLSIT VAAHGDRHVT LLHRTRPLAT RALAASTRWR TVQLLPLGRT PVPMRGEKQG TMENAAIVAA ILQRAPTVRD DVPLVGQDDD DDDNDDSESL DSDDNDDETT CLGQVVVLLA TRDSIAIQRR IRLTALPNFR PTTALHPSTY LNKIVLGGTN QPNQDTPSSP AMCLLNVRSA RIIHVFAGLP SETEESSVTT LEQSPAVDTI AVGTSSGNVH LINLRHDQKL FTLRHKTHDG NNVGISSISF RTDHSALQYD IAPMAVGRVD GHITIWDLTA PTEIESGRTV LHEMEHVHVG GVAKIQYLPQ EPLLVSIGLA SNRIAMHIFD SPDHSARLLR SRQGHTGPPT RIRYLHPGSG AGGGVLVNAS DGTDASACQI LSSGGPDRTL RVFSTARTVL DKEYSQGAGL EKKARKFGMD TVAELLLPPT IGLATAESRA RDWGDLVTIH RDHAFAYVWS TKRGAQSGPI LRQSTWNVSA MKIPPPPHTH ATAVAMSACG NFALVGTRGG TIYKYNVQSG NARGMYPIQA KEDSKPRRQV VAGDLGRTMK SLENSMKVSN RAANVDKKEL DAEQEAKRES RILAKLQAAS HTGHSVTGLA VDSVNKVLIS VGADAKLILW NFASHAPHKK SPYTLPCPAT RMCHVRDSDL AAIALEDYSV VLFDCAALSI VRRFGATGGH VGPISDLGFS PDGRSLFTAS LDSSLRVWDV PTNTCVDWLS FSTAPTSLTI SPTGEFLATT HKGKLGISVW SDRTYYQTVN IDGTPLKEPA RMDEPVPMAD DAPSQTYTAS KPSEERVPTG NESEVDGVDD KGPALPKEAG LITLSGLPPA HWKNLFHLEL VKERNKPKEA PQKPPSAPFF LQWRSGESIS ETVANPQLAS KSKQNVSEEE EWVAAWTDND DDEAKVDVPE ASGLVKRDHE KVEKEAASSK RRKVTRYRSA LASLLEQCNN RASGSNQKRF QLVTDHIGKL GPSAIDVELS TLCSGLHDLE EGLPLLQLTC YWLLEALQSR ERYDAVNAYL HRFLHLHASV IVGIDEFYRG DDQPLRVKHS EQERIELETQ RDQRIRLLES ITELHDAQKS ASEALQNKMQ NTLCLLRHFS RMI
|
| |