Gene PHATRDRAFT_47105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47105 
Symbol 
ID7202018 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp423920 
End bp427165 
Gene Length3246 bp 
Protein Length1038 aa 
Translation table 
GC content46% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181206 
Protein GI219121714 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00493639 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACCA CCAGAAGCAA TACTTTTAGT GATGCTACTG TGGAGTATGT TGTTGGAACT 
GTCCTGGATG CTAACTCTGA GTCTCCTTAC AGGCTTGTCC TGAAGGAAGC TGGTATTGAA
TCTATGATGG ACATTCTTGA GTTGACTTTG GATGACCTAC TGTTTCTGCA GTGGACTTCA
GGAGAGGAGA CCCCCAAGAA GTTGACCCTT GTCCAATCCA AGCGTATCAT GCACCTCATT
GCCTGGCATA GAACTCAGGA TGACCCAAGC AATGTGGATT GGTTTTCATT GACTCCTACA
GTACTCAGAC AGTTTAGAGA AGGTGAAATC TATTCCAAGC CCCAATGTGC AGATGAGATG
TCTGAAAATT CCTACACTGT TCCCCATGCC CAGCCAGCAT TGCCTAATGC TGCTTATGAC
TTTGACAAAG GTACTGAGAG AAGTATTGCT GACTATCCTG TTCTGAAAGA AGCCAAACAA
TGGTCCTCTT GGAATCGCCA GACCAGAGCT CTTGCCCTTT CCCATGGCCT ACTGAATGTA
TTTGACCCTG CCTATGTTCC CCTCAACCCT GATGAAGCCT CTCTGTTTGC TACTCAGCAA
AGATTTGTCT TCAGTGTGTT TACCATGGCC ATCAAGGAAA CCAAGGGAAT GATTATCATA
CGCCAGCACT CTGATGAAAA GAATGCTATA TCTCTGTTTG GCAATGCTCA GCTTGTCTAC
ACTGCCTTGA TGGCTGCCTA TGAGGGTGGT GTTGTGGCTA CACTTTCTGC TCAGTATCAT
GAAACCCTTC TCTTGAATTA CAACTTGAAC AATTCCTGGA CCAAACCTCT TGTGACCTGG
TTTTCTTCTT TTGAGCACAA ACTGCTGGAT TTGGACAATG TGCTTCCTAC TCCCAAGGAT
GATGCTTGGA AGCGCAACAG ACTTGAAATG GCTGTGCTTC ATCATACTCA GCTTCAGACC
TTTCATTCTA CACTTTCTAC CCAGGCACTT GTGATGGGCA AGCAACATGA CTTCCTCTTT
GAACTTGAAG CCTGCAAGAC TCAAGCTGCC AAATTGGATG CGCAGGCTGG CATGAAAGTT
AAGACACAGA GGCAGACCAA CAACCATGAG CGTGGTGGGA CCAAAGGTAC TGGCCATGGC
AATGCTTCCA ACAGGAAGGG CAATGCTGGC CGTGGTGCAT CCAAGGGTAA AGCCAAGGGA
GGCAAGTATT CTAACTATAT TGACCCTGCA AAATGGAATG CTATGACCTC AGAAGAAAAG
CAAGCTGTCT ACGATGCATG CTCTACACCC AGTGCAAGCA ACACTCCCAA TGCAAACCCA
GTCCCTTCCT CTGTGCTCAT CAACCAAGGT ACTGTGCAAC CAAGTTCTAC TTCTGATGCT
GTTCCAACTG GACAACACAT TGTCCCCAGT AGTGGTGCCT CTGCTCTCTC TGGTACCACC
AACCAGTCTT TCATCAGGCA ACTGCTTTCT AATGCTACTG CCAGAACTCC CACTGTACCT
ACTTCTTCCC ATGATGGGGA GATTGTCATT GATGGGCTCA GGTTCAGACA TGTTAATATG
CACAAGGTCA GCTACTGTGT TAGCAACTAT GACTTGGCCC TTCAGACCCA CCAGGGTTCA
CTCATTGATG GTGGGGCTAA TGGTGGCATG TCTGGTGCTG ATGTCAGAGT CTTGGAAAAG
GGATTTGCCA CTGCTGATGT TACTGGTATT GGCAATCATG CTGTTTCCAA CTTACCTATT
TGTCAGGTTG CAGGTGCTAT CATGACTACA AATGGATTGA TCATTGGCAT CTTCAGTCAG
TATGCACATT TTGGAAAAGG AAAAACTATT CATTCTAAGC CTCAGATGGA ACAGTTTGGA
CTTACCATTG ATGACAGATC CAGACTTAGT GGAGGGCAAC AAAGAATGGT AACCCCTTGT
GGACACATCA TCCCTTTGCA CATTTGCAAT GGTCTTTGCT ACATGGATAT GCACCCTCCC
AGTGATACTG AAATGGATGC CCACCCCCAT GTCTTCTTTA CTGCTGACAT GCCTTGGGAT
CCCTCCATTT TGGACAATGA ATACACAGAG CATGAGTTCT CTGACTGTCT TGCACCTGAA
GACTTCACTC CTTTGGATCA TCGTGTGAAC CAATTTGGAA CTACCACTGA TTCTGATTTC
TACCTTGACA CTTGTATTCA TGCTGTCCAT AACATGCAAC TTGTGCATTC CCAACATGTC
TCAACTCAAG TGCCTGACTT GCAAGCACTA CGCCCTAATT TTGGATGGAT TCCTGTTGAG
AGATTGAAGA ACACCTTGGC TAATACAACT CAGTATTACA GGGCTTCCAT CTCTTACCCA
TTCAGAAAGC ATTACAAGTC AGTTTCCCTG CTGCTAATGT TCACAGATTG AATGAATGGT
TTGCTACTGA CACCTTCTTT AGCAATGTAC CTGCTCATGA TGATGGTTAC ATGCACCATG
GTGGTGCTAC TATGTTGCAA GTCTATGCTG GCAAGGACTC TGGATACTTA GCTGGGTATC
CCATGAAGAT GGAAGGTCAA ATGCCCCAGA CTCTAGAAGA CTTTATCCGT GATAAAGGTG
CTCCTTTGGG CTTGTTCAGT GACAATGCTA AGGCCCAGAC CTCCAAGGCT GTTGAGACCA
TTCAGCGCCT CTACCATATT GCAGATGCTC AATCTGAGCC TCACTATCAA CATCAAAACT
TTGCTGAGCG CTGTATCCAA AACATCAAAT GTATGATCAA TACTATTATG GATCGCACTG
GTACCCCTGC CAAGTACTGG CTCCTTTGCA CTCTGTTTGT CATTGACCTA TCCAATCACC
TTGTGAGTGA TACACTTCAA GCAACTCCTT TGACCCGATG CTTTGGTATT CCCACTGATG
TTTCTGCTTA CCTCACTTAC CATTGGTGGC AATTGGTTTA TTTTGAGAAC CATGATGGCT
CTTTTCCCTC TACTCCTAAG GAAGGCCTTG CTCATTGGGT TGGTCCTACT AATATGAAGG
GGGATGTATT GACTTATCAG TTGCTGACTG TGGATACTCA GCAGCTGCTC TTTCGCTCCA
ACATTTGTCC TGCTACCACT GACCCCATGG TCCCTAATGC CAGAGTTGAT GCCTCTGCTG
CTCCACATCT TCACCTGGAG GCAGGGGAGG AAAAGGACCA GTCAGACAAC ATCAAGTCTA
TCTCTGCTTT CCAAAAGATT GATCCTTCTT ATGTAAAACT GCCTCTCTTC TCTCCAGATG
AGTTAG
 
Protein sequence
MTTTRSNTFS DATVEYVVGT VLDANSESPY RLVLKEAGIE SMMDILELTL DDLLFLQWTS 
GEETPKKLTL VQSKRIMHLI AWHRTQDDPS NVDWFSLTPT VLRQFREGEI YSKPQCADEM
SENSYTVPHA QPALPNAAYD FDKGTERSIA DYPVLKEAKQ WSSWNRQTRA LALSHGLLNV
FDPAYVPLNP DEASLFATQQ RFVFSVFTMA IKETKGMIII RQHSDEKNAI SLFGNAQLVY
TALMAAYEGG VVATLSAQYH ETLLLNYNLN NSWTKPLVTW FSSFEHKLLD LDNVLPTPKD
DAWKRNRLEM AVLHHTQLQT FHSTLSTQAL VMGKQHDFLF ELEACKTQAA KLDAQAGMKV
KTQRQTNNHE RGGTKGTGHG NASNRKGNAG RGASKGKAKG GKYSNYIDPA KWNAMTSEEK
QAVYDACSTP SASNTPNANP VPSSVLINQG TVQPSSTSDA VPTGQHIVPS SGASALSGTT
NQSFIRQLLS NATARTPTVP TSSHDGEIVI DGLRFRHVNM HKTHQGSLID GGANGGMSGA
DVRVLEKGFA TADVTGIGNH AVSNLPICQV AGAIMTTNGL IIGIFSQYAH FGKGKTIHSK
PQMEQFGLTI DDRSRLSGGQ QRMVTPCGHI IPLHICNGLC YMDMHPPSDT EMDAHPHVFF
TADMPWDPSI LDNEYTEHEF SDCLAPEDFT PLDHRVNQFG TTTDSDFYLD TCIHAVHNMQ
LVHSQHVSTQ VPDLQALRPN FGWIPVERLK NTLANTTQYY RASISYPFRK HYNNVPAHDD
GYMHHGGATM LQVYAGKDSG YLAGYPMKME GQMPQTLEDF IRDKGAPLGL FSDNAKAQTS
KAVETIQRLY HIADAQSEPH YQHQNFAERC IQNIKCMINT IMDRTGTPAK YWLLCTLFVI
DLSNHLVSDT LQATPLTRCF GIPTDVSAYL TYHWWQLVYF ENHDGSFPST PKEGLAHWVG
PTNMKGDVLT YQLLTVDTQQ LLFRSNICPA TTDPMVPNAR VDASAAPHLH LEAGEEKDQS
DNIKSISAFQ KIDPSYMS