Gene PHATRDRAFT_38642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38642 
Symbol 
ID7203348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp588073 
End bp589836 
Gene Length1764 bp 
Protein Length587 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182565 
Protein GI219124553 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.429359 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGGTG GCAAGGTCGT CACCATTCAA GTACCGTACC TGTTCAAACA TTTGGTGGAC 
GACTTGCCGT CTGGAGCTGC TATTACGAGT GAAGGTAGTG CTACTGCTAC GGAAGCAGCC
ACCGCGATGA TGGCGGCGGA CCCTTCCGCC TCCGTAGCTG CCGGAGTACC AGTATTCCTT
CTACTCGGGT ACGGGCTTTC ACGCATCGCT TCATCGGGCT TGCAGGAATG GCGGAATGCC
GTATTTGCGC ACGTTGCACA AGACGCCATT CGAAACGTTG GGCGGAGTGT CTTTGATCAT
GTCCATCGAC TTGATATGCA GTTTCACCTT TCCAGGAACA CGGGACAGCT CAGTCGGGTA
CTAGATCGAG GACAACGTTC GATTTCCTTT ACTCTCAACG CCATGGTTTT TCATATTGCT
CCCACTATAC TTGAAGTCGG TATAGTCACA TCCTTGATGG GATACCAGTT TGGCTACGCG
CATAGCAGTG TCGTAATGGC GACCGTGGTA GCCTACACCG GATTTACCCT TGGAGTCACC
TCCTGGCGAA CGAAATTTCG TCGCGAAATG AATCGACTCG AAAACCAGGC CAGTGGACGC
GTGGTTGATT CCTTGCTTAA TTACGAAACG GTCCAATACT TCAATAATGC ACAATACGAG
GGTGAGCGTT ACGAAAGTAG TCTCAAGGGA TACCAAAAGG CAGCACTAGA GTCGCAAACT
TCCTTAAGCT TGTTGAACTT TGGGCAAGCG GCTATTTTTT CAGCGGGTTT GACGTCGGTC
ATGTGGTTGA CTTCACAGCA AATTGTGGAA GGCGCAGCCA CCGTGGGGGA TTTGGTGCTC
GTGAATGGAT TGTTGTTTCA GCTTTCCGTC CCACTCTTCT TCATTGGGTC CGTCTATCGG
GAGGTGCGAC AGTCACTGGT GGACATGGAA GCCATGTTTC AATTGCGAGA CACGATACCA
GCCATTGTTG ACAAGCCAAA TGCGCTTTCT TATGATCCGA GTACCATGGG AACTTCGATT
GCGCTCCACA ACGTACACTT TGCTTACCCA ACTGCAGCGA ATCAACGACC AATTTTGAAC
GGCACCACGC TGGACATTGC TCAAGGCAAA ACAGTCGCCT TCGTTGGTTC TTCCGGTTGC
GGCAAGAGCA CAATTCTCCG ATTACTCTAT CGTTTTTACC ATCCTGATCA AGGATTGATT
TCCGTCGGTG GTCATGATAT TCAGGACATG ACGAAATATT CTCTGCAACG TGCCATAGCT
GTTGTTCCGC AGGATACCGT TCTGTTTCAC GAGTCCATCG CGTACAACAT TCAATACGGA
GATTTGAGCG CGTCCTGGGA TGAAGTGATT GAAGCTGCCA AAAAGGCCAA GATACACGAT
ACGATTATGA GTTTTCCGGA TGGCTACGAA ACGGTAGTGG GAGAGCGTGG TCTCAAACTT
TCGGGTGGTG AAAAGCAGCG CGTGGCCATT GCTCGGGCCA TTTTAAAGAA CGCCCCAATC
TTATTGTGCG ACGAGCCAAC GTCGTCCCTC GATAGTGAAA CGGAAACAGA TATTATGAGT
AACCTCAAAG ATGTTGGCAA AGGGCGGACC ACGTTGATCA TTGCGCATCG ACTGTCCACC
ATTCAAGATT GCGATGAAAT TATTGTCATG AATCGCGGGA TGGTCGTGGA GCGCGGCACA
CATGATGAGC TGATCGCCAT GGGTGGCCGA TACACGGAAT TAATCAAGAT GCAAGAGGCG
GTAGTTGACG AAGACGAAAA TTAA
 
Protein sequence
MMGGKVVTIQ VPYLFKHLVD DLPSGAAITS EGSATATEAA TAMMAADPSA SVAAGVPVFL 
LLGYGLSRIA SSGLQEWRNA VFAHVAQDAI RNVGRSVFDH VHRLDMQFHL SRNTGQLSRV
LDRGQRSISF TLNAMVFHIA PTILEVGIVT SLMGYQFGYA HSSVVMATVV AYTGFTLGVT
SWRTKFRREM NRLENQASGR VVDSLLNYET VQYFNNAQYE GERYESSLKG YQKAALESQT
SLSLLNFGQA AIFSAGLTSV MWLTSQQIVE GAATVGDLVL VNGLLFQLSV PLFFIGSVYR
EVRQSLVDME AMFQLRDTIP AIVDKPNALS YDPSTMGTSI ALHNVHFAYP TAANQRPILN
GTTLDIAQGK TVAFVGSSGC GKSTILRLLY RFYHPDQGLI SVGGHDIQDM TKYSLQRAIA
VVPQDTVLFH ESIAYNIQYG DLSASWDEVI EAAKKAKIHD TIMSFPDGYE TVVGERGLKL
SGGEKQRVAI ARAILKNAPI LLCDEPTSSL DSETETDIMS NLKDVGKGRT TLIIAHRLST
IQDCDEIIVM NRGMVVERGT HDELIAMGGR YTELIKMQEA VVDEDEN