Gene PHATRDRAFT_36397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36397 
Symbol 
ID7201544 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp350619 
End bp352936 
Gene Length2318 bp 
Protein Length759 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180991 
Protein GI219120508 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGACG ATGGAACATA TTCGGCTCGC TTTTTAGAAG ACAATGGCCT GGAACTTATT 
ACGGATGAGC AGGGTATGGT CACAGCACGA CCGCTAGGAC TCAACGATGG ACAGGTTCTT
GAAAAGGCTG TATCGCAACC CCAACAACAC AAACAGAAAG ATCAAAAAGT GGTTGAATTT
GATCGTGACG AAAGAATACC ATTGATCATC TCGACGCAGC CATCCATCGA CAGTCACGTA
CAGACCGAAC AGCCGGTAAA ACAAGCCCCG GCGAAAAAGC ATCCAATAAC GGAGGCACCA
GCAAAAGTGG TGCCGCAAAC AGGATTTAAT GTGGTGCTGA CACACTGTAC TGCAGATTTC
GATTCACTAG CGTCGGCCGT CGGTCTCGCC AAACTCTGGA GTGCGCAAGA TACTTCCTCG
TCAACAGCGG AAGCCAATAA AACCTTTGAT TCAGCGTCGG ACGTCCCTAC CTTTGTGGTC
TTGCCGCGTG GCGCTCATCC TGGTGTTCAG CGATTCTTGG CACTGCACAA ACACTTATTT
CCAATTCGTT CGCTCAAATC ACTACCTTCA GACTTATCGG GGTTGAATCG GTTGGCGCTG
GTCGATGCGC AGAGACGGGA TCGTATCGGA CCAGCTGAGC CTTTGCTCAA ACATGCCAAG
CGCATAACGG TTGTGGACCA CCACATCGAT CAGGACTTGA TATTCCAGCG ACTGATTATG
TAGTGGACAA GGTTGGTTCT GTATCCACGC TTATTTCAGA AAGCCTTCGT AAATCGAAAA
TTTTGTTGAC GGAGGCCGAA GCAACGTTGT TAGCATTGGG TGTCCATGCT GATACTGGCT
CTCTTTGCTT CGATTCGACG ACGCCTCGAG ATGCTGTAGC GCTGGCATGG TGCTTGGAGC
AGGGCGCCAG TCAAGTGGCC ATCGCGGAAC ACGCACAAAC TTCGCTCTCA GCTGAACAGC
AAGGTGTCTT GACGCAAGCA TTGATCAATA CGAACTCAAC AGCTATTTAT GGCGTTACCT
TGTCGACTGT GCTGTTATCA GCGGACGGAT TCATAAACGG CTTGGCTGCC GTGACGCAAG
ATGCCATGGA GTTGAGTAGC AGTGACGTTT TTCTTCTAGC GTTAGTATAC GAAGCCCAAG
CTGGTGGGCG TCGGCGAAAG CGAAAAGGCT CAGGGAGTTT ACTGACAAGC CGTTTGTTAA
CCAAAGATAA GTCTTCCGCC AATCCAGGGG GAAACAGCAA CAATGACGCA CAGGTATTTG
AGGCTGAAGC CTGGAAAGGC GGCCCCGAGT ACATTAAACA GCGTCGCTTA CGGTCTGCCT
TTGATCGCAA AGACAACGAT GATAGCGGCT TTTTGGAAGT TGATGAAATC ACGGCTGCGC
TAGCTTCATC GGGCGTGATT GCTTCCCCCG AAGCGGTAGC TGATCTAATT CAAGCTATCG
ACAAAGATGG CAACGGCAAG ATTGACTTCG ATGAATTTGT TGCATTTTCT GAACAGGCTG
AGACAAGACA ATTGGAAAGA GATGCCCTCA TGTCGAAAGG ATCAACGACT ATGATCATCA
TTGGCCGAGT GAAAGCGGGA GTCAATCTCA AATCCGTCAA GCTGAACAAG CTGCTAGAAA
AGTTTGGTGG TGGTGGCCAC GCAAAGGCAG CTTCCGCAAC TGTGCGTCTT GGTGAAGAAG
CGGAAGCTTC GGATGTCTTA CAGGATCTAG TCGATGAGTT GATCGAATCT AGTTTGAACG
AGCAACCCAC CGTCGGGGAC TTTATGACGG CTCCTGTTCT TTCCGTGCAA CCTGAGATGA
CGGAACATCA AGTTGAAGAT CTGTTCACCC GGTACGACGT TCGTGCGTTG CCGGTAGTAA
ATGAAGAGAA CGATGTGATT GGCCTCGTCA CCTACAAGGA AGTTGCGGCC GCGAAGCAAC
GTTTGTGGAA CAAGGAGCAG AAGCGACTGC GAAGAGAGCT TGAGTTGGTA GAAAAGGGCG
AAGTGGTGGA CGAGGCCGAT CAGAAGGTAG CCCAAGAGCG ACGCAAAGGG TCTACCGTCA
AGGGTTGGAT GCTGCAACAC GTGCAGCTCG TCGAAGCTAG CAAGACAATG GCAGAAGTTG
AATCCATTCT GCTTGAAAAT GACGTAGGAT GTATGCCGGT TGTGGCAGAC GGGACGAAAA
AGCTGGTCGG TATGGTTACA CGAACTGATT TGCTTCGGCA ACACCGATAC TACCCTTCTC
TGCACTATCA CAATAAGGGA TTCGCTAACT CGATCGCTGA CCGAAAACCG ATCATTGCTC
TACGAAAGAG GTTAAAGCAA TTCGATATCG AGGAATAG
 
Protein sequence
MTDDGTYSAR FLEDNGLELI TDEQGMVTAR PLGLNDGQVL EKAVSQPQQH KQKDQKVVEF 
DRDERIPLII STQPSIDSHV QTEQPVKQAP AKKHPITEAP AKVVPQTGFN VVLTHCTADF
DSLASAVGLA KLWSAQDTSS STAEANKTFD SASDVPTFVV LPRGAHPGVQ RFLALHKHLF
PIRSLKSLPS DLSGLNRLAL VDAQRRDPHN GCGPPHRSGL DIPATDYVVD KVGSVSTLIS
ESLRKSKILL TEAEATLLAL GVHADTGSLC FDSTTPRDAV ALAWCLEQGA SQVAIAEHAQ
TSLSAEQQGV LTQALINTNS TAIYGVTLST VLLSADGFIN GLAAVTQDAM ELSSSDVFLL
ALVYEAQAGG RRRKRKGSGS LLTSRLLTKD KSSANPGGNS NNDAQVFEAE AWKGGPEYIK
QRRLRSAFDR KDNDDSGFLE VDEITAALAS SGVIASPEAV ADLIQAIDKD GNGKIDFDEF
VAFSEQAETR QLERDALMSK GSTTMIIIGR VKAGVNLKSV KLNKLLEKFG GGGHAKAASA
TVRLGEEAEA SDVLQDLVDE LIESSLNEQP TVGDFMTAPV LSVQPEMTEH QVEDLFTRYD
VRALPVVNEE NDVIGLVTYK EVAAAKQRLW NKEQKRLRRE LELVEKGEVV DEADQKVAQE
RRKGSTVKGW MLQHVQLVEA SKTMAEVESI LLENDVGCMP VVADGTKKLV GMVTRTDLLR
QHRYYPSLHY HNKGFANSIA DRKPIIALRK RLKQFDIEE