Gene PHATRDRAFT_54310 details


Gene Information

Locus tag: PHATRDRAFT_54310
Symbol:
ID: 7199562
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011673
Strand:
Start bp: 428981
End bp: 432477
Gene Length: 3497 bp
Protein Length: 531 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002178771
Protein GI: 219115952
COG category:
COG ID:
TIGRFAM ID:
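
The gene length above can be reproduced from the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive; the record does not state its convention, but this assumption matches the reported 3497 bp. The function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive start and end positions."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the Gene Information fields of this record.
print(gene_length(428981, 432477))  # 3497
```

With 0-based, half-open coordinates the `+ 1` would be dropped, so the convention should be checked before reusing this on other records.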


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCTGTCGG CTGTTGCTGA AAACGCTGCG CGAAAACGTG CAGAACTTGG AGATGTCCCG 
AAGAAGACCA TGAATCAGAC TATGTATCTT GTCAAGTGGG CAGGTCTGGG GTACGAGCAT
TGCAGTTGGG AAACGAAAAA AGACGTCAAT GATGACAAAC TTATCGCTGA GTTTCACAAG
CTCAACAATA CGTTTCCTGA TGAGCCAGAT ATGCCGATGG AAGTCGTTGA CGATTTTATC
GAGAGTACGA AGCACATTAA CGTTGAAAAT GCGGGGGGAA TCTCTTGCAT ACCGAGTCTT
CGTGCCCAGT TGTACGCTCA AAGCCGATCA TTTCATTTCG CGAAATTTGG GATGAACATT
CCCGAGAAGG TCGGCGCAGA ATGTGGCCCA AAAACCAGAG CTGCATGGCA TTACCAGTTT
TCTAGTGATG ATGACAGTGC CAGACATCAT TCTACCGTTC CGCGGGAAGT GATTGAGTGT
GTTTCTGATC TCGTCTTCCA GGTCGCTAGG AAAGAGCCTG TCAGCTTTAT GCGTGCCAAT
ACGTCGCTGC CTCCGCCGAT GACTGGAGAG TATGATGCTA TCCTACCTAT TACTTCGAAA
GGATTGATGA TGAACGTAGG TGAAATTCAT GGGTCCGTAG CTTTTCTCGG TTATCGAACC
TTTCCCGATG GATCAAAAGG ACCGGCGGAC ATCGCGAATT TGATTCGGAA CGTGGGAGAC
AAAATCATCG CTGTTGATGG TTCCAGTACG ATCGGGAAAA CCTTCAAGGA GGTAATTTTG
ATGTTGCGCG AAAGCGGCAA GAACAAGTTT GCTTACATGC GCTTCCTTGA GACCAAATAT
GCCGTTTGTG ACAATGATCT TGCCAGTGTC GGAAAGAAAG GACGTTATGC TATCGAAGAA
CTTCAAAAGA AGGTGGCCAT GGATCGACAG CGTCTTGTTG TACAAAGGAA ACATTTGCTT
TCGGTGGATG AAGAACATGT TCCCGGGGAC ATAGGAAAGG ATCTTGCCCC AAAGGTCGAA
GACTCAGATG AAGAATCCGA GGAAGGAAGT GAGGGTGAGT TTGAACCGGA AAGTGACGAC
GAGGATCTTG TGGTGACGGG AAAGACAGGC GAAGTGGCCA CGGGTCCAAC GGTTTCCGAT
TCTCTAGAAC GTATTGATTC TGTGCCTGCC GTCTCTCCAA CTTTGAATTC TGAGATGGCA
GCTGAACGAC ACTCTGGGGT AAAGAAAAGC GAGGATGATA TAGGAGGGGA ACCAGCACCA
AAAATCGACC AGGAAACCAT TACTCCGGTT GAGGAATGCT CTTCAGCCTT ATTGGGTCCT
GTCTTTCGTC ACGAGACTAC TCGGTCTCTA GCCTATCGTT TACTTGGTGT AGATCTCGGC
TATAGTAGCG ACGAAGGTGG AGACGATGAC AGCGCATTTT TTGTTGATGG TGTCGATCAG
ACATACACTT CAATGCAGCA ACTTCAAGAT ATTGTCCGTC TACCGGCCGA AAGCGAAGCA
AAATCAACTA CTCCTGTAGA CGACAGTACC ATTCCAGTGC GCCAAAACGA GTTTTCTGTA
ATGGGCGATA GATCAAAACT TGCGACCGCA GTTGCTCTTA CGTCGAAAGA GCCTTCGACT
GAGGAATTTG ACAATTTTCC TTTGCCTTCG TCGAAAGAGT TATTGGCCTC AGAAAAAGAG
CAGCAAAGCC AGCAAGCTAA CAGTGCAGAG CTTCTATCAA AGTCAAGCAA ACGTTCAACT
GTCAAAGTAG AGCAAGTTTC TATCGTAACC GGGGACATTA TACACATTTG GGCAAATGTT
GAGTCTGCGG CTGCGACGCT TCAGCTTCCG CTCCCCCAGC TGAGGCAGGT TCTCCGGGGA
GAATACGACG AAGAAATTGG CGATGAAGTC GGGGGTTACA AATGGCGCTA TGCCTTGGTG
GGGGCGAAGG TCACTGCTGG AAATGGATCG ACCGGGCGAG GTGGCAGCGG ACGAAAGGCT
AAAGAGGCAT GGCTTGAGTT TCGCGACAAG CTCTACGACC CCAACGAGCC CCACAGCTAC
AAGAATGGCA ATCGCCTTCG AGATTATCAA GTAGAGGGAG TCAACTGGCT AGCGAGTACT
TGGTACAAGA AGCAAGGATG TATTTTGGCT GACGAGATGG GTCTCGGAAA GGTACGTGTC
GGACGACTTC ATAATGAATG TTGGGGTACT GGTCTTACAC CGATTGTATG TTCTACTGTA
GACTGTACAA ATTGTCTGTT ATATCGAGCA CATTTTCCGT GTTGAAAAGG TTCATCGACC
ATTTCTCGTC GTGGTTCCGT TGTCAACAGT GGAACACTGG CGGAGAGAGT TCGAGGGCTG
GACTGACATG ATATGCTGCA TCTATCATGA CAGGCAAAGG GTATGGCGAG ATGTTTTACG
AGAATACGAA TGGTATTACG AAGATCGCCC ACACACGGCC GAGTTCCTTA AGTTCGACGT
TCTTGTGACC ACATATGACA CCCTGATTGG AGACTTTGAC GTCATCAGCC AGATCCCGTT
TCGAGTCGCT GTTGTCGACG AGGCGCATCG GCTTCGCAAC CAAAAGGGTA GACTGTTGGA
ATGCATGCGG GAAATTAGCG CGAAGGGTAC CATGCAGTAT GGTTTCCAAA GCCGCGTCCT
TATGTCTGGA ACTCCTCTCC AGAATGACTT GACGGTGCGT TCATTATTAC TTCCTCTGTC
TTTTTGCTCT AGGTCCGCGT CTCAAAAGCC ATTTTTGTTT TTCTTGCCAG GAGCTTTGGA
CTTTGTTGAA CTTCATTGAG CCGTTTAAAT TTCCCGACCT TGATAATTTC CAGTTGAACT
TCGGGAATAT GGCCAATAGA GAACAGGTCG AAAGTCTGCA GCAGATGATT TCTCCGTATA
TGCTACGACG AGTGAAGGAA GACGTGGCCA AAGATATTCC AGCGAAGGAA GAAACTGTAA
TTGACGTCGA GCTCACTAGT ATTCAGAAGC AGTACTATCG AGCTATTTTT GAACACAATC
ATGCCTTTTT GAATATTGGG GCAACACGAA ACACAGCACC AAAATTGATG AATATCCAAA
TGGAACTTAG AAAGGTTTGC AATCATCCCT TTCTTTTGGA AGGGGTTGAG CACAGAGAAA
CAGACAGACA GTTTAAGGAA TTTTCGGAAA AGGGTCTCTT CGAAAACAAG GCACCGGAAG
AGCAACAGCG TCTTCTGAAC GAGCATGGCT ACATCATGAC AAGTGGAAAA ATGGTTTTAT
TGGACAAGCT ACTCCCGAAG CTGAAGCAAG AAGGTCACAA AATTCTTATA TTTAGTCAAA
TGGTAAAAAT GCTTGACCTG ATCTCAGAGT ACTGCGACCT GCGAGACTTC AGATATGAGA
GACTGGATGG ACGTGTAAGA GGAACGGAGC GACAAAAATC AATCGATAGA TTTGAGAACG
ATCCAGAGAG TTTCATATTC TTGCTTTCGA CTCGAGCGGG TGGTGTCGGA ATAAATCTTA
CGGCGGCTGG TATGTAG
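
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or GC content can be checked against the fields above. The following is a minimal sketch of that clean-up; `clean_sequence` and `gc_percent` are hypothetical helper names, not part of any database tool, and the percentage is computed over every character that remains after whitespace is removed.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a block-formatted nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence above, used as a small example.
example = "ATGCTGTCGG CTGTTGCTGA"
print(len(clean_sequence(example)))  # 20
print(gc_percent(example))           # 55.0
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the 47% GC content reported in the record, and `len(clean_sequence(...))` can be compared with the reported gene length.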
 
Protein sequence
MLSAVAENAA RKRAELGDVP KKTMNQTMYL VKWAGLGYEH CSWETKKDVN DDKLIAEFHK 
LNNTFPDEPD MPMEAKEAWL EFRDKLYDPN EPHSYKNGNR LRDYQVEGVN WLASTWYKKQ
GCILADEMGL GKTVQIVCYI EHIFRVEKVH RPFLVVVPLS TVEHWRREFE GWTDMICCIY
HDRQRVWRDV LREYEWYYED RPHTAEFLKF DVLVTTYDTL IGDFDVISQI PFRVAVVDEA
HRLRNQKGRL LECMREISAK GTMQYGFQSR VLMSGTPLQN DLTELWTLLN FIEPFKFPDL
DNFQLNFGNM ANREQVESLQ QMISPYMLRR VKEDVAKDIP AKEETVIDVE LTSIQKQYYR
AIFEHNHAFL NIGATRNTAP KLMNIQMELR KVCNHPFLLE GVEHRETDRQ FKEFSEKGLF
ENKAPEEQQR LLNEHGYIMT SGKMVLLDKL LPKLKQEGHK ILIFSQMVKM LDLISEYCDL
RDFRYERLDG RVRGTERQKS IDRFENDPES FIFLLSTRAG GVGINLTAAG M