Gene PHATRDRAFT_47780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47780 
Symbol 
ID7202942 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp10844 
End bp13331 
Gene Length2488 bp 
Protein Length716 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182307 
Protein GI219124011 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTAC AGCACAGCTT GCTACCTCAG CTCCAAAGGA GCCGCCGCCT TGGGATCTTA 
ATCGCTCTGG TTGTCTTTCA AAGTGTGACG GTCCTGTTGC ACAAACAGAC GTCCGTTTTG
CATCCGCCAG TTCCTCGAGA CTTGTCCGAT CCCTTTGTAT TGGCCTCGGA ACCTAACGGA
CAAGAAGATC TTGAGCCAAT GTCCCGCATT TTTCTAGAGG GAAGAGCAAC CTATACTAGC
GAATGCAGAA GGTTCAAGTT GACCAACATC ACCTGGGGTC AATCGGTCGC TCCTTTCGTT
GACAAAGTCG GAGCCAAAGT ATTGGTAGAG CAAATGGGAA CTTCCGTCAA GATTGTTCCC
ACCATTGCAG TCTACGACAC GGCCAATATA TCAGACTTTG ACGCCATGTA CATGAAGGCC
TTGCCAGACT CAATCATCAA ACCAGCTCAC GCCACCGGAT GGACCGCACA AGTCCAAAAC
AAGAGCTACG TTTGCTTCAA AGGTTGCAAG CAAGAAACAC ACAAGTTCCA ATCCTGGCTC
GGAGAGCCTT CGGAAATCGA ACAGGCGCAC AAAGTCGCAA AGAAGGTAAT GCAGTATACC
CTTTCCGACG TTCCCAGCCC CGAGTTTCAA AAGAAAGAAC CCCAGTATGG GTTTGTTCCT
CGTCGGGTCC TCATCGAACA CCGACTGCCC GTAGAACGCA TGAAGGAATA CCACTGGTGG
ATTGCCAACG GACAACCAGT TTTTGTCTGT ATCCGATGTG ATGAAGGGCG GACGAAACGT
GGATCCTACT ATTCTTCCGC CTTTCAAAAG CTGGAAATCA CCAGTATCTT GGAACCTTGC
CAGCATTTGT CACGGCCCAA GACTTGGGAA AAGATGATTT CCATTGTGAA AAATATGGGT
GAGCACGTTC CTGGGGTAGC GTGCATCGAT CTCTACGCCG ACGATTTGGA TGTCTACTTC
TCCGAAATCA CTTTCACACG AGGTAAATGC CGAACATACT TCCAACCGCT CGTGGCCGAT
GCCCTCTTAT ACGCAATGAG CAACGATATC CTCGCTGCCA AAAGCATCAC CGCTGACTAC
GTCGAGAAAA CAGTTGCCGA TCGATCGTGG GTTCACGTTT CGTTCGACCT CGGCGAGCCT
CTTCTTAGCA CTAACAAGGT TGTCAACGCA ACAGGATTTC CGTCGGGGCC TGATCTTTGT
CGAAACCAAG CAACTGCCAA TTCCTCCGTC TGCGACCACA CCATAGATTC AGTTGCGAGC
TGGGACTTGC ATTGTGTTAT CAGCAAGGAG AACGCTTTAA CAGCGGTGGG TCAATCAAAA
ATTCGGACAA TCGGTCGTAT CGTTCAGAAG ATTGACTGGC TGTTGGTTCT TGGTCTGGTA
GTGTTACTAG TGCTTGTCAA GTTTGGACAC AAGACACGGC AACTTGACCG ACCAGGGCCT
CAAGTCTTTC ATTGCTTCCT TTATTTGGCT GCAGTTGCGG TATTCAAGAC CTTTCAAACC
CATTCTGCAG GCCTTCTATC ACCGAGACCA ATCTGGTACA CTGTAGTCGA AAGTTACCAA
ACTTTCAAAA TTGTCCATCC TGTGACATCC CCGGCAATTG CATTATCGCA TTTTGCGACT
TACTGGATTT CCGTCAGCGC ATTCTTTTCC AAAAGATTGA CTACAATGTT GATTCTTTGG
TGCCTGTATG AAGTCTGTAC AGCTTTTGTA AACGAATACT TTCACTTTGG TGAGGAGGAC
GATTCGGTGA GATGCATGCG AGTTTCGTTC ATTCTTTACA CCAAAGAATA CGCTATCAAT
GACGTCGTGA GGGTCTACTT GCTGCCGCCT CTCTTTGTGT ATGGGTATCT GTTGCCCAAG
ATGATGCTCT ATTGGTTTGG TCCCCATGGT ATGTTTCTGG TTTGTACCGT TTCTGCTGGC
TTCGGGACCT TTTTGTCTTT GAGCACTTGC CGAAAACGAA CATATTGTGG AGCTACACTA
TGCCATCACA CGTCAACAAG AATTGCAAGA TGAATTAGAC AAAACGGTTT GTTCCAACAT
CCAAACAATT TTATATTAGA ACATTGTGTA TGCTACTATG CCATTGAGTA CTCTAAAAGA
GTGTACAAAG CGGCAACAAT GTAGTATGCA TGGTGCTGTA GATCTAATTA TGCATGTGCC
TGACCAAGTG ACTGTTGAAT AGGGATTGGG TTGCCTGTAA GGCTTGTGTG TGGCTTACAT
CACAGGTTGC GCAACCTGGA CACTGACCAG ATTGGGATTG GCTGCAAGAG TGTTGAGTTT
GTATGGTGTT CATGCCCCTC ACGCTGACCA CAATGGAGGC GCGCTCGGCA CCGCCAACGC
AAATCACGGC GGTGGCAAGT ACAGCGCAGG ATTCTTCCAG ACGCCGAGGG TCACTGACAC
GGGCCTGCAT GGCGCCACTA GACGCGACGG CACCGGAGGC AACCAAGGAT ACTTCCCAAC
AAACGTCAAC CACAGCAAGA CTTTGTAA
 
Protein sequence
MTLQHSLLPQ LQRSRRLGIL IALVVFQSVT VLLHKQTSVL HPPVPRDLSD PFVLASEPNG 
QEDLEPMSRI FLEGRATYTS ECRRFKLTNI TWGQSVAPFV DKVGAKVLVE QMGTSVKIVP
TIAVYDTANI SDFDAMYMKA LPDSIIKPAH ATGWTAQVQN KSYVCFKGCK QETHKFQSWL
GEPSEIEQAH KVAKKVMQYT LSDVPSPEFQ KKEPQYGFVP RRVLIEHRLP VERMKEYHWW
IANGQPVFVC IRCDEGRTKR GSYYSSAFQK LEITSILEPC QHLSRPKTWE KMISIVKNMG
EHVPGVACID LYADDLDVYF SEITFTRGKC RTYFQPLVAD ALLYAMSNDI LAAKSITADY
VEKTVADRSW VHVSFDLGEP LLSTNKVVNA TGFPSGPDLC RNQATANSSV CDHTIDSVAS
WDLHCVISKE NALTAVGQSK IRTIGRIVQK IDWLLVLGLV VLLVLVKFGH KTRQLDRPGP
QVFHCFLYLA AVAVFKTFQT HSAGLLSPRP IWYTVVESYQ TFKIVHPVTS PAIALSHFAT
YWISVSAFFS KRLTTMLILW CLYEVCTAFV NEYFHFGEED DSVRCMRVSF ILYTKEYAIN
DVVRVYLLPP LFVYGYLLPK MMLYWFGPHG CATWTLTRLG LAARVLSLYG VHAPHADHNG
GALGTANANH GGGKYSAGFF QTPRVTDTGL HGATRRDGTG GNQGYFPTNV NHSKTL