Gene PHATRDRAFT_47407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47407 
Symbol 
ID7202546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp552541 
End bp554499 
Gene Length1959 bp 
Protein Length652 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181581 
Protein GI219122499 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGGCCG TGATATCCGG TCGGTTCCTC GTGCAGGGAG ACCGCAATCG CCATCGTGGC 
CGTCTCCTAG ATCCTTCCCC TCTTCTGATA TCGACGGAGC CACTTTCTTT CTGGGGAGGT
ATTGATCCCT TGTCCGGAAT TGTGATTGAC TCGACCCATC CATTGGCCGG ACAGTCCGTC
ACTGATACAA TACTCTGTCT ACCCTCGGGT CGTGGGTCCT GTACCGCATC TCAGGTACTA
CTGGAGCTTA TTCTCAATGG CATTGCTCCT CGTGCAATTG TCCTACGTGA TGTCGACGGA
TTAGCTGCGG TGGGGGCTTT GATTGCACAG GAGGTTTTCC TGGAGGCTTC AATGGATATT
CTGCACATAG GAAAGGAAGG ATATCATGCG ATACTTTGTT CGTCAAACTC GCACGGTGTG
ATATCTTCAA ACGGGGAGCT TGCTCTAAGT AGAGACGTCG AATCGCTTAC CCGGACGGTA
TTCAAGCCGC ATACTATAGT AAAAAAAGCA ATTGATGCGT CCAATCTGTC ATTTACTAAC
GAAGAACAAA AAATGTTGGA AGATTGTGAC TCTGAAGCAG CACAAATGGC GCTACGGGTC
CTATTTCACT ATGCCCACAT GACGAGCCCC CCGAATACAG TACCGACATA TTTGAAAGTC
ACCAAAGCGC ACATTGATGG ATGTACGTAT ATTGGCCCGG GAGGTCTCGC CTTTTGCCAA
CGCCTTGTCA AAGCCAAAGG CCATGTTTCT GTTCCCACCA CCCTTAATTC TATGTCGGCT
GATCGTCAAC GCTGGCAAGT CTTGGGTGTG CCGACGAAAT ATGCCCACAA TGCGATTGCG
TTGGGTGACG CTTACCTGCA GCTGGGCTGT TTGCACTCTT TCACGTGTGC ACCCTATATT
TTGAACAATC CACCCACGCT TGGTGAGCAT TTGGTATGGG GCGAAAGCAA TGCGGTTGTT
TACGCCAATT CTGTTTTAGG TGCACGAACG GAAAAGTACG CAGACTATCT AGATATTTGC
TGCGCTATCG CAGGGATGGT ACCTGCCGCC GGAGTTCATC TAGAAGAAAA TCGACGCCCT
AGGATTGTTC TGGACGCGAC ACCTTTGGTA GAATCGCTTG ACCTGGCACT TGTAGACTTG
GACATGTTTT TCCCTCTTCT GGGTCACTTA TGCGGATCCC TCTCTGATGG AAACGTTCCT
ATTCTTTTGG GATTAGAGCC ACTGTCTAGC TCCATCAGTC TAGACCATTT AAAATCGTTT
TGTGCGGCGT TCGGCACAAC TGCATCGTCT CCACTAATTC ATATTTCCAA AGTAACGCCA
GAAGCTCAGG ATGAGGGAAT TGTAAAAGCC TGGATCCAAG CGTGCGGAGA TAAGACTGAA
ACGATATCGA CTAGACAGTT GCGCAAGACT TTTGAGAAGC TCGATGGAGA TCGCGATGGA
GATGGGAAGG TCAATCTTAT AGCTCTTGGA AATCCTCACT TATCTGTATC AGAATGCAGA
GACTTGGTCA ATCTTATCGA ACTACCGCAT ATTTCTAACA ATCATAAGGT CAAGCATCCC
GAAGTCCGTG TGATTGCTTG TATGGCTAGG TCTCTTCAGT TGATAGCGGA GGAAGCAGGG
TATGTGGGAA AGCTTCGTGA CTTTGGGGTT GAATTCATCA ATGATACGTG CTGGTGCATG
CTCCTCGATG AGCCTGTGAT CCCGATCGAT CCCAGCTCTA AAATTTTGAC GAACAGTGGC
AAATATGCGC ATTATGGACC CGGTCTTACT TCCCGCAAGT TTCGTTTTGG GAGTACGAGT
GATTGCGTGC AGGCAGCTTT CACCGGCGTG TATCCCAACA AGGTATTTTC GGATGTTTCT
CCCAGCAGTT GGTTGATGAG TAGACCTCAG ATAATGCAGC GACGGACGTT CAAGACGTCG
ATCCATAGCA TTCAGAGGAT CGTGTTACTT CTAAGATAA
 
Protein sequence
MSAVISGRFL VQGDRNRHRG RLLDPSPLLI STEPLSFWGG IDPLSGIVID STHPLAGQSV 
TDTILCLPSG RGSCTASQVL LELILNGIAP RAIVLRDVDG LAAVGALIAQ EVFLEASMDI
LHIGKEGYHA ILCSSNSHGV ISSNGELALS RDVESLTRTV FKPHTIVKKA IDASNLSFTN
EEQKMLEDCD SEAAQMALRV LFHYAHMTSP PNTVPTYLKV TKAHIDGCTY IGPGGLAFCQ
RLVKAKGHVS VPTTLNSMSA DRQRWQVLGV PTKYAHNAIA LGDAYLQLGC LHSFTCAPYI
LNNPPTLGEH LVWGESNAVV YANSVLGART EKYADYLDIC CAIAGMVPAA GVHLEENRRP
RIVLDATPLV ESLDLALVDL DMFFPLLGHL CGSLSDGNVP ILLGLEPLSS SISLDHLKSF
CAAFGTTASS PLIHISKVTP EAQDEGIVKA WIQACGDKTE TISTRQLRKT FEKLDGDRDG
DGKVNLIALG NPHLSVSECR DLVNLIELPH ISNNHKVKHP EVRVIACMAR SLQLIAEEAG
YVGKLRDFGV EFINDTCWCM LLDEPVIPID PSSKILTNSG KYAHYGPGLT SRKFRFGSTS
DCVQAAFTGV YPNKVFSDVS PSSWLMSRPQ IMQRRTFKTS IHSIQRIVLL LR