Gene PHATRDRAFT_42779 details

Gene Information

Locus tag: PHATRDRAFT_42779
Symbol:
ID: 7196401
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1090785
End bp: 1092657
Gene Length: 1873 bp
Protein Length: 521 aa
Translation table:
GC content: 45%
IMG OID:
Product: predicted protein
Protein accession: XP_002176721
Protein GI: 219109937
COG category:
COG ID:
TIGRFAM ID:
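
The gene length listed above is consistent with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal Python sketch of the check, using a hypothetical helper named gene_length:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(1090785, 1092657)
print(length)  # 1873, matching the listed gene length
```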


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0527893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TAGAGCCGTT CAAGATTGAG AGTATAGAAA TCGGAAAGCC CCGCAAGAAA TATCGCTTTT 
GCCTCCTATG AACTCACTCT TCTAACTTTG AAAGGATCTC ATTGTGCTGA CTGATAAATG
TGTATTTCAG TCGAAAAACC TATGGTTGCC GACGGCAACA GAAGTTCCCG ATCTTCGTTT
TTTGAAACGC CGATTCTACG GCTTCGATAC TCCGTGGTGG GACCGGCTTT GACGATCGAC
GCACAAGTCT TGTTGGCTCT TGTATTGATA GCAGCGTATC TTTCTAATTG CTTGCAGGAG
GAGCGTGTTT CCTTTCTTAA TAATTTCTGG ACAATACGAA CGATTCTGCA GCAGGAAAGT
CTCAGGGCTG TAATTTTCGT ATTTTCAATG TTTGCTTTTG GGCGGAAATG GTTTGGTTCT
TTGCTAGGAG CTCAAAAATT GGCTCAAACA GGTTCATCGG CTCTGAAGAG CGTTGCTTCA
AACGGAACCA TCGTGAGAAT CCGTAGTTCC ATGTCGATGT CTGTCATATC GGACCGGCTA
ATGTCGATGA ACGGCAAGTC GACCTCGCAC AGGGCTTCTC TGTCACTGTC AGACAAAACG
TGCTTTGCTC AATTGAAGGA TATTGAGAAA CTGTCGGTAA AGGATATGGG GAGCATTTTT
CGCTACGCAA TACAATACAA TATATGGACT GACGCTCAAC TTAAAGCCTT TCTTTCAGAA
ATTCGGGTCC AGGCCTTGTC CGTTGTTACC GCAATTGATC ACGCGCTTCC CCCCTTACAT
CGTGCGTCAA TGGTGCAAGA ACCAAACGCG CAGAGTAGCA GTCCTAGGAA CTTTGGAGAT
ATGGACGTAC TCCTGTTTTC AGCAGCCGTT CGAGTATTTG CGGAATGGAG ACTTCTTCGC
CTTACCCCAG CAGGATATCG AAATTACGCA CTTGGGATGG CGCTGACACG TCGCGATTTG
GTGCAAAATA TTGGAAAGAT AGAAGCTGCT GTACATGAGC TATTGGAAAG CCAAACTTCT
CACAACGGAG TCAATTCTAT TAGACCTACA ATTTGGCAAT TGCTGGATTA CGAAGTACAG
CGTGGTGTGC ACCCAAAGCT ACCTTATTTG GTCGAGAAAT CGGGTGCGTC GGGTATCCTT
TGGATCATGC GGCAACTGAG TTTCCAGGTG AATTCATTTG AATACATTTC TAAGGTACCA
ACAGCGTTTC CGTCATTCAA GATAGCCGTT CGCTCAGCAT ATGATCGAGT TTATGGCGAC
TATCATGGAT TCTTCTTAAA GCAAATTTTT TGGAATTCGT TCAAATCGGC TCCGGAAGCT
AGCGTAATTC TGAAGTTTAT GGAGGAAACC GAAGAAATTA TGTCCAGAGA TCTGTCTCCA
TCGAATTCAT CCGAAAAGGC CACTGTTCTA ACAAGAGATG CTGTTTGTGA TAACATACAA
TCTCCTCAAG AGCGGAATCA TTTTATTGGC ATGTTTCTCG AAGTGCAGCA GTTTCTTAAG
CAGTGTCACG GGGGGCACCC AAAAGTAAGG CCACCAGCAG TCGACACATC ACTTCATGGC
CCTGAACATA TTGGACAAAA GGGAGGGGAT CTCACCGCTT TTCTTCTCGA AATGCACCCG
CTAATATCAG GACTTGATGG CCTGATTGGA CACTTCAATA TGAAAGATCC CTCTAAAGTT
TAGCCGCCGG AGAGCCTGTA CCTCCTCTAA TCCCATTTAT TTACTTGAAG CTAGCAGGGA
CATAGATACC GGCAACTTCA TCCTTGTAAG ATGATTTAGT AAAAGGCCAA ATTGTTTACA
GTCAGCAATT TGAAAGTGTG TTGGCCTACC GCATTTTCGG GGACACTAAC AAAAACAGTT
GTTGTAACAC CGT
 
Protein sequence
MCISVEKPMV ADGNRSSRSS FFETPILRLR YSVVGPALTI DAQVLLALVL IAAYLSNCLQ 
EERVSFLNNF WTIRTILQQE SLRAVIFVFS MFAFGRKWFG SLLGAQKLAQ TGSSALKSVA
SNGTIVRIRS SMSMSVISDR LMSMNGKSTS HRASLSLSDK TCFAQLKDIE KLSVKDMGSI
FRYAIQYNIW TDAQLKAFLS EIRVQALSVV TAIDHALPPL HRASMVQEPN AQSSSPRNFG
DMDVLLFSAA VRVFAEWRLL RLTPAGYRNY ALGMALTRRD LVQNIGKIEA AVHELLESQT
SHNGVNSIRP TIWQLLDYEV QRGVHPKLPY LVEKSGASGI LWIMRQLSFQ VNSFEYISKV
PTAFPSFKIA VRSAYDRVYG DYHGFFLKQI FWNSFKSAPE ASVILKFMEE TEEIMSRDLS
PSNSSEKATV LTRDAVCDNI QSPQERNHFI GMFLEVQQFL KQCHGGHPKV RPPAVDTSLH
GPEHIGQKGG DLTAFLLEMH PLISGLDGLI GHFNMKDPSK V
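
The sequences above are laid out in space-separated groups of 10 characters, so they need to be joined before any length or composition check. The following Python sketch shows one way to do that. The function names join_blocks and gc_fraction are hypothetical, and the short example string reuses only the first two and last two groups of the gene sequence for illustration, not the full sequence.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Illustrative excerpt only: the first two and last two groups of the
# gene sequence, not the complete 1873 bp sequence.
example = """
TAGAGCCGTT CAAGATTGAG
GTTGTAACAC CGT
"""

seq = join_blocks(example)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the complete gene and protein sequences, the joined lengths can be compared with the Gene Length and Protein Length fields of the record.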