Gene PHATRDRAFT_47649 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47649 
Symbol 
ID7202684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp396746 
End bp397831 
Gene Length1086 bp 
Protein Length303 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181900 
Protein GI219123164 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTGTAGACA ATTTCCAGTA CTCTCAATTA TTTTAAACCG AAACATATCT TATGTCGTCG 
AAGATGAGGG TTGTACTGTC GAATAAGAGC CTTTGGACGT TTCTCAAGGG TGTGGTGCTA
CTGATTGTGA GCTCCTTTTT TTCTTTTACA AGTGGTTTTT CTTTCGAGCT TACAAGTCGT
CGCCAGGCTC TAGGATGGAT GAGCGGTGCC ACTACAGCTA GCTTTTTTAG AGGCAATGAT
AGATTCGCCA ATGCCCTGGA AGAGCTCCCT GCTAGTTTCG ATGTCGACTC CTACTTGAAA
TCTGGCTTCG TTTTGAATCC CATGGGCGTT TCAGGACAGG CGGGCAAATC ACGTCCCGAA
ACAGGTGTAT TGCTGCGCGA TGGTAGCGAG GTATCCCGAG ATCCTCGTAC CGGTGAAATT
CTGGCGGAAA TTATTCTGAC GACCATCTCC GGCGAAAAAG TTCCTGTTCT CGCGTCGTAC
TCGTCGACTT GGCCCTTGGC GACGGGTAGT GTCTTTGACG TTGAATGCCG CGATTTAAAA
ACGGGCGACA CGGCATTCTT GGCGGTTTCG TCCAGCACAG GAGGCAAATC AATTTCGGAG
CTCAAAGACT CTTTTTTCAC CAACAATTTG TTTGCTCCGA AGGGACGTTT TTCGAGCTAT
GGCCAACCGA CCGATATCAA AGTTCGTAAA AACAGTGTCA CAGGTGCCTA CCGTGTGCTG
GATTTGAGTT TTTCCACCCT TTCGCAGAGT ACCCAGACTG AGATCCCTCG TCGTGCTCGG
TTAATTGCTA CTATCCCGGA AGGTACAAAT CAAGCCGTAA TGTTGATAGG CTCAGCGGCC
GACTTGCGAT GGCGGAAGGG ATCTGAAAAG GAAATTGAAC AAGTTATTAA TTCGTTCCGA
GCCATTCCAG CTCCCCAGAC CAATATGAAA GTTCGCGCAA AAGAACGTCG TAACCAAGGG
TAATTTGCGT CGCTTTAGAT TTCTACACCG ACACTCTGAC CCCAGGCAGA AGCCACATGC
ATGCACTTGG ACGAAGAACG TATTCACAGT CAAGCTAATT TAAGTTTAGC ATATACTTAC
TACAGC
 
Protein sequence
MSSKMRVVLS NKSLWTFLKG VVLLIVSSFF SFTSGFSFEL TSRRQALGWM SGATTASFFR 
GNDRFANALE ELPASFDVDS YLKSGFVLNP MGVSGQAGKS RPETGVLLRD GSEVSRDPRT
GEILAEIILT TISGEKVPVL ASYSSTWPLA TGSVFDVECR DLKTGDTAFL AVSSSTGGKS
ISELKDSFFT NNLFAPKGRF SSYGQPTDIK VRKNSVTGAY RVLDLSFSTL SQSTQTEIPR
RARLIATIPE GTNQAVMLIG SAADLRWRKG SEKEIEQVIN SFRAIPAPQT NMKVRAKERR
NQG