Gene PHATRDRAFT_48689 details

Gene Information

Locus tag: PHATRDRAFT_48689
Symbol:
ID: 7194919
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011686
Strand:
Start bp: 611804
End bp: 613483
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002183127
Protein GI: 219125731
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000119699
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCTGGC CAACGGTACG CAACGTCCAG AAACTCATCC TCCTTACTAT TCAAGTAGTG 
AGCAGTTTTG TTCCGAGACC GACCGCTCGC TCCAGATTGC AGTGGACGAC TCGAGCGACT
GCCTCCTGCA GACTACTGTC GTCTACCGAA GGACCAAACA GAGATGACCT ACCAACTCCC
ATATACCACC ACATAGATCT ATACAACGAA AACTCGTATA CCGACGACGT TGAGACAGTC
TTGGGGCGTG TCGATGGGTT TATGAGACTG CTTGAGCAGC AGGAAGCCTG GCCACTTCTA
CACGAGAGTA AAATCCGTGA TAGAATTCAT AGCCAGCATA CACGAGATTG CGAGGCAGCG
TCTCTTAACT TGGCGCCCGA ATCGCCAGTC CTGCGCGACC AGCAATGCAG TACTCGGTAC
GAATGGATTT TTTCTGGCAC CAAATCCTAC CCGTTAGCGG CGCAATCGTT AGAGCCAATA
TTGAATACAA CGGCTATTTC TACAATTCGC GAAGCCGCAG AAGCGCATTG GGAAAATCCG
GCTTTCTGTA CATCCCGGTT CACCTATCAG CGCCCAGGAA ACTATGAAGC CCACGTAACG
GACTTGGGCG AGCGAGTTCG CTCGATTGTC AACGAAACAT TGACAACTAG AATTTACCCA
CTAATTCGCG ACGTGTTTTG GAATGACCCA ACCTGTTCGT TGGCTCCAGT TGATCAACTT
TGTGTGTACG ACGCTCTCTA CATACGCTAT AATTCCACAC AAGCCAAGCT ACTGGATCAC
ATTGGTGCCG GGCAACCTTT GCACCGTGAC TTGGGACTCA TTAGTATCAA TATTCGACTG
AACAACGATT TTGAAGGTGG TGGCACCTTT TTTGAAAATC AGCTCCTCGA CCGAAAGGAG
TCCGATCTCG AACCAGGAAT AACACCTCTG ACACCTCTCG AGCGCGGCCA CGTCCTATTG
CACAAATCAT CGGAACGACA TGCTGGTGCC AGCACCGTAG ATGGAGTGCG AGACATTCTC
GTCTTTTTTG TATCAGGAGT CTCCTGCGAG ACAGGCTCCG GCAGTGGCTC GACAGTACCG
ATGCCAATTC AATCCGCTAT TGCGAAACAG TCTCGTGGAT ACTGCGACGA CTGCTATTCG
AATAATCCAC TACGGGCTAT CTTTTGTCGC ATCGCACATC AACGCTACGC GGTTCGGGTT
GCGCCTTCGG ATGGGGAAGC TTGGCAGTAC TTGGGTACTG CCTTCATGGA GTACGATACC
TACCTGGCAG TAATGAAGGC TTCCGCCGGG CTTCGAGGGG CCGTACTCCA GACGGCGACA
CGATGTCTAC AGCTCGCCAC GAGACTGACG CCGTGCGATT CGCGTGTCTG GAACAACCTG
GCTCTCACGC TGAACCGGCA ATGGAAATTG AAAAGCGACG CGAGTTTGCT TAATACCACG
GAAATGGCTT TTGCGACGGC TCACCAACTT CTGAAAATGT CTCGAGAAAA GTGCGATGTT
GAGGGCGAAT TGGACAATGT GAATGTAAAC TATGGACTAT TTCTTTCCAA TCAAGACAGG
TTTATGGAGG CGGGGTGTAT TTTGGAAGGC ACTGCACTGA AGAAGATCAT AGATCAAGAT
TGCGGAAAAG CGGTTGAAGA TGCCTATGAA TTGTGGAAGT TTTGTAGACA ACAGCAATAA
 
Protein sequence
MSWPTVRNVQ KLILLTIQVV SSFVPRPTAR SRLQWTTRAT ASCRLLSSTE GPNRDDLPTP 
IYHHIDLYNE NSYTDDVETV LGRVDGFMRL LEQQEAWPLL HESKIRDRIH SQHTRDCEAA
SLNLAPESPV LRDQQCSTRY EWIFSGTKSY PLAAQSLEPI LNTTAISTIR EAAEAHWENP
AFCTSRFTYQ RPGNYEAHVT DLGERVRSIV NETLTTRIYP LIRDVFWNDP TCSLAPVDQL
CVYDALYIRY NSTQAKLLDH IGAGQPLHRD LGLISINIRL NNDFEGGGTF FENQLLDRKE
SDLEPGITPL TPLERGHVLL HKSSERHAGA STVDGVRDIL VFFVSGVSCE TGSGSGSTVP
MPIQSAIAKQ SRGYCDDCYS NNPLRAIFCR IAHQRYAVRV APSDGEAWQY LGTAFMEYDT
YLAVMKASAG LRGAVLQTAT RCLQLATRLT PCDSRVWNNL ALTLNRQWKL KSDASLLNTT
EMAFATAHQL LKMSREKCDV EGELDNVNVN YGLFLSNQDR FMEAGCILEG TALKKIIDQD
CGKAVEDAYE LWKFCRQQQ
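
The Gene Length, Protein Length, and GC content fields can be cross-checked against the sequence blocks. The following is a minimal Python sketch, not part of the original record. The helper names (`clean_sequence`, `gc_percent`, `expected_protein_length`) are hypothetical, and the length check assumes the gene length includes a single terminal stop codon, which is consistent with the gene sequence ending in TAA.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-separated sequence block into one string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# First displayed line of the gene sequence: six blocks of ten bases.
first_line = "ATGAGCTGGC CAACGGTACG CAACGTCCAG AAACTCATCC TCCTTACTAT TCAAGTAGTG"
print(len(clean_sequence(first_line)))  # 60

# 1680 bp / 3 = 560 codons, minus the stop codon = 559 residues.
print(expected_protein_length(1680))    # 559
```

To check the reported GC content, pass the full gene sequence block through `clean_sequence` and then `gc_percent`.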