Gene PHATRDRAFT_2097 details


Gene Information

Locus tag: PHATRDRAFT_2097
Symbol:
ID: 7201394
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011677
Strand:
Start bp: 2515
End bp: 4257
Gene Length: 1743 bp
Protein Length: 552 aa
Translation table:
GC content: 55%
IMG OID:
Product: predicted protein
Protein accession: XP_002180550
Protein GI: 219119587
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.143499
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AACAAGTATG TGCCACCGCA TCTGAGAAAC TCTCAGGGCA GTGGTGGTCG CACCAGCGAC 
TCTGTATCGG ACGGCCGTGG CGGAGACCGT CGCGATTCCT ACTCGGACCG TCGTGGAGAT
AGTGGCGGCC GTGGCGGTTA TGGAGGAGAC CGTCGCGGCT CCTACGGGGA CCGGCGCGGA
GAACAAGGAC GTGGAGGGGA TGGTCCACCT CCTACTGGAA ACTCCCGTTG GTCGGAAGGC
GGTGGAGGCG GCCGCGGCTC GTCTTCGTAC GGAGGAGGCC GCGGACAATC TCGTCGTAAC
GCTCGTGGCT TTCACGGTGA CCTCAAGGAA GACCCGCGCA CACAAGCACG TCTCTTTGGT
CGCGACGACC ACCAAACGAC AGGAATCAAC TTTGACAATT ATGACAAGAT TCCCATTGAA
GTGTCGGGAG ACGATGTTCC CGATCCTATC GAAACCTACT CCCCCGAAAC TATCGGAGAC
GATCTCTTTC GAAACACTCA GCTATGCGGC TACTCACGTC CTACTCCAGT CCAAAAGTAC
AGTGTTCCTA TCTGCACTCA GGGACGCGAT CTCATGGCCT GCGCGCAGAC GGGTTCTGGA
AAGACGGCAG GTTTCCTCTT TCCCATTATC ATGTCCATGA TAAAGCGAGG TGGAAGCGAC
CCACCCGAGA ATGCTCGCCG TCGTATATAC CCCGAGGCGC TGGTATTGGC TCCTACACGC
GAGTTGGCTC AGCAGATTCA CGAAGAGGCC AAGCGTTTTA CCTACGCTAC AGGCATTGCT
TCGGTAGTGA TTTATGGAGG AGCAAACGTG GGCGACCAAC TGCGTGAAAT GGAGCGCGGC
TGTGACTTAC TGGTCGCCAC CCCGGGTCGT CTGGTCGATC TGATTGAACG GGGACGTCTC
GGCATGGAAA GCGTCTCGTT TCTTGTTCTG GATGAGGCCG ATCGCATGTT GGATATGGGT
TTCGAGCCTC AAATTCGTAG GATCGTGGAA GAATCGGGCA TGCCCGGTGG TATTGATCGC
CAGACAATGA TGTTTAGTGC CACCTTTCCC GCCAATATTC AGCGTTTGGC AAGCGATTTC
ATGCGTGACT ACGTTTTTTT GACGGTTGGA CGCGTGGGCT CCGCCTCCAA GGATGTCACC
CAAACTGTAG AGTTTGTGGA GGAACGCGAT AAGGTTGACG CCTTGATGAA GTTTCTTTTG
ACCATTCAAG ATGGCCTCAT CCTAATTTTT GTTGAAACGA AGCGCTCGTG CGACTACGTT
GAAGACGTTC TCTGCGGCCA AGGATTTCCT GCCTGCTCGA TCCACGGCGA TAAGTCACAG
CGCGAACGGG AAGACGCACT TCGCTATTTT AAGAACGGAA ATACGCCAAT TCTTTGCGCA
ACTTCTGTAG CCGCCCGAGG ATTAGATATT CCGAACGTTA CCCAGGTTGT CAACTACGAC
CTTCCGTCCA ACATTGATGA CTATGTGCAT CGCATTGGAC GTACAGGTCG CGCAGGAAAC
ACTGGGGCAG CGCTGTCTTT TATCAACGAG AGTAATTCGG GTGTTGTCCG CGAGCTGCGC
GATCTTCTCG ACGAGAATGA GCAGGATGTT CCCCCTTGGC TCAATCAAAT GTGCCAGTTT
AGTGGCGGCC GTAGTAGCGG CGGAGGTGGT CGAGGAGGAG GCGGCCGTCG TGGCGGCGGT
GGCGGAGGTT TTGGCAGTCG TGATGTACGC AGCAAAGGTG GCAATGATCG CGGACAAGGC
GGC
 
Protein sequence
NKYVPPHLRN SQGSGGRTSD SVSDGRGGDR RDSYSDRRGD SGGRGGYGGD RRGSYGDRRG 
EQGRRGQSRR NARGFHGDLK EDPRTQARLF GRDDHQTTGI NFDNYDKIPI EVSGDDVPDP
IETYSPETIG DDLFRNTQLC GYSRPTPVQK YSVPICTQGR DLMACAQTGS GKTAGFLFPI
IMSMIKRGGS DPPENARRRI YPEALVLAPT RELAQQIHEE AKRFTYATGI ASVVIYGGAN
VGDQLREMER GCDLLVATPG RLVDLIERGR LGMESVSFLV LDEADRMLDM GFEPQIRRIV
EESGMPGGID RQTMMFSATF PANIQRLASD FMRDYVFLTV GRVGSASKDV TQTVEFVEER
DKVDALMKFL LTIQDGLILI FVETKRSCDY VEDVLCGQGF PACSIHGDKS QREREDALRY
FKNGNTPILC ATSVAARGLD IPNVTQVVNY DLPSNIDDYV HRIGRTGRAG NTGAALSFIN
ESNSGVVREL RDLLDENEQD VPPWLNQMCQ FSGGRSSGGG GRGGGGRRGG GGGGFGSRDV
RSKGGNDRGQ GG
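
The Start bp, End bp, Gene Length and GC content fields can be cross-checked against the sequence blocks with a few lines of Python. This is only a minimal sketch: the helper names (`span_length`, `clean_sequence`, `gc_percent`) are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the record does not state but which agrees with its own numbers.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping spaces and line breaks."""
    return "".join(text.split())


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates copied from the Gene Information fields of this record.
    print(span_length(2515, 4257))
    # First three 10-base blocks of the gene sequence, as an illustration.
    print(clean_sequence("AACAAGTATG TGCCACCGCA\nTCTGAGAAAC"))
```

Under that assumption, `span_length(2515, 4257)` gives 1743, which matches the Gene Length field. Passing the full gene sequence through `clean_sequence` and then `gc_percent` would let the reported GC content be checked in the same way.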