Gene PHATRDRAFT_42521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42521 
Symbol 
ID7196069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp288163 
End bp289641 
Gene Length1479 bp 
Protein Length492 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176560 
Protein GI219109611 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACTGA TTTTACTATT GCTACACTTT CCTGTATGCT TCGGATTCCA CTTTGCGCCG 
CTGGATAGAG CACTAATCAA GCATGTCACG AGCAGCGATC GTTTGTCTCA CTTTCCGACG
TTGCACCGAA AATGCCGTAG ACCTCGCCAC CGACAACCTT GTCGACCGCA CCAAGCGATT
CCCATCGCTT CTATTGGAAT CGAAACGACC AGAGCTTTGT CAGGGCTTGT CGCGTCGTCC
ATAGTTGGAT TCAATTTTGA TCGGATACTT CCTGATTCTG GGATTCTAGT AACTCTTATT
TCAGCGGCAC TTATTTCCAA CGTGGGCTTG GCGCCCACGC TTCATCCACT ATACGATACG
TGCTGGACAA CATTTTTGCC AGGATCCCTG ACCCTGCTTC TACTGTCGAT GCAGAAAAAG
ACGACAGAAA CGTTTGCGAA TGGAGAATCT ATTTTGACGG TCGTTCGGAG AGTGTCTGTT
CCCTTTGCTA TCGCTTCCGT AGCGTCCGTA TTGGGCTGTG CATTATCCTT TTGGCTATGT
CTTACTTTTC CAATGCATCT TTTACCCAAA CAAGAAGCCA CTGTAGCAAC GGCATGTCAA
GCAGCGTCGT TTGTAGGAGG ATCCGTTAAC TTTTTCGCCA CCGCAGCGGT TGTTGCCGAT
CGATCAGTGT CTACACTGGT TAGTTCAATG GCCACCGCAG ATTTGGTAGT AACGGCCATT
TTCTTCGCGA TTTTGAGCAC AGCGCTTCAA TCCCCTTCAC TGAAACGAAT GTTTTTGAAT
GACAACGAGA GAGAAGCCCG AAACTCTGAC GTAGAAGACA TAAATGAGTC CACCAACAAA
TCTACCGATA GCCCCGACCA GCCAACGCCA AGGAAGTCAA TTAAGGACGT TTCTCCAGCG
ACAATGCTAC GTTTGACGAT ATCATCAATT CTGGTTTCGT CTGTAGCACT AGCGATTGTC
CGCTTGGCCG AGCGCTTCGA AGCCGTGGTC TCGAGCATCA TCCCAGGGAC AGCATGTGCC
GCTATCACAG TTCTCGCTCC GCTTGTTCCG AAATTTATGC CTCGCGACCT TTGGCTCTGG
AAAGATATGC AGCGCGTCGC CGTTCCGCTT TCGCAGTTCT GTTTTTTGTT TTTGTTTGCA
TCCATCGGGA TGTCGGCCGA TTTGACGGCC GCGTTGATAT CCGGCCCCGC TTGTCTGGTT
GTTTCGTTGA GTGCTTTAGT CGTTCATTTG ATTGGTACAC TATTGGGTTG CTTGATTTCT
CGCCGTTGGT TTCAGTCAGA ACTTCGTTTT GAAGATGTTT TGGTGGCGTC CAATGCAGCC
ATTGGAGGAC CGGCGACTGC GGCTGCCTTT TGTGGTCGGA TAGTAGGACC TCGTCAAAAG
GCTTTGACCT ACGCGGCTAC CATATGGGGT GTCGTGGGAT ACGCTATTGG CACAACACTT
GGAGTAACTT TCTTTCGAAT CGCGCGACAA TTTTTATAG
 
Protein sequence
MRLILLLLHF PVCFGFHFAP LDRALIKHVT SSDRLSHFPT LHRKCRRPRH RQPCRPHQAI 
PIASIGIETT RALSGLVASS IVGFNFDRIL PDSGILVTLI SAALISNVGL APTLHPLYDT
CWTTFLPGSL TLLLLSMQKK TTETFANGES ILTVVRRVSV PFAIASVASV LGCALSFWLC
LTFPMHLLPK QEATVATACQ AASFVGGSVN FFATAAVVAD RSVSTLVSSM ATADLVVTAI
FFAILSTALQ SPSLKRMFLN DNEREARNSD VEDINESTNK STDSPDQPTP RKSIKDVSPA
TMLRLTISSI LVSSVALAIV RLAERFEAVV SSIIPGTACA AITVLAPLVP KFMPRDLWLW
KDMQRVAVPL SQFCFLFLFA SIGMSADLTA ALISGPACLV VSLSALVVHL IGTLLGCLIS
RRWFQSELRF EDVLVASNAA IGGPATAAAF CGRIVGPRQK ALTYAATIWG VVGYAIGTTL
GVTFFRIARQ FL