Gene PHATRDRAFT_42743 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42743 
Symbol 
ID7196128 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp947078 
End bp949270 
Gene Length2193 bp 
Protein Length669 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176692 
Protein GI219109878 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.599119 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AATAACCAAA GTCGATCGAC GCCATACCTT GGACAAGCAT CTTCGACCAT GAAGAGAATC 
CACGGAGCCC ATAGCCGTCC GGGAACGAAG CCTCTGTCGA TGGATACTAC GCTCGACGAT
GCCGATGTCT ACACGGTCTC GTCGAATGTC GATCCCGCTT TTTCCTTGTC CTCGTCGGTG
CCCCAAACGC GGACCCTTGA CTCCACGAGA TCAGAGCGGA TTGGTAGGAG CCGGAAACAA
CCATCGGCTT CAATGACTCA GGGAAGGAAA CCTCTGTCGA TGGATACTAC GCTCGATGAT
GCCGATGTCT ACACGGTCTC GTCGAATGTC GATCCCGCTT TTTCCTTGTC CTCGTCGGTG
CCCCAAACGC GGACCCTTGA CTCCACGAGA TCAGAGCGGA TTGGCAGGAG CCGGAAACAA
CCATCGACTT CAATGACTCA GATAAAATGT AAAGCAAGCT TTGAAAGACA GTACGATCTT
ACTCCGCTTG GTCTTACAAC GGATGCCGGA GCTTCCTCTA GATTTTCCCT CGATCCGTGG
GAGAACGATT CGGAAAGCAA AAAAATGGTA AAAGAACAGA CTAGCACATC CAATCGGTAA
GTGGTATCAC TATGATAAGA CGTCCGGATA GTCAAATTCA CATGGTGCTC TCCTATTATG
AATAGATCGA ACCAGTTACA GAGGAAGGCG AGCGATACCT CTCCATCTAG TGACGATACC
GAAGAAATTC ATTTCGTGGA CGAATCGCCA AGTGTGATGA GCACCTATCT AGATCCTGAA
GAGATTCATC TGAGTATGGA AGATAGAGAA GAGGAAATCG CTCAGTTGCG GCTTTTGGCC
GGAAAGCTGT CGGCGGATTG GAGAGCTCAG GACTTTGTGG CCCCGGCTCT GGCTAGGCGA
ATGAGAGATT TCCATTTCGC TCAGGAAAAG CGGCGCAAGA AGTACGGGGA CGAAAGGCCT
TGGGGAATCC TCGGGTTGTA CGATCATCTT TCCGCAATTC GCATTGATGT ACAGTGGGCC
GAAGATGCAG CTTGGAGACG GGCCAATGGA GAACCTTACC TTTCCTGGGC GGACTTTGAC
GAGAGCAAAA AAGGTGGAAA GAATCGACCC TACTTTACCT ACGTTCTTTT GTTCGTCTGC
ACTGCCAACA TGATTGCGTC TATTGCGGCG AATGATTGGA CTGTGGAATC TTTGGACGAG
AATCCCATGA TTGGGCCCAG TGCTGCGACG CTCATTCGAA TGGGTGCGAA AGACTCCTAC
TTGATTGTAC ACGCCGGCGA AGGGTGGCGT TTGTTAACCT CCACGATTCT CCACGCGGGT
TTAGTTCACT ATTTCATCAA TATGCTAGCC TTATGGTTTG TCGGTGGAGC TATCGAAATG
AGCCACGGGT GGATCTCGGC CATGATCATC TTCAGCAGCT CCGCCATTGG AGGGATTATC
CTTTCCGCAA TTTTTCTCCC GGAATTCATT ACCGTAGGTG CCAGCGGAGG AATCTTTGGT
TTCATTGGGG CTTGCCTTGC CGACATTATC ATGAACTGGA AGCTTCTCTT TGATGGGCTT
CTCGATGAGA ACGGAAAAAA ACATCAGCAC ACTATGGTCG TTGTCGTGTT GCTCTTCGAT
ATAGCATTGA ATTCAATCAT TGGGTTGACG CCTTATGTGG ACAACTTTAC TCGTAAGTAT
CGCGTGTTGA CTGGTTGATC AAATACTCGC ATCTCACGAA TAAGAATGCA CCTTGCAGAT
TTGGGAGGAA TGGCCTATGG CTTTCTCTGT GGGCTCTCTA CCATCGAACG GCTGTCCAAA
GATTTTTTTG GCTTAGAGGA GTCTTGGATG GTTCGAGCCA AGAACTTTTG CGTGCGCTTT
TTTGGAATTA TCGTTACTGT CGTTTTTATT TGCGTAACTG CAATTATTTT GATGGGAGGC
GATGGCGTAA CAACTCCCTG TACAAATTGT AGTTGGCTCT CCTGTGTCCC TTTTCCTCCG
TGGCAGAGTC AAAGTAATAA ATGGTGGTAC TGTGATGATT GCGGCCGTAT CACGGCCGAA
ATTATTTCTG AGCCCTATTT ACACCTAGAA CTCGACTGCC CGGGCGGTAC AATTGGTTTC
GTCAACGTAA CAAGCGATCA GTTGGACAGA GGCAAACTGG AACAAAGTCT TCCTTCCTAC
TGCCGTCAGT ATTGTCCCAT CAAAGAGCTT TGA
 
Protein sequence
MKRIHGAHSR PGTKPLSMDT TLDDADVYTV SSNVDPAFSL SSSVPQTRTL DSTRSERIGR 
SRKQPSASMT QGRKPLSMDT TLDDADVYTV SSNVDPAFSL SSSVPQTRTL DSTRSERIGR
SRKQPSTSMT QIKCKASFER QYDLTPLGLT TDAGASSRFS LDPWENDSES KKMVKEQTST
SNRSNQLQRK ASDTSPSSDD TEEIHFVDES PSVMSTYLDP EEIHLSMEDR EEEIAQLRLL
AGKLSADWRA QDFVAPALAR RMRDFHFAQE KRRKKYGDER PWGILGLYDH LSAIRIDVQW
AEDAAWRRAN GEPYLSWADF DESKKGGKNR PYFTYVLLFV CTANMIASIA ANDWTVESLD
ENPMIGPSAA TLIRMGAKDS YLIVHAGEGW RLLTSTILHA GLVHYFINML ALWFVGGAIE
MSHGWISAMI IFSSSAIGGI ILSAIFLPEF ITVGASGGIF GFIGACLADI IMNWKLLFDG
LLDENGKKHQ HTMVVVVLLF DIALNSIIGL TPYVDNFTHL GGMAYGFLCG LSTIERLSKD
FFGLEESWMV RAKNFCVRFF GIIVTVVFIC VTAIILMGGD GVTTPCTNCS WLSCVPFPPW
QSQSNKWWYC DDCGRITAEI ISEPYLHLEL DCPGGTIGFV NVTSDQLDRG KLEQSLPSYC
RQYCPIKEL