Gene PHATRDRAFT_49118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_49118 
Symbol 
ID7195339 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp640618 
End bp643660 
Gene Length3043 bp 
Protein Length918 aa 
Translation table 
GC content61% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183786 
Protein GI219127110 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00139685 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGACT TCCCTCACAA AGTCCTCGAT CCAATCGCCA CCACCACCGT TCCGCCGACC 
TACGCCACTC TCAAAGTGGC CCAACGTCAA CTCAGTACCA ACGCCGCCGC CATCCCTACG
CTCAATGGTG GTGGCGCCCA CGGCCACATG GCCCTCACGC TTACCGCCCG CGCCTACGCC
GACATCAGCG ACGTCCCATT CGACATTCCC GTCGCCCCTC CGGCCAACCC TCCCGTCGGC
ACCACGCAAC CGCAAATCAC CGAGTTCAAC CGCATCCACC AACGCAATGC CGACGTCTAC
AACCTGTACG TCGCTGTCAA TAATGCCCTC CGCCAGCAAC TTCTCGACGC CCTCCCGAAG
ATTTACGTAC GCGCCCTTGC ACATCCCATT TTCGAATTCA GCACCGTTAC CTGCCTCGAC
CTCCTTTCGC ACCTCTGGAC CAAATATGGT ACCATTAAGC CTGCCGACCT CCAGAAAAAT
TTCCAATCCA TGTACACCCC ATGGAACACT GCCGAACCCA TCGAGACTGT TTTCTTACAG
CTTGACGAAG CTATCGCGTT TTCCATCGAC GGCAACGACC CCATCTCCGA GGCCGCCGCC
GTTCGTGCCG GCTACGACGT CCTAGCTCAC TCCGGCCTCT TCCCTCAGGA CTGCAAAGAC
TGGCGCAAAT TACCCCTTGT TTCTCACACC CTTGCCAACT TCCATCAGCA TTTCACTCTC
GCCGACGAAG ACCGGCGCCT CACCGCCACC ACTGGATCCC TTGGCTACGC CAATCTTCTC
GCGGCCACTC CCTCTCTGGC TCCAGCCACG GTTTCCGACA CCCTTAGCCT TCCTTTCTCC
GCGCTCTCTG TGTCCCAGAC TTCCGTCTCC TCTCCAGAAA TGACGTATTG CTGGACTCAC
GGAACCAGCA AGAACCGGCG CCACACAAGC GCCACGTGCA AAAACAAGGC CCCTGGCCAT
CGCGACGACG CGACGGCCAC CAACACTCTT GGCGGATCAA CCAAGATTTG GACTGCCCCC
AGGCCTCCTG AATAGGAAGG AGGGACGGCT ACGCCGACGA TTAAAACTAG TAATACCGAT
TCTTTACATC ATATTACTAG TCTTAATTCG TCTGTAGTCC CCTCCCCGCC TAGTACACAC
ACCTCCGCCA TTGCCGACAC TGGCTGCACC GGCCACTACA TTACGGTCGA CTGCCCTCAC
ACCCACAAGC ACCCAGCAAA CCCCAGCCTC GCCGTCCGTG TCCCAAATGG CGCCGTCCTC
CGCTCCAGCC ACATTGCCAC CCTGGCCCTG CCTGGTTTCT CCCCTGCCGC CTGCCAAGCC
CACATTTTTC CTGGGCTCGC TTCCCATCCG CTCCTCTCCA TTGGGCAACT GTGCGATGAC
GGCTGCACGG CCACCTTCTC TGCCACTCGC CTCGACATTC ATCGCGACGC TACCCTGCTG
CTCTCTGGGG CCCGCTCCCC CCACACTGGC CTTTGGCACC TTGATCTTGC CCCAGCTCCC
TCTCCCGCGA CGGCCCATGC CCTTGTTCCA CACACACCCC TTGCCGACCG CATTGCTTTT
ATCCATGCCT CACTCTTCTC CCCGGCACTT TCCACGTGGT GCCAGGCACT TGACTTGGGC
CATCTCGCCA CCTTTCCGGA CCTTTCATCC CGGCAAATCC GCAAGCATCC ACCCAGCTCC
TCTGCCATGA TCAAGGGTCA CCTCGACCAA CAACGAGCTA ACCTTCGCTC CACCAAGCTT
CCCCCGGTCA GTCCTCCTAC CACAACGACA CCTCCCGTCG ACCACGAGCC TGACAGGGAT
CCTCCCGATG CCCCACCGGT CACACGCACG CACCACGTCT TCGCTGCGCA CCAGCGTGTT
ACCGGCCAAA TCTACACAGA CCAACCGGGA CGTTTCCTCA CTCCGTCCAG TGCAGGCCAC
AACGACATGC TTGTGCTTTA TGATTACGAC AGCAACGCCA TCCACGTCGA ACTCATGAAG
AACAAGTCCG GCCCGGAGAT ACTGGCCGCC TACAAACGCG CTCATACCCT TTTCACCCAG
CGTGGCCTCC GTCCCCAACT CCAGCGTCTG GACAATGAAG CCTCTGCAGC CCTCCAGTCC
TTTATGACCT CAGAACACGT TGACTTTCAG CTGGCACCCC CCCATCTACA CCGTCGTAAT
GCAGCCGAGC GGGCCATCCG CACCTTCAAG AACCACTTTA TTGCTGGCCT CTGCACCACT
AACCCGGATT TTCCATTACA CCTTTGGGAC CGCCTCCTCC CACAAGCCCT TATCACCCTC
AATCTTCTTC GTCGCTCCCG CATCAATCCC AAGCTGTCCG CACACGCCCA GCTTCATGGT
GCTTTCGATT ACAACCGCAC CCCGCTTGCT CCTCCCGGTA CTCGCGTCCT CGTCCATGTC
AAGCCGTCCG TCCGCGAAAC TTGGGCCCCC CATGCTGTCG AAGGTTGGTA CCTCGGCCCC
GCCCTGCACC ATTACCGCTG CCACCGAGTC TGGGTCACGG AAACACGTGC CGAACGCGTT
GCTGACACCC TTTCCTGGTT CCCGACCCGC ATTCCCATGC CCACCGCTTC GTCCACCGAC
CGCGCCCTGG CCGCCGCCCG CGACCTGATC CATGCCCTCC AGAATCCCTC CCCTGCGTCT
CCATTCGCCC CCCTCGACGC CACCCAGCAT CAGGCACTCA CCCAACTTGC CAATCTCTTT
GCCACCGTGG CCGCCCCGGC CGCCGCCGTC CCTACATCCG CTCCCACGCC TCCGGTCCGT
CCTCCTGCCC CAGCACCTCC CCCTTCTCAG GTCCGCTTTG CCGTTCCTCT CGTCACGGCC
GAACATGCCC CTGCACTTCC GAGGGTGCCC ATTCCGGCCG CCGCACCTCC GAGGGTGCCC
ACCATAGCCA CCTATCACTC TCGCACCGGC AACCCAGGCC GTCGCCGCCG CAAAGCACGC
ACACAACCGG CAACCCCAAC CCTAGTACCA GCGCATCCAC ACAACACCCG CACCCGGCCC
TTTCTTGTCC CGGCCTCTGC CAACGCTGTT GTCGACCCCG CCA
 
Protein sequence
MSDFPHKVLD PIATTTVPPT YATLKVAQRQ LSTNAAAIPT LNGGGAHGHM ALTLTARAYA 
DISDVPFDIP VAPPANPPVG TTQPQITEFN RIHQRNADVY NLYVAVNNAL RQQLLDALPK
IYVRALAHPI FEFSTVTCLD LLSHLWTKYG TIKPADLQKN FQSMYTPWNT AEPIETVFLQ
LDEAIAFSID GNDPISEAAA VRAGYDVLAH SGLFPQDCKD WRKLPLVSHT LANFHQHFTL
ADEDRRLTAT TGSLGYANLL AATPSLAPAT VSDTLSLPFS ALSVSQTSVS SPEMTYCWTH
GTSKNRRHTS ATLNSSVVPS PPSTHTSAIA DTGCTGHYIT VDCPHTHKHP ANPSLAVRVP
NGAVLRSSHI ATLALPGFSP AACQAHIFPG LASHPLLSIG QLCDDGCTAT FSATRLDIHR
DATLLLSGAR SPHTGLWHLD LAPAPSPATA HALVPHTPLA DRIAFIHASL FSPALSTWCQ
ALDLGHLATF PDLSSRQIRK HPPSSSAMIK GHLDQQRANL RSTKLPPVSP PTTTTPPVDH
EPDRDPPDAP PVTRTHHVFA AHQRVTGQIY TDQPGRFLTP SSAGHNDMLV LYDYDSNAIH
VELMKNKSGP EILAAYKRAH TLFTQRGLRP QLQRLDNEAS AALQSFMTSE HVDFQLAPPH
LHRRNAAERA IRTFKNHFIA GLCTTNPDFP LHLWDRLLPQ ALITLNLLRR SRINPKLSAH
AQLHGAFDYN RTPLAPPGTR VLVHVKPSVR ETWAPHAVEG WYLGPALHHY RCHRVWVTET
RAERVADTLS WFPTRIPMPT ASSTDRALAA ARDLIHALQN PSPASPFAPL DATQHQALTQ
LANLFATVAA PAAAVPTSAP TPPVRPPAPA PPPSQVRFAV PLVTAEHAPA LPRVPIPAAA
PPRAVAAAKH AHNRQPQP