Gene PHATRDRAFT_42424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42424 
Symbol 
ID7196636 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp18350 
End bp20109 
Gene Length1760 bp 
Protein Length551 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176998 
Protein GI219110493 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.960008 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTCGAAT CTCCCTCGCG TAGCCTTTTC AAGAAGGCTG GTTTTGTGTT CACAATCGCG 
TTATCCCTAT GGGCGATTAC TGACGAGAAC ATTTCTGTCG GAAAAGCGCT GCAACAGCAC
GAACCGAGTC TGCGAGTGTT TCGATCCTTG CTCGAAGTGA ATCTTTTATT TTTCTGTACC
GCGGCGGCAT TGTTTGTCTG GTCCAAGACC ATAGGACAAT CAACAATTGA AGCGCTCTTG
TTTCAACCAC TCAAGCTTTC CGGGGCACAC CCGGAGAACG CAGACCGGCA TGTGTACGCC
ATGACCGAAG TCAATGACGA CGAAGTTCTC CAGGAAGACG ACGCTCTAGA TGCCGATTAC
GCCAGTGATG GAGAAGTCGA TCGGCAAGAA GACTTAGACG AGAAACTATA CATCCCCACC
GCGGCTTCTG TTGCCAATGC TGCCTTAAAC ATGCTCTTTA CTATTCTGGT CGTTCTGTTC
TTGTTCACGT TAAGCTCGAT CTCCACCGCC AAGCATTCAG TCGAAGAAGT AGCATCCAAC
ACGAACAGCA GTGGCCTCTG GGATCTCTTT TCACGTGTCA CGGCTCCAGT TTTCCCTCTG
TTGCTCTTTC TTTACTTTTT GCTACGGGCT ATCTTCCCCT GGAGGCGTAA AAGATCGTTT
TGGGCAGTTG TTTTCATGAC GATGTCCGCC CCGTGGCATC CAGTAGACTT TCGGGACGGC
TTTATCGGTG ATATCATTAC TTCTTCAGTA CGACCGATGC AAGACATTGC TTTTACCGTA
TTTTATATCC TATCGGGTCT AAGAGGATGG TGGTCACGAG AATATCGAGA CGGCAACTTT
ATCGATTCCG CGGATGCGAG CGTTCCAGCA ATGGAACGAT CATGGCTGTT ACACACTGTC
GTACTGCCAA TGTGCATGGT CAGGTACGCA TACATGCATA TACACGGGTA GCAAATACGT
GTTAGTTGCT AACGTTGGGA AATCAGTATC TAGCCTCACT TACCGTCCCC TTTGCTCATG
GTGGTAGTCC CCTCTGGTGG CGATTTCTTC AAAACCTTCG ACAAAGCTAC GATAGCAAGC
AGCGCTGGCC GCACCTTGGC AACGCTCTTA AATACTGTTT CGCCGCCCAA ATTGCAATGT
TTGGTGTATT CAATCCCGAC CAAAAAAAGA GCGTTCTCTG GTTAACAAGT TTTGTTGGCG
CTACTTTGTA TCAGCTTTGG TGGGACATCT TTATGGACTG GTGCCTATTG GTTCGTGTGG
ACGAGCGCTG GAAACTTCGT AGTACACGTC TGTACACCAA AACATCTGTA TATTGGATTA
TCTGTGGGGC AAACTTAGTT TTGCGTTTTT GCTGGACTCT GAGTTTTGTC CCGCCGCGCT
ATCTAAATGC CTCCGGCGTT CTGAAAGAAA GCTTCTCAGG CGATGTGAAG AATATCCTGG
GCCCCTTTAT TGCTTCCGCC GAAATTGTGC GAAGGGCTCT ATGGGGACTG CTGCGTTTTG
AATGGGAGGC GACGAAGAGA TACAGTGATC GTAAATCATC GTTTGACGAA AGTCAAGACG
GTTTGAGAAA TGAAATCGAA CTTACACCGA TGAAAATAAA ACAAGATGAG TATCGCAAGT
CTTCCAATGC TTTCTCCGTA GGCCATTCTT GGAAAATGTC CTCGATGAAT GAGGTTCAAA
TAATTGGCGA GCTTGGTGTA TACGCGACAG CTTTCTGGTT GATTGGGACA CTAGCCGCCG
CACATCGAGG AACTTTGTAG
 
Protein sequence
MVESPSRSLF KKAGFVFTIA LSLWAITDEN ISVGKALQQH EPSLRVFRSL LEVNLLFFCT 
AAALFVWSKT IGQSTIEALL FQPLKLSGAH PENADRHVYA MTEVNDDEVL QEDDALDADY
ASDGEVDRQE DLDEKLYIPT AASVANAALN MLFTILVVLF LFTLSSISTA KHSVEEVASN
TNSSGLWDLF SRVTAPVFPL LLFLYFLLRA IFPWRRKRSF WAVVFMTMSA PWHPVDFRDG
FIGDIITSSV RPMQDIAFTV FYILSGLRGW WSREYRDGNF IDSADASVPA MERSWLLHTV
VLPMCMVSPL WWRFLQNLRQ SYDSKQRWPH LGNALKYCFA AQIAMFGVFN PDQKKSVLWL
TSFVGATLYQ LWWDIFMDWC LLVRVDERWK LRSTRLYTKT SVYWIICGAN LVLRFCWTLS
FVPPRYLNAS GVLKESFSGD VKNILGPFIA SAEIVRRALW GLLRFEWEAT KRYSDRKSSF
DESQDGLRNE IELTPMKIKQ DEYRKSSNAF SVGHSWKMSS MNEVQIIGEL GVYATAFWLI
GTLAAAHRGT L