Gene PHATRDRAFT_49149 details


Gene Information

Locus tag: PHATRDRAFT_49149
Symbol:
ID: 7195649
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 70398
End bp: 72420
Gene Length: 2023 bp
Protein Length: 570 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002183804
Protein GI: 219127150
COG category:
COG ID:
TIGRFAM ID:
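
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (which matches the listed 2023 bp) and that one stop codon follows the 570 residues. The helper name `inclusive_length` is hypothetical and not part of any database API.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


# Values copied from the record.
gene_length = inclusive_length(70398, 72420)

# Coding nucleotides implied by the protein length:
# 570 codons plus one stop codon (assumption), three bases each.
coding_nt = (570 + 1) * 3

print(gene_length)              # 2023
print(coding_nt)                # 1713
print(gene_length - coding_nt)  # 310 bases of the gene span are not coding
```

Under these assumptions the gene span is longer than the coding length, which is consistent with the gene being marked as spliced.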


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.536762
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAAGA CCCTATTGAA TGGATCAGGT CCAATTCAAG GCTTGAAACA TCATGTACAA 
CAGCGTGGCG CTGAGGTGCT CTCCTCTTTT GACAAGAATA TCCCGCCTTG GCCGACGCAT
TTTGTCGTGA GTGAACAAGT CCAGGAGCCA CTTTCGATTG CACAAGCACT GGGTTTCGAT
TCCTTGGAAG AAATGTCATC TTTTCTGTAC GACCATAGCA TTGTTTGCGC CACGCGGAGG
TGGGTTCATC GCGGGGATCG ACTGGACAAG CCGCCTCTGG AGGAACCTAC CATGATGGAA
ACGTATCTTG GAATAGGTCC TAAGCGAAAA TGCAAACGAC ATAGAACGAA CAAAAATGAA
GAGTCGAACG ATTCGCAAAG TTCACGGTCG AATGTTTACA AAACTAACCA ATCTCTATCG
GAAGCCTTTC GGACCTTGTC TAAACGTCAC CAGGAAATGC CGCTGAACGG AGAACTGGAT
GCTTGGAAGT CTTATTCGTT TCAAATTACG GCGGGTCGAT TGCTGCATCT AGGCTTCGAA
ATCAGAGATA GCCCGGAGGT TCTTCGCCAT CTCGCGTCGA TCAACGGGTT CGGCTCGTCG
ACGATGGATA TCATTACAGA TTATCTACGA ACGCAGCAGT GCAGCCGTTT GCGTAATCTG
GAATCAGATC CGGATCGGGT TGCTATGAAG AATATGATGA ATATCTGGGG TGTAGGTCGG
GTTCGAGCCA AGGAGTTGGT GGATGCTGGC TTTAAGAGGA TTAATGAGGT TCGGCAAGCT
GTCGAATTAG GGAACCTACA ATTGGAAAGA AACCAATACA TTGGCGTATT GTGTTACGAC
GATTTGCTGG AAAAGATGGA TCGAACGGAA GTCGAAAGTA TCGGTAAGAT TATATCTAAC
ATTTTCAAAA TGTCCTATCC TGAGGCGGAA GTGTGTGTAA TGGGAAGCTA TCGACGAGGG
AAGCACGCTT GCGGGGATGT TGACATTCTT ATTACACACG AAGATTATAA TCACACGGTC
CCACCGAAAG CACTGGGACA ATTTATCGAC GAACTACGGC AACAGGGACA CATCGCATAT
CACTTGACAT TTATCTCCGG CATGAAGCAT GAGCTATATG AAACAATCCC AGATGCACCA
AGTCACTGGT CGCCGCAGCG CGACAAACGA GACAAATCTT CGAGCAGCTC CTATATGGGT
GTTTTCAAAT CACCTTGTAT GACGGGTAAG AAGCGCCGAG TTGATATTAA ATTTTATCCT
TGGCGAGAAA AGGCTTTCGC GAGTCTTTAC TTCACCGGAA ATGGCTACTT CAATCGATCG
ATGCGCCTTT GGGCAACACG CAAATTCAAC TATACGTTAA ACGACCATGG TGTTTTCGAT
CGAGGATCTC TTGTTCGCGT TTTAGACACG ACTTCCGAAA AAGAAATTTT TGAATTTCTT
GATATAAGTT GGAGGGAACC CAAGGAAAGA GATTCCTTTG ACGCTGTGAA AGGCAAGAAA
AATGGCGAAA GTGCAGCGCA ATTAGAAGGT TTTTCAAGGT CAGAGGTTTC ACGAGAGTCA
AGAGATCACA GATGGATTGT GTAAACAGCT TTTGGCCGCT GTTGCCTGGC ACAGGCTGTG
CATATTGAAC GCAACATTGT CAAATGATCA ATCATTTAAA CTTCGAGTCT CGATATATTT
TACATTAGCT CGCATCGACA AGGCTTTTTA CGATTGATAT TCCGCACTTG TCCGTGAAAA
CGAACTTCTC GTATACCTTC AACGAGTGAA AGCAAAATAT CCTCTTATGC ACGAAAGAGA
CGCCGTCTTC TAAACAGAGT CCGGTATTGC TGCTACCGCT ACTATCAGTT TCACGTCGTT
CCGTGAGCTC AGTGGGAAGG GACGAAACAC GACATCTGGT GGAGCTCCAC TTGACAGGCC
AATCCCGGCA TCGACACGAT TCCTTGTCCC ACACAACAAG ATTCCTCGCG ATGACGTTAT
GCACAACCAA CTCTACGACT CGAAAGCACT AACGAAAAAC TGA
 
Protein sequence
MNKTLLNGSG PIQGLKHHVQ QRGAEVLSSF DKNIPPWPTH FVVSEQVQEP LSIAQALGFD 
SLEEMSSFLY DHSIVCATRR WVHRGDRLDK PPLEEPTMME TYLGIGPKRK CKRHRTNKNE
ESNDSQSSRS NVYKTNQSLS EAFRTLSKRH QEMPLNGELD AWKSYSFQIT AGRLLHLGFE
IRDSPEVLRH LASINGFGSS TMDIITDYLR TQQCSRLRNL ESDPDRVAMK NMMNIWGVGR
VRAKELVDAG FKRINEVRQA VELGNLQLER NQYIGVLCYD DLLEKMDRTE VESIGKIISN
IFKMSYPEAE VCVMGSYRRG KHACGDVDIL ITHEDYNHTV PPKALGQFID ELRQQGHIAY
HLTFISGMKH ELYETIPDAP SHWSPQRDKR DKSSSSSYMG VFKSPCMTGK KRRVDIKFYP
WREKAFASLY FTGNGYFNRS MRLWATRKFN YTLNDHGVFD RGSLVRVLDT TSEKEIFEFL
DISWREPKER DSFDAVKGKK NGESAAQLEG FSSFTSFREL SGKGRNTTSG GAPLDRPIPA
STRFLVPHNK IPRDDVMHNQ LYDSKALTKN
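
The sequences in this record are printed in blocks of ten characters, so they need to be cleaned up before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, copied as displayed in the record.
first_row = "ATGAATAAGA CCCTATTGAA TGGATCAGGT CCAATTCAAG GCTTGAAACA TCATGTACAA"

seq = clean_sequence(first_row)
print(len(seq))  # 60 bases in one displayed row
print(f"GC fraction of this row: {gc_fraction(seq):.2f}")
```

Applying the same two functions to the full gene sequence should give a length and GC percentage that can be compared with the Gene Length and GC content fields.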