Gene PHATRDRAFT_47001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47001 
Symbol 
ID7202235 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp144562 
End bp146662 
Gene Length2101 bp 
Protein Length534 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181146 
Protein GI219121589 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGTACGTACA AGCTAGTCGA GCACCCCATC TCCTTACACA CCCCCATTCT CTGAGTCGTG 
CACCCGCGTC TACAAACAAT TCGTTTGTAT CGAGTACCAG TGTTCTAAGC ACCCGTACAT
AGTTTCGAGA CTACAAGGAG ATTGTAGCAG GTTGTTCGAA AGCAATAAAG CAAGGACATC
CTTTTTCTTC AATCCAGGCG CCCTCTGAGT TGGTGACTAC GAGCGATCAG TCCGGGCGCT
TTCAGGAGTA GGACAATGAG CCATTCGGTG GCTGGCTCCT CCGTGGATGT TTCGTCCTCG
AATTCGGTGG CTTCTTCAGG TGACGCTTCT CCGACGTCCG GCTCTCGTAT TTTGAATCAA
AGCACTCCTT CCAGCGATGC TTCTACGGTG GCTTCAGTTC CCTCTGTAGA ACCTCAAAAT
CCCTTTGAAA TGTCGCTAAA CGACACGGCA GTGTCTGCTT CCGGAGCCGC CGAAAAGACT
AATCAAAATA CGTCTTCTCC GGCTCGAAAC GGTTCAAATC CTCCTTCTCC GATGCATGCT
CATGCCTCTC ACCAGACCAC TCGCTCGCAT CCTTCTCCGG TTGGAACTCC GGTTTCGACG
ACCCCGCTAC GCGAACCACA TTCGCCGGGC TATCATCAAG CGCAATCGTC TCCCCCTCAC
GCCATCACAG GAGTCGGCAG TTTGACGGTA CCCACGTTGT CGCAATCGGA TCAAGCGCGG
TCTCGTATTC AACAGCAGTT GCCGGGTCAT TTGCAGACTT TTCCTAGCCC TGCTCAGCAC
GGCGTGCGCG AGCCCGTATT TGATGATGAT GAGAATACAG AACCGTCGTC CACCAACAAT
ACTGCTGATC AGCAGCATCA GGAGGTTGGG CACGGCTCTT CTTTCATTCG CTTTCTGGAA
AATACTCGTA AGCGCTTGAG TGTTGCCAAC AAGGATGAAG ATGCAGAAGA TCCTTTGGAA
GACGAGGAAG GCTTGGACGG TGCTTTGATC TATGGTTATT TGCAAAAAAT GGGACGCAAC
GGCAAGTGGC AAACACGCTG GTTTGAATCG GATGGCGAGT GTTTGTCGTA CTTCAAAAGC
AAGAAACGTA CAAAATTACT GGCCACGTTG GATCTAGAAA AGGTAAGTTT GTGTGCTCAT
TTATGTTAAC GGTGAGTGAT TATGGAACTC CGGACCCTCA CAAACCATGG CATGACATTG
ATTTATGTAG GTTGGATCGA TTTGTATTGA TCCACAAGAT CCACAAGGTT GCTCGTTTAC
CATTCAAGTG TTGGGTCGAA TGTATCATTT GCGTGCGAAC AGCAAAGCCG CGACGAAGGA
CTGGGTAATT ACACTGAACA GGATCAAAGA AGCCAAGATG CAGCAAGGGC ATATTCATCT
TGTCAACCCT TACGAACAAC AGCCGCAGGA TCTCTTGGAT AACCACGAAG AGATAGTCGC
GCCTCGGGTT GTCGTGGTGG CCAATCGGGA ACGGACACGC GCCGTTGCCG AGACCATCGA
TTTTGACCAG CTTATCCGTG TTGACCAGAA TGGTGAGAAT CGTGAGTTGA CCTATGACAA
TTCTAAACGG CGTTCGACCA TTGGAACTGT GGTTTTGGGG CGCTGGACAA AGCGTCGTTC
GTCGCTTTCT CGCCTCAGTG CCAAGTTCTC CAAGTGGGCC CGTAGTCTGA AGAAGTACAG
CTGTACCGAA TCAGGTACAG AAAATGTGCA GCTCGATCGC TACGTTCATC CTCCTGGTCA
TGATGACATA CCGAAGCGTC GACAGCCAGA TTCTGGTCCA AAGCTCGCTG CGGACGCAGA
GTCGAACCCT GTAAGCGTTT CAGGGTGGAT TGGCAAGGAG ACGTCCCGGT CAGGACAGGC
TGGAAGCGGA TCGGCAGATG TACCCCAACC AACACGTGCC GTCCGTAGCA TGAGCCAAGC
ATCCGACGAT GTTCGCATGC TATCGTAGAA GGCCGATGCG TCCATGATAG AGAGGATTGC
AGTGCTGAAG CAGGTCGCAC ATTCTGCGAA AATGCTCTTA TATGTTTTTT ATATTTCGTG
CGAGAAGAAA ATTGATAGTG GTAGGGTTGA ACGTATTCAG TTTCTTGGAA TGGCATATTT
T
 
Protein sequence
MSHSVAGSSV DVSSSNSVAS SGDASPTSGS RILNQSTPSS DASTVASVPS VEPQNPFEMS 
LNDTAVSASG AAEKTNQNTS SPARNGSNPP SPMHAHASHQ TTRSHPSPVG TPVSTTPLRE
PHSPGYHQAQ SSPPHAITGV GSLTVPTLSQ SDQARSRIQQ QLPGHLQTFP SPAQHGVREP
VFDDDENTEP SSTNNTADQQ HQEVGHGSSF IRFLENTRKR LSVANKDEDA EDPLEDEEGL
DGALIYGYLQ KMGRNGKWQT RWFESDGECL SYFKSKKRTK LLATLDLEKV GSICIDPQDP
QGCSFTIQVL GRMYHLRANS KAATKDWVIT LNRIKEAKMQ QGHIHLVNPY EQQPQDLLDN
HEEIVAPRVV VVANRERTRA VAETIDFDQL IRVDQNGENR ELTYDNSKRR STIGTVVLGR
WTKRRSSLSR LSAKFSKWAR SLKKYSCTES GTENVQLDRY VHPPGHDDIP KRRQPDSGPK
LAADAESNPV SVSGWIGKET SRSGQAGSGS ADVPQPTRAV RSMSQASDDV RMLS