Gene PHATRDRAFT_41639 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41639 
Symbol 
ID7199464 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011701 
Strand
Start bp27642 
End bp29678 
Gene Length2037 bp 
Protein Length678 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002185597 
Protein GI219130913 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.627134 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGACG AATCAAATGT GCATGAGGAG GCAGCTTTTC AGGATCACGT ATCGTCTAGT 
GAACACTCTC AGAAAGCACG TGAACGTCGA CACCCCGTCG CAATCGCAAA CAACGCAGAG
GGTAAATCTT CGGCTGAATT TTCCGATCAA ACGATTACAC AAACGACCGC AGCCGATTCA
GTTTCAGACC TGATCGGCGG CGAAGACTAT GAGACTCCAG TTGTGCCTTA CGCCAATCTG
GGCAATCCTA CGCTTTTTGA ACAAGCTGAG ACCAAGCATT TCTCTTCGCA AAAGAATAGT
GACGATATTT CTGACACGGT TGAAACTCAT TCGAGCGCGG CCACCCAAGA CAATCCGGAT
AGGAATAGGG AGAGTTTGAT CAAAATCAAC ACGTTGTCCA GCGATAGCGG TAGTCCACCC
TCTGGTGACG AGGCACAAAT GAGCACGGAA AGTACATTCC AACAACCCAG ATCGCGAGGG
GGCTACGTGG ATCAAGAGAG AAGTTCACCG ATGACAATCA TTCCAAGTGA CTCTCTCAAA
ACCGACGTCA GATCAGAAAT CTTTGCGCTG GAGCACACGC AGTCCCACAG TCAATTGGAG
TCGATCGACG CGGAGCTCAA AGCCGCAAAA GCGGCAACGG AGGTAGTGGA TGATTCTTTA
GAAGAGGAAG TCCGAGCTAT GCGAGCTGAA GGGTCGACAT CGGGTCGGCA GGCCTGTAAT
CTAGATTCTA AGCGAGTCTT CGTGACCGAT GACGCTGTCT TAGCAGAATC GGATGCCTCA
AATGAACTGG AGCTGCTAGA ACACCCGCGT TTGGAGTTTT CGCAAATATG CGGCAGACCC
GAATTCGAAA AATACGAATT TCCGGACGAG GAGACGCGAA AAGGCGACGC GGAGGAACAA
GAGAACAAGT ACAAGAGAGA TGAAGTAGCA ATCAAACACA GTATACTTGA AACAGCGGGG
AGTCACAGCG AGTGGACGGA GCAATTGAGG GACGAAACCG GCAAATCTCT GATCCCAAAA
GAGACCGAAA AGCAGGACAA CCCACACAAA GAAAACTCTG TTGTGGGGGA AGAAGCCATG
ATACCGACCA ATTCACGTAT GGAAAACGAA GATACACAGG AAAATTCCAA ACCATCGGAA
CCTGCCGTTG CTTCAGAAGC TAAGCCATCC CTCAAAGCTG GTTCGTTCGC TAGCCATATC
GACAAAGTAA CTTACGACGA AATGGAAAAT TCCTTTGCCG GTGCAAATGA GCACTCCGGC
GCAGTGATAG CCCAATCAAT GAAGCAAGAC ACTGTTGCCC ACGAACTTCA GACTCCATTT
TGTGGTGTTT GGGGGGCGGT ACATGGGGAA CGAAAGCTCG CCGATTTGTC GGTTCTCGTT
TTATTGTTTG AAAACTTGCT TTCTGACAAG AACGACGACA ACGATGCTCG AAACGCGGCT
GAATTGGCTT TACTTGAGCG AAAATTCGAA GATCTTATGG ACGCGGATTA TTTTTTGGCT
GAAAGTGTGC CCTCAAGTAT TGAGAACATT GCTGAATCAC AGGTCCCCAT GGCGAAGTCT
ATCAGGAATG AGTTTGTTGA TGGACTAGAT GACATAGATA AATTTTTCGA AGACATCGAT
CCTCCGGACG AGTTGGATGT TGGAGCTGGT GGATCGTCTA TACAGGAAGT CCTGATGGGA
CAAGGTAGTC GAATCATTCT TAAACGGCTA GTTATAGCAG CTAAAGTCGT TCGAGACACC
GCGGTTGAGA TTAAGAGGAC GCTACTGACA AAAATCGCGG ACGACGACGG TTCGTTCAGC
ATGGCGCGGA GGGAAAAACT TTACGGTATA TTGAGGACAA TACGGAGGCT AACCCGTAAG
AGCATTGAAG CTTTTCGTCG TTTCATTGAA GGACTCTTGG AAGGTGATGT TTTCGATGGA
GAGGATTTTG TCCTTGACTT CACAGTCAAT CCAAATCCGC CCAGCGAGGC GGACGCAGAC
CCGGGAAAAA AAGCATTTCG TCAGCAATCA CAGTCAATAA ACGGTCGAGC TAACTGA
 
Protein sequence
MPDESNVHEE AAFQDHVSSS EHSQKARERR HPVAIANNAE GKSSAEFSDQ TITQTTAADS 
VSDLIGGEDY ETPVVPYANL GNPTLFEQAE TKHFSSQKNS DDISDTVETH SSAATQDNPD
RNRESLIKIN TLSSDSGSPP SGDEAQMSTE STFQQPRSRG GYVDQERSSP MTIIPSDSLK
TDVRSEIFAL EHTQSHSQLE SIDAELKAAK AATEVVDDSL EEEVRAMRAE GSTSGRQACN
LDSKRVFVTD DAVLAESDAS NELELLEHPR LEFSQICGRP EFEKYEFPDE ETRKGDAEEQ
ENKYKRDEVA IKHSILETAG SHSEWTEQLR DETGKSLIPK ETEKQDNPHK ENSVVGEEAM
IPTNSRMENE DTQENSKPSE PAVASEAKPS LKAGSFASHI DKVTYDEMEN SFAGANEHSG
AVIAQSMKQD TVAHELQTPF CGVWGAVHGE RKLADLSVLV LLFENLLSDK NDDNDARNAA
ELALLERKFE DLMDADYFLA ESVPSSIENI AESQVPMAKS IRNEFVDGLD DIDKFFEDID
PPDELDVGAG GSSIQEVLMG QGSRIILKRL VIAAKVVRDT AVEIKRTLLT KIADDDGSFS
MARREKLYGI LRTIRRLTRK SIEAFRRFIE GLLEGDVFDG EDFVLDFTVN PNPPSEADAD
PGKKAFRQQS QSINGRAN