Gene PHATRDRAFT_41042 details


Gene Information

Locus tag: PHATRDRAFT_41042
Symbol:
ID: 7198853
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011695
Strand:
Start bp: 208670
End bp: 211831
Gene Length: 3162 bp
Protein Length: 885 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002184990
Protein GI: 219129637
COG category:
COG ID:
TIGRFAM ID:
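The Start bp, End bp, Gene Length, and Protein Length fields can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values listed here, and assumes the coding region ends in a single stop codon. The function names `span_length` and `coding_length` are illustrative only and do not come from the source database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_span = span_length(208670, 211831)   # values from the record
cds_nt = coding_length(885)               # 885 aa plus an assumed stop codon

print(gene_span)            # 3162
print(cds_nt)               # 2658
print(gene_span - cds_nt)   # 504
```

Under these assumptions the gene span exceeds the coding length by 504 bp, which fits the record's "Is gene spliced: Yes" flag if the extra bases are non-coding (for example intronic) sequence. The record itself does not list exon boundaries, so that reading is an inference.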


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.718194
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCGAAC CGCTTTTTTT GGGAAAGGCA ACTCTCTTTG ACGTAACGAG AGTCCTGATG 
TGCACCGTTG CGGAAAAAAT TTGGGAAAGT TCTATCGAGG AACCCATTTC CGTGACAAAA
GTCAAACTTT AAAAAAGGTT GTGACAACCG ACAAGAAGTT TCTTACTGTA AATAGCAAAT
GAAACCATAT TACTTGTAGG TTAAACACAT GATGATCTTG CAAACAAACA TCACTCCTTT
GCCTTTTCAA CGGAACTGCA ACTCCGCAGA ATGACACTGT CTCTCAAAGT GTTATTCTTT
ACGGTAGTTA TACCTGACAA AACGTTGTCG GTGATCAACT CCTTTCGCGG AACGGAAACA
GGCGTCCGTA CTGCGGACAA TTTTCTGTAT TTTACGACGA AATTGCGTCG ACGAACCGGT
TTCTAATACC AAGTAAAAAG GAACCGGTCC CAAGGCATCC CTTCGTCCAG AAAAGACGTC
TTTAACGGAG TGGATCGGGT AGCCTTTTCG AACAGTTCCG ACTCCGAAGT AACTCACCGC
AGTGCGCTCG CGTCACGGAT CATTGAAGCT GAATTAAAGT CCGCTTTCAC GACATTGTCT
TGCCTCGCTA TCTGACACTC CTAGCGCTTT CTTGGCGTCG CTGATCGGAC TCGTGTTGTT
GACCGGCAAC AATCTAAGAA CGACACCATT TCGGACGGAG AAACGATCGC GGCGGACGCA
CGTGCCATCG CGGAGGCGTT TGGTAAACTC GAAATTGCTC CCGAGCACGA GGGCATCGAC
ACTGGATTTG TGACGGACGA CGTTGACGAA CCTGTCTATA CCACGGCAAC GGCTGCCACA
ACGCTTCCCG ACGCGTTGCT GCAGCAAGTC GTTTCCGAAG ACGAGGAGGA AAACGATCGG
TATATTCTGG ATCCGTTGGA TCTTGACAGG ACTTCGGCAC AACTAGAACC CGAAGACGAC
TGGTTTACTC AAGACGAACT TGCTGCGGCA TTGCGAGGGT TGGACCCCAG TCAAGACTTG
CTGTGTCCGC CGGATTCTTC CTGGCAAAGC GACGAGGCGA GCGACGAAGA CTTGCCCGAT
CAGGATCAGG AACTCCCGGA AATCGATATA TTGGATCTGG CTTACGACGG TGATTTTCTG
GCTTCCAACG GAGGCGAATT TTCTTTGGAC AAAAGTCTTT TCGAACCTCT ACAAGATCTC
GATTACGGTT TTGATCACGG ATCGATCGAC AGCCCTAGGA ACTGTCATAC AAGCGAGGAA
TCACTGATAC CTACGAGTGA CGTGGGGGCT CCTGATTGTA ACAAAAGCAA CGCGATCGAA
CTTTTGGAGC AATTGGAATC TCTTACGCTC GACTATCATC AGGATACAGA ACGAGAGGAG
AACTTTGAAG AAATAGACAA CAACGAAATA AGCTTGGAGC AAGAGTTGCC ACCGTGGCTA
TTCTGTCAAC CTTGCAACTC CAATTCATCG GACCTTGAAG CACTGGAGCC ACTACTAACC
GCATCAACTG AAAACTACGT CGAACGCAAT CGTTACGACG AACAAGCCCT GGATTCATTG
CTGGATTTGC GCGGCTTAAC ATTGGAAGAC TACTCGGAAT ACTGCCTACC TCAGGGTGAT
TTAGAACAAT TGGATACGGA GGAACCGACT AGAAAAATGC TACCCTCGCT TTCGGACAAC
GCGACGTCGG TTACAACAGG TGTTGATATT CGACCTGATG TGCAAAGGTT GGCACAACCC
ATTTTGCAGA CTTTTGATGC GATCTATGAA TCAGAACTCA GCAAAGCGGA AACGACCGGA
ACAGTAGCGA AACAAAGCAC TGCTAAGCCA CCCCGGAGCG AACGACTCTG TTTAGGGCAC
AAGGAACGGA TTCTTGGTTT GGATTTGTCT CCGTGCGGTC AGTACTTGGC GACAGCCAGT
CAAGATTCAA CCGTTCGCGT ATGGTCAACC GACACGAACC AATTGCTCGC AACAGTGCCG
CACAATTCGG CCTATGAATG TTTGAGAGTA GTCTGGGCTA GTCCACAATG GGCTGAAAAC
AATATAGATC GTAACGGTTG TGCTTGCCCT TACTTGCTGG CGACGGGCGG TGCCGATGGA
ATTGTTCGAT TATTTCGGAG TGAGAAGCCG ACCGAGTGGG TATTGTGTGC CACTTTGGAC
CATGCGGAAA TGAATCATTT TGAGGGCGAA GAAGAGGCCG ATACACCTCA AGTATATGCA
CTTCAATTCA TTGATCATTG GAAAGCTTTG CCGGGTTCAA AAGAATCTGA CACGAATTCG
TTCCTCTTGA CATCATCGGA TGACCACGTT CATCTATGGG AAATTTGTTC TAAATCCGAG
GGGAAAAAAG AAGAATCCGA CAATGACAGC GGCGAATCGG GGAATCTTCG GTTGCGTGAA
GTCTTTAGTA TGCACTTTGG CGATATGCAT AATCAGGCCT ATGGCGTGCA AGTCGGGCAT
GTTACAGCTG CCGGGCTCGA CATTGCGGAT GCCACGACCA GCCCTATCCC TATAAGTAGC
GGTGGAGATT CAGGTGTTTT TGGCGGAGAT CGTAATCCGA GAGGCCTCGT GTACGTTTTT
GATGCGCGGT ATTGTGAAGC GAACGGACTT CTGGGAGCCG CTCTATCGGA TGGAACCTTG
CGCCTCGTCA ATGGACGTGG GGTATGTCTT TCGTTGTTGC AGCTACCCGG TCATCGCTCG
CACTTAACTT CACTTGCTTG GGATCGAACG GGAGAATGTC TCGCAACTTG CGTAGCGACA
GGGCATTTAA TTACATGGGG AGTCTCCGTT GATGAATACG CCAATCGTGT GCACGCCACC
TGTCGTGCTG TGATGGAAGG TGGCCACGAC AAAGGTCGAC CTCTGTTTGG TACGGAATTT
GTAGGGCGTG ATGGGGATGA GGAAGATGAT CTTTTGATTT CATGGGGTGT CGATGGGCGG
CTTTGTTTGT GGGATTCTTT TTCCACTGAC GAAATAGATC GACCGCTAGC CGTGCTCTTA
CACAAACCGG AATATCCTAT ATACGCAGTG GACTTAATGC AAGATGCTTT TATTGCGGTT
GGCGGCGGCA CGGGGGATGG TGGAGGTTTT GTCGGCATTC CTGTTCATCT TTACAAATTT
CCACCGAAAG AAAAGGCATC ACCAGACCCT CTCCAAGGAT AG
 
Protein sequence
MVEPLFLGKA TLFDVTRVLM CTVAEKIWES SIEEPISVTK RFLGVADRTR VVDRQQSKND 
TISDGETIAA DARAIAEAFG KLEIAPEHEG IDTGFVTDDV DEPVYTTATA ATTLPDALLQ
QVVSEDEEEN DRYILDPLDL DRTSAQLEPE DDWFTQDELA AALRGLDPSQ DLLCPPDSSW
QSDEASDEDL PDQDQELPEI DILDLAYDGD FLASNGGEFS LDKSLFEPLQ DLDYGFDHGS
IDSPRNCHTS EESLIPTSDV GAPDCNKSNA IELLEQLESL TLDYHQDTER EENFEEIDNN
EISLEQELPP WLFCQPCNSN SSDLEALEPL LTASTENYVE RNRYDEQALD SLLDLRGLTL
EDYSEYCLPQ GDLEQLDTEE PTRKMLPSLS DNATSVTTGV DIRPDVQRLA QPILQTFDAI
YESELSKAET TGTVAKQSTA KPPRSERLCL GHKERILGLD LSPCGQYLAT ASQDSTVRVW
STDTNQLLAT VPHNSAYECL RVVWASPQWA ENNIDRNGCA CPYLLATGGA DGIVRLFRSE
KPTEWVLCAT LDHAEMNHFE GEEEADTPQV YALQFIDHWK ALPGSKESDT NSFLLTSSDD
HVHLWEICSK SEGKKEESDN DSGESGNLRL REVFSMHFGD MHNQAYGVQV GHVTAAGLDI
ADATTSPIPI SSGGDSGVFG GDRNPRGLVY VFDARYCEAN GLLGAALSDG TLRLVNGRGV
CLSLLQLPGH RSHLTSLAWD RTGECLATCV ATGHLITWGV SVDEYANRVH ATCRAVMEGG
HDKGRPLFGT EFVGRDGDEE DDLLISWGVD GRLCLWDSFS TDEIDRPLAV LLHKPEYPIY
AVDLMQDAFI AVGGGTGDGG GFVGIPVHLY KFPPKEKASP DPLQG
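
The sequences above are printed in blocks of ten characters, so they need to be joined before their lengths or GC content can be compared with the Gene Length, Protein Length, and GC content fields. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample string is only the first twenty bases of the gene sequence, not the full record.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used as a small sample.
sample = clean_sequence("ATGGTCGAAC CGCTTTTTTT\n")

print(len(sample))          # 20
print(gc_fraction(sample))  # 0.4
```

Applied to the complete gene and protein blocks, `len(clean_sequence(...))` would be expected to match the 3162 bp and 885 aa reported in the Gene Information fields, and `gc_fraction` should land near the reported 49%, assuming the page rounds to a whole percent.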