Gene PHATRDRAFT_41702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_41702 
Symbol 
ID7196707 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1944523 
End bp1946669 
Gene Length2147 bp 
Protein Length671 aa 
Translation table 
GC content52% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176877 
Protein GI219110251 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGCGTG GTGTTGGTGG ACGTATCGAG GATGCCTTTG CGGCCGCGAA GGAACAGGGC 
GAAGCGGCCT TTGTGACGTT TGTCACCGCC GGTTATCCGA CTGCTGCTGG TATGTACACC
AGCCGATTCT ATTCTGCTTT ATCAATTTTA TGTTTCCTCA AACTCCTCTG CTTTTGACTT
ACTCAGATAC GCCAGCGATT CTCATGGCCA TGCAGGAAGG TGGGGCGGCG TTGATTGAAC
TTGGAATTCC GTACACAGAT CCGCAGGCCG ACGGAGCTAC GATCCAGCAC ACCAATCAAG
TAGCAATCAA GGGAGGGACG TCGGAGATTC ACCAGTGTCT CGACATGGTA AAGAAATCTC
GCGAAATGGG ATTAACTGTG CCCGTGGTTC TCATGGGATA CTACAACCCC TTCCTTCAGT
ACGATGTAGA CAAGCTTTGC GAAGAAACCA AGGCTGCCGG AGCCGATGGT TTCATCGTTG
TCGATCTTCC TCCCGAAGAG GGTATTGCCT TGAATAAGGC TTGCATTGCC AATGGGCTTT
CCAATATTCC CTTGGTAGCA CCTACTTCGA GTGACAAGCG TATTGCTTCC TTGACAGACA
TGGCTTCTAC CTTTCTATAC TGCGTATCAG TTACAGGAGT AACTGGCGCG CGTGAATCCC
TTCCTCCGGA TCTGGAGGAG TTCATTACAC GCGTACGTTC CAAGACGGAA CTACCTTTAG
CGGTCGGTTT CGGTATTTCA AATCCTGAAA TGGTAAATGG TGTTGCCAAC ATGGCCGATG
GTGTTGTGGT GGGTAGTGCG ATTTTGAAGG CTATGGATTC GCTTGGAGAC ACTGCTACCA
CCGAACAGCG TGCCGATGCA ATTCGTGAGG TTGTGGCTCA CTTGAAAACA GGTACGAAAC
AGTCGGGCGA TGCAAAGAAC CAGTCAACTA AACTTGGTCA AATCCCAGCG GAATGGACTC
TGGGCGAGAA CAACAGCCGA TTCGGGAAGT TTGGCGGGCA GTACATTCCA GAAACCCTTT
CGGTCGCCTT CGAAGAAATT GAAGCGTCCT ACAATGAGCT GAAGGACGAC CCATCCTTCC
TTGCTGAACT CGACGAATAC CGACGGGACT TTGTTGGTGG TCCGACGCCT TTACACAGAG
CGGACCGCCT GACGGAACTT GCCGGAGGCG CGACGATCTG GTTAAAGCGC GAAGACTTGG
CACATACGGG AGCTCACAAG ATCAACAATG CAATTGGGCA AGCCTTGCTT GCCAAGCGAA
TTGGAAAACC GCGTATCATT GCCGAAACGG GTGCTGGACA ACATGGAGTC GCCACTGCAA
CAATTTGCGC CAAGCTCGGC CTCGATTGCA CTGTCTACAT GGGAGCGGTT GATTGTGAGC
GTCAAAAGCT TAACGTTTTC CGCATGAACA CTCTCGGTGC CAAGGTGGTA CCTGTACAAG
ATGGGCAACG GACTTTAAAG GATGCCATCA ACGAAGCGAT GCGCGATTGG GTCACCAACG
TTCGCGATAC GCATTACTTG ATTGGTTCCG CTGTCGGGCC TCATCCATTC CCAACGATTG
TACGTGACTT TCAAAGCGTA ATGGGTCGCG AGATGCGTGC CCAGATGCTT GAAAGGGCCG
GAAAGCTTCC AGACGCTGTT GTGGCGTGTG TCGGGGGTGG ATCGAATGCC ATAGGAGCTT
TCCATCCTTT CGTTAATGAC GAGACTGTTG AGCTTCACGG AGTGGAAGCG GCAGGCTACG
GGATCGACAA AGACGAAGAA CATTGCGCGA CTCTCACCAA GGGAACACCC GGGGTACTCC
AGGGAGCCAT GACATACGTT ATTCAGCAAA AGTCCGGTCA AACGCTCAAC ACACACTCAA
TCTCTGCGGG TCTTGACTAT CCAGGTGTTG GTCCGGAGCA CGCTTTCTTG AAAGATAGTG
GACGAGCTGT ATACGAGGCA GTCACAGATG ATGAAGCACT GGAGGGCTTT AAATTGATGT
GTGAATATGA AGGTATCATC CCGGCCCTTG AAACGAGCCA TGCTATCTAC TATGCGGTCA
AGCTCGCGAA AAAGCTCGGC CCTGGCAAGG ACATTGTAAT CAACATGAGT GGTCGGGGAG
ACAAGGACAT GCCGCAGATC GCGAAGATTA TGGGCGTCGA AGTTTAA
 
Protein sequence
MERGVGGRIE DAFAAAKEQG EAAFVTFVTA GYPTAADTPA ILMAMQEGGA ALIELGIPYT 
DPQADGATIQ HTNQVAIKGG TSEIHQCLDM VKKSREMGLT VPVVLMGYYN PFLQYDVDKL
CEETKAAGAD GFIVVDLPPE EGIALNKACI ANGLSNIPLV APTSSDKRIA SLTDMASTFL
YCVSVTGVTG ARESLPPDLE EFITRVRSKT ELPLAVGFGI SNPEMVNGVA NMADGVVVGS
AILKAMDSLG DTATTEQRAD AIRESTKLGQ IPAEWTLGEN NSRFGKFGGQ YIPETLSVAF
EEIEASYNEL KDDPSFLAEL DEYRRDFVGG PTPLHRADRL TELAGGATIW LKREDLAHTG
AHKINNAIGQ ALLAKRIGKP RIIAETGAGQ HGVATATICA KLGLDCTVYM GAVDCERQKL
NVFRMNTLGA KVVPVQDGQR TLKDAINEAM RDWVTNVRDT HYLIGSAVGP HPFPTIVRDF
QSVMGREMRA QMLERAGKLP DAVVACVGGG SNAIGAFHPF VNDETVELHG VEAAGYGIDK
DEEHCATLTK GTPGVLQGAM TYVIQQKSGQ TLNTHSISAG LDYPGVGPEH AFLKDSGRAV
YEAVTDDEAL EGFKLMCEYE GIIPALETSH AIYYAVKLAK KLGPGKDIVI NMSGRGDKDM
PQIAKIMGVE V