Gene PHATRDRAFT_42980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42980 
Symbol 
ID7196798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1684576 
End bp1687527 
Gene Length2952 bp 
Protein Length950 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176833 
Protein GI219110163 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCTGATCCGA AAAGATAGCT GACGACCGGA AACTCTCAGA TATATTTTCA CCATGCGCAT 
CGGAAAGCCC CTTCGCAAGT CCCGCGCTTG GAAGAAGGTT GGAGTCAATG GCACACCCAA
ATATGTATCC CCTGAGAGCG TTGTTGGGAG AATCAATTCA GATATTATAT CAAGTGTTTT
GGCACCGAAA ATTGCCGCAC TTGCTTCAGC GAGTCTCGAA CGATATGCTG GGGAGTTATT
GGTATACGAA GACATTATGA AAAAGGTGAA TAGCAATGAA TCTATCGACG GTCTCTTGGA
AAGCGATAGT TCCATTATTA TCCAAGACGG AATTCCACAG CAACCCCAAC GTCCTGTCTT
TCATTCCCAG TTTTCTGTGG ACGAAGCGGC TTCCATATTT TCAGAAACAT CGGAACACTT
TTTCGCAAAA GCGGGTAAAT GGAAAGCACA CGCCAATGCT GCCAAATTTG AGCGTATTCT
GGATGAAAAG TACGGCATTT TGCGTCCATT TATTACGAAC CATCCCGAAA TTGAACATTT
CATTCGGGGC GTTCAGCGGA AGTACGCCAT GGGGTATTTC AGTCCCTTCC GACAAGGCGA
TCCGCCAATA CCCCGATCGA CTGCTGTCAT TATATTGTTT ATGATGCAAC GAGGTCAGAT
GCGTTGGGAA ATAATGCTGT TGACTACCTT GTTCTTTCTT ATTGGCCTAC AACCTTGGGC
TTTGGTGGCA GTTGTCGGAG TTTTACAAGG TCTTCTCATG CGACGAAAGG CAAAGCCTTT
GGGGAAGATG AAGCGTTTCA TCCCTGCGGT AGAGTCATAC TACACGGATG CAAAAACCGA
TACAGAAAAG CACGAGCTAC TATTGCATCC GGTTGGTGAA CCTTTGCCTA GCAAAGAGGA
AATTGACGCG TCTCTCTTTG ATGCTCTGAT TCTTGGCTCA GGACCAGCTT CACTGTATAT
CGCATCGTTG TTGTCGCGGG CGGGTAGAAA AGTGCTCGTT CTCTCTTCAC GGAACGACGC
TAGTGGCTGC CTGAGTATAA AGCATGCCGA GTATTCAAAT GTCCCATTTG ACGTTGAAGC
TTCGAATGTA GCCAAAATAA GCCGTCAGCA ACAAATCTTG GCCCCTGCTC TGTGTACCGA
GACCGATACT CAGGGTGGAG TCCGATTTGC CCAGATTGGA TCAAATGAAG ATGCTCATGC
TTTTGAAATA CTATCGATAC CAGGAATGGG AACAGATTCG TACGACGAAG AGTTACCATT
TATTTTGAAT GCGGATGGTG GAACAGCCGG TCTCATAGAC GATGCTGCAA AGTATCTGAA
TGATGGCTGG CCAGATGCGG AAGGCGGGAA TGGCAATTCT GTAACGGGAG CGTATGCCGC
TGCGTGCGAA GCAATTAACA GTACAGCAAA CGAGTTCTAT ATTTCGAAGA TTCTCTCGGA
AAAAGTCAAT AGTCTACGGA GCTCTCCTAC CTATCAAGAC AGTGGAATTC GTTACGCTCA
GTCCTTCTTG AACAAAACAT TCACCATCAA CCCCCATACA CGGTCGTTGA TGGCGGGTAT
AGGTATGAAA GGGGAGAACA TCCGACCTGG AGCGACAAGT ATGGCAGCGC ATGTCACCAA
CATTAGCGCA GCTCTCAGTG GAGAAGGTAT GCACTATCCG ATCGGCGGAC CTAGGGCACT
TTGCCGTGCA CTCGCCAACG TCGTTCTCCG TAGCGGTGGC CGAGTGTTGA CGTCGGTTGA
TGTCGCTGAG CTAATATTTG GTGAGCCACG GGAACAAGCG AGCAAAGGAA AGCAAAAAGA
AGGGGACAAC GACGGGCCAC CTCCACCTCG CTGCGTTGGA GTCAAGCTAT CAGACGGGCG
AGAAATCAAG TTTGCGAGCG ACCGTTTTGA TGAAAAAAAT GGTTCCTGCT TACCCGCAGT
TATTTCAATG GAAGGCTTCA TTTGGACATT CATAAACATG TTGCCGGATG ACATAAGGAT
GAAGTACAAA GTACCACGTG GCTTGCCAGC TCTTTCGTCG CGGCGGCCTG TTTTCAAGGT
TCTTTTTGCG TTGAAAGGCA GCGCCGATCA ACTCAATGTG ACGGGTGCTG ATTACTATCG
GCTGCCCAAC GCAGCTGTAG CGCGAGACGA GTTTGATCAG TCCTCTGGAC AGATAAAACA
CGGTGAGATT GGTTGGTCTG ATTCGGACAC TGGTGATAAC GGAGATGCTT ACGCGGATGG
AGGTAAGAAT TTAATGGACG TCATCAACCA GGATCCTGGT TCCATCAGTG ATGAGCATAT
TGTAAACTCC AGTAGAAAAC GAGCCCGAAA GACAAAATTT GAAGCTGGGT CTTCATGGCT
CCACGTTTCT TTTCCTTCAG CCAAAGACCC TTCTTTTGAG GAACGTCACG GGAAGACCAC
AACGTGCGTC GTCACTATTG AGGCGGATGA CGATTTTGTT ACCTATTTTG ACACGAAACC
TAAGATCTAT GTCATTAAGA ATGCCTCGGC TACAAAGGGC GATCTTGATC GCTTGCTAGA
ACGTGTCAAA AAGGATGTGT ACCATATTTT TCCTCAACTA AGGGACAAGG TGGACCACTG
CGAAATTTGT GGACCTTTTC AGAAAGGGTT GAGTCACAAT CCCGAGAGAT TCGCCGCCAA
AGGCATTCGA GCCGACACGC CTTATCCTGG TTTGTTCGTA GGAGGATCGG ACTTGACTGT
CGGCGAGTCC TTTTCCGGTG ACATCGTCGG CGCCTGGTTG GCAGCGAACG CTGTTGAACA
ATACGGCCCA CTCGATCACT TGTTCCTGCA AAAGAACATC ACAACTGACA TTGAGCAATT
CTTAGAAGAA CCAGGCTGGG TTGATGAAGA GGATGTTGCA ATTCCGTACA AATCGGCAGA
TGCAAAGAAG GACAAGGACG TCTAAGCGAC CAGTATGTTT TTCCAACCTT AAGGTCTGCG
TCACGGCAAA TT
 
Protein sequence
MRIGKPLRKS RAWKKVGVNG TPKYVSPESV VGRINSDIIS SVLAPKIAAL ASASLERYAG 
ELLVYEDIMK KVNSNESIDG LLESDSSIII QDGIPQQPQR PVFHSQFSVD EAASIFSETS
EHFFAKAGKW KAHANAAKFE RILDEKYGIL RPFITNHPEI EHFIRGVQRK YAMGYFSPFR
QGDPPIPRST AVIILFMMQR GQMRWEIMLL TTLFFLIGLQ PWALVAVVGV LQGLLMRRKA
KPLGKMKRFI PAVESYYTDA KTDTEKHELL LHPVGEPLPS KEEIDASLFD ALILGSGPAS
LYIASLLSRA GRKVLVLSSR NDASGCLSIK HAEYSNVPFD VEASNVAKIS RQQQILAPAL
CTETDTQGGV RFAQIGSNED AHAFEILSIP GMGTDSYDEE LPFILNADGG TAGLIDDAAK
YLNDGWPDAE GGNGNSVTGA YAAACEAINS TANEFYISKI LSEKVNSLRS SPTYQDSGIR
YAQSFLNKTF TINPHTRSLM AGIGMKGENI RPGATSMAAH VTNISAALSG EGMHYPIGGP
RALCRALANV VLRSGGRVLT SVDVAELIFG EPREQASKGK QKEGDNDGPP PPRCVGVKLS
DGREIKFASD RFDEKNGSCL PAVISMEGFI WTFINMLPDD IRMKYKVPRG LPALSSRRPV
FKVLFALKGS ADQLNVTGAD YYRLPNAAVA RDEFDQSSGQ IKHGEIGWSD SDTGDNGDAY
ADGGKNLMDV INQDPGSISD EHIVNSSRKR ARKTKFEAGS SWLHVSFPSA KDPSFEERHG
KTTTCVVTIE ADDDFVTYFD TKPKIYVIKN ASATKGDLDR LLERVKKDVY HIFPQLRDKV
DHCEICGPFQ KGLSHNPERF AAKGIRADTP YPGLFVGGSD LTVGESFSGD IVGAWLAANA
VEQYGPLDHL FLQKNITTDI EQFLEEPGWV DEEDVAIPYK SADAKKDKDV