Gene PHATRDRAFT_39978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39978 
Symbol 
ID7195579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011689 
Strand
Start bp503898 
End bp505964 
Gene Length2067 bp 
Protein Length688 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184008 
Protein GI219127575 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAG AAAATAAGGA ATTTCGAGTC ACGCACAACC GTGAACGGTC CGCAACGCCG 
ACACCGACCA TGAACCCCTC TCATGGGCTG GGTTGGGCCG CGACAGACTC GCGTTGGCAA
CGACCCCCGA CTCGTTCAGT CTTTGGAGTT GACACTGACA GTGCGGCCAT GGCGTCAAGA
AAGCCTCTTG CACTGGGAAC TGCTTTTGTT GACGGCGGCT CCTTCGCGCA CCGTCGAGGA
CTCCGACCCC GCTCGGAATC TCCGCGCCAT CATCACGCGC TACTGGCGGA ATCGTCGCCC
TCGATGCACC GTCACGACGT CGACTATCGT ACCCACGACG ATAACGGTAA TCGCAACCGC
AATTACGACA GTAACAACAA CAACAAGGTT CCAAGACCGG GAGATGGAGG ACAGACACAA
TCTTCCTCCA GCCGCAACCT TAACAATCAA TCGATACGTC GATCCATTGT CGCACCAATC
CCCGCCGTCT CTTTCAACGA CAACGCCGGG ATCTTCTACA ACGATTACGA TGGCGGTGAT
GACGACAACC TGAGTAGAAG AAGTTACAAC ACAGCGACGA TGAACCTGCA CAAAACAACC
CGCTTGCAAT CGTTTCTGCA TCTCCTTAAA GGCTACGTAG GCCCGGGCTG TCTCAGCCTA
CCCTGGGCCG TCTCCCAGCT CGGCATTACG TCCGGTGTCA TTGCAACCTT TGTCATGGCT
TACTGGAGCT CGTACAACTG CTGGACTGTT GTGCGCTTCA AACGCATCTG TCAGAATTCC
AACCACTACG GTCCCTTGCC TTTGACGTAT CCGGACCTTG CTGGTTGGCT CTACGGACCC
CGCTTCCAGC GTTTTACCAC AACTTGCATC TGCATTCAGC AACTCGCAAT TTGCACCGTC
TTTCTCAGCT TTGTTGGTGC CAACTTGAGT GCCGTATTGG TGGCCGTTTG GTCCGTTCCG
CTCACTCACG TGCAAGTCAT TTCGTGCTGC TTGCCCGCGG TCCTCGCTTT GTCCTTTCTG
CCCAATCTCA AGGCACTGGC GCCGGCGACG GCGACCGGAG CGGCGTTTCT GGGCTTGGCT
TTGCTCTGTT TGAGTACCGT CATTGGCCTC CAATGGAACG ATCGACCCCG GCACGAAGCT
CTGTCCGTGG ATTGGACCAG TGTGCCCTTG GCTTTTTGTG CCATCTTGTA CAGTTACGAG
GGCATTTGCC TCGTCCTTCC GGTGGAATCC AGTATGCAAC GGCCGGAACA CTTTCAAAGC
ACCTTTGTGA CGGCCATGAT AGCTTCGGCT GTCGTCTTTG CCCTCGTGGC CTCATTCTGT
GTGGCAGCTT TTGGGCCAGT GACGAACGGT TCCGTCACCG CCTTTTTGCT GGAAAAGTAT
GCCGATCGGC GTCACTTGCA GGGATTGTTG CTAGCGGCCA ACGGATTCGT GAGTCTTTCC
GTTCTGGTCA CGTATCCGTT GCAGCTATTT CCCGCTCTGG AGTTGGTGGG ACCCTGGTTT
CGGCCTTGGG AGAGATGGGT GCAATCATGG GGATCGTCGA CGACCACATC AACCTCGACC
AATATACAGA CCAACTTCAC ATCACTTACC AACACCGACG AGTCCGCATC GGACTCGCAC
AATCAGTACA GCGCCGATGA CGTCCACGAC GAACGCATGG AACCGCTCGA TATTAGTCCC
GTATCCAGTG TCGCCCGTTC GGCACTAGTG GAAGCGGCTC CGGAGGCCAG CCATTCCCCG
GTTGCCAGAA TATCGCTAGT AATGCTCACA TACGTGGTCG CCGTGGCAGT TCCCAACGTA
CAGATCCTCA TCTCGTTAGC GGGCGCCTTG GCCGGCTCGT CGACAGCCTT GCTCATTCCT
CCCGCGCTCG AACTGGCGTA TTTGAAACAG TACGGCACGG AAAGTGATAC CATGTCGATA
GGCATGGTTT CCCTGCGAGT ATACATTCTG TTGGCCTTGG GATTGATCTT CATGGGCATT
GGGACTGGGG CGTCTTTGTT GGATATCTAC CGGGTCTATA CGCAAAGTGG CGAAGAAACG
GGTTCCGACA GCGCGTCCTC CGTGTAA
 
Protein sequence
MKKENKEFRV THNRERSATP TPTMNPSHGL GWAATDSRWQ RPPTRSVFGV DTDSAAMASR 
KPLALGTAFV DGGSFAHRRG LRPRSESPRH HHALLAESSP SMHRHDVDYR THDDNGNRNR
NYDSNNNNKV PRPGDGGQTQ SSSSRNLNNQ SIRRSIVAPI PAVSFNDNAG IFYNDYDGGD
DDNLSRRSYN TATMNLHKTT RLQSFLHLLK GYVGPGCLSL PWAVSQLGIT SGVIATFVMA
YWSSYNCWTV VRFKRICQNS NHYGPLPLTY PDLAGWLYGP RFQRFTTTCI CIQQLAICTV
FLSFVGANLS AVLVAVWSVP LTHVQVISCC LPAVLALSFL PNLKALAPAT ATGAAFLGLA
LLCLSTVIGL QWNDRPRHEA LSVDWTSVPL AFCAILYSYE GICLVLPVES SMQRPEHFQS
TFVTAMIASA VVFALVASFC VAAFGPVTNG SVTAFLLEKY ADRRHLQGLL LAANGFVSLS
VLVTYPLQLF PALELVGPWF RPWERWVQSW GSSTTTSTST NIQTNFTSLT NTDESASDSH
NQYSADDVHD ERMEPLDISP VSSVARSALV EAAPEASHSP VARISLVMLT YVVAVAVPNV
QILISLAGAL AGSSTALLIP PALELAYLKQ YGTESDTMSI GMVSLRVYIL LALGLIFMGI
GTGASLLDIY RVYTQSGEET GSDSASSV