Gene PHATRDRAFT_34060 details


Gene Information

Locus tag: PHATRDRAFT_34060
Symbol:
ID: 7197613
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011672
Strand:
Start bp: 997564
End bp: 999365
Gene Length: 1802 bp
Protein Length: 576 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002178339
Protein GI: 219115087
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACGATG ACGAGAGTAA GGTTGGCAAT TCCAACAACG AGGCTCATGT CGAGATTACA 
TCCTTCAATG AAGCCCTGGT CCGCCCTATT GCGCCGAAAA CGAAAAAGAA GCGCGCGTTT
AAAAAGAAGA ACCCGGCAAT GGGTGACACT GCATTTCTAC GCAAACGAAC GGCTAATCTG
TTGAGTATAA CGAAGCAAAA TTCCGACAAC AAGGAAGGTG CCCTTGGGGG CGGTATGAAA
GTTGATCGGA AGACCTTCCA CTTCCTCATG GATGCTTGGG CATTTTCCGG AGAAATTGAT
GCTGTTGAAC AAGCGAGTGG ACTGCTGTCC CGTATGGAGG AATTGGCATC TTGGGGAAAT
TTCAATATCG AACCTGACGT TCGTTCGTAT ACCAAAATGA TCAACGCCAT CAGCCGTTCG
ACAGCCCCTT CTGCAGGGGA CGAAGCCGAC GCAATCTTTG CAAAAATGGA AGAGCTCTAT
CAAACAGGGA GCAACCGCGC GGCTCGTCCG AATACATATA CCTATACTGC CGTAATAGAA
GCTCATGCCC ATTCCGGCGC ACGGGGCAGT GCTAAACGAG CCGCAGAATG GTGCGAACGA
ATGATTGATG CGTACGAACC ATCACCGCAT AACGAGAACA AAGAATCAAG TGTCCGCCCA
ACTGCTCGTG CTTTTAACGC TGCAATTTCG GCATACGCCA AGTCAGGAGA AGAAGGAGCA
GCGGCTCGAG CAGAACGCTT GTTTGATCGA ATGGAAGAAT TGTACGAGAC GGGAGTTGAA
GAAGCAAAAC CCAACGCTTT CAATTTTAAC TCGCTCATTA CAGCATGGGC TAACTGCTGC
GAGGAAGGTT CAGCACAGCG CGCGGAGGAA ATTCTAGAGC GCATGGAGTA TTTGTACAAG
CAAGGAGATG AGAAATGTAA GCCGACAACA ATATCATTCA ACGCTGTTAT TGACGCGTAC
GCGAAATCGG GTGATGAATA TGCAGCGCAA AAAGCTGAAG AAGTTTTACG GCATATGGAA
GATCTCTATG GATCGGGACA AAATCTTGAC GCTCGTCCGA ATGTAAGATC GTTCAATTCA
GTCATCAATG CTTGGGCAAA AAGTCGAAAC GAAGAGGCGG CTTGGAAAGC GCAGGATATG
CTGGATTTGA TGGAGAAGCT CTACGCTAAA GGTAACAAAG AAGTGCGACC AGATGTACAC
AGCTTCTGCA CGGTAATTAA TGGTAAGGGA CACACAGATT GAAAAATACA CGTCTGGCAC
CGGCTCCACT CACACTAAGT TTCTGTATAA CAGCTTGGGC CCGGAGTCAA CAGCACGGTA
AAGCTGAGCG AGCCCTGAAT TTGTTTCGTG AAATGAAGCA GCTTCATGAG GCTGGTAACA
AGCACTTGAG ACCGAACACG GTAGCAGCGA ATGCGGTAAT GAACGCGTGC GCGTATACGT
CCGGAGATGT TCATGAACAG AACCGAGCGG TAGAAATCGC TCACACTATT TTAAAGGAAC
TGGAACAATC TCCTTATGGA AAACCGGACC AGGTGACCTA CGGGACTTTT CTGAAAGTAT
GTGCAAATCA AATGCCGGAC TGCAGCACAC GCAATCAAGT TATTTCCGTC GTTTTCAAAA
AGTGTCAGAA GACGGGGCAG GTGGGGAATT TTATTCTACA GCAGCTCAAA GCCATGGCGT
CAGAAGAAAC ATATATGATG TTGTTGGGTC GAGGGATCCA CGAAGACATC CAGATAGCGG
ACCTCCCCTC CGAGTGGTGG TGTAATGTTG TCGAGAACCG GTGGAGGCGT CGTGGAAACT
AA
 
Protein sequence
MDDDESKVGN SNNEAHVEIT SFNEALVRPI APKTKKKRAF KKKNPAMGDT AFLRKRTANL 
LSITKQNSDN KEGALGGGMK VDRKTFHFLM DAWAFSGEID AVEQASGLLS RMEELASWGN
FNIEPDVRSY TKMINAISRS TAPSAGDEAD AIFAKMEELY QTGSNRAARP NTYTYTAVIE
AHAHSGARGS AKRAAEWCER MIDAYEPSPH NENKESSVRP TARAFNAAIS AYAKSGEEGA
AARAERLFDR MEELYETGVE EAKPNAFNFN SLITAWANCC EEGSAQRAEE ILERMEYLYK
QGDEKCKPTT ISFNAVIDAY AKSGDEYAAQ KAEEVLRHME DLYGSGQNLD ARPNVRSFNS
VINAWAKSRN EEAAWKAQDM LDLMEKLYAK GNKEVRPDVH SFCTVINAWA RSQQHGKAER
ALNLFREMKQ LHEAGNKHLR PNTVAANAVM NACAYTSGDV HEQNRAVEIA HTILKELEQS
PYGKPDQVTY GTFLKVCANQ MPDCSTRNQV ISVVFKKCQK TGQVGNFILQ QLKAMASEET
YMMLLGRGIH EDIQIADLPS EWWCNVVENR WRRRGN