Gene PHATRDRAFT_38433 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38433 
Symbol 
ID7203419 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011684 
Strand
Start bp83770 
End bp85368 
Gene Length1599 bp 
Protein Length532 aa 
Translation table 
GC content56% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182471 
Protein GI219124356 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTATA AAAAGACAGC TTCGGCAGAT CGGGATGCCT TGTTTGGTGG AGTGCCAGCC 
GCAACGGAAG GAGGGCGCGA CAAGAAGAAA ACCACGAATC GTACCGCGCG TCTTTCCTCG
TCGACTACCG AGACTGCACG GCCAAAACCT ACCGCTGTTC CAACCAATCA AGGTTATAGA
CCCCAGCGCG GCACCGCCGG AGGGACTGGG AAAAAGTCGC AGCCCTCGCT GTCGGCGGAA
ACGCAGGCAC AAAAACGAGC GGAAGCGGAG GACTACAAAG CAAAGGCCAA CAAGTGCATG
CAGCGTTCGT TTTTCGGCAA ACCCGATCCG GTCGCCGCCA GCACCTTCTT CAAACGCGCG
GCGGATTGTT ACCAAATCTT ACAGGAAACG CGGTTGGAGC AATTGTACCG CGTCGAATCG
GCTGAGTGCA ATCGTATCGT GCAGGCCTGG GCTTCGTGTG CTTCGGATTA CACGCGGGCT
GCCGAGCTCA TGCTCGAACT CATCGACACC ACCGACAACG GTACGGATGC TTCCCAAAAA
CGCCGAGATG CGTCCAAATT TCACAAGCAA GCTGCTGGGG CCTGGACAGA AATGGGCGAG
AAATCCAAAG CTGCCGCTTC GCAAGTCCAA GCCGCCATTG CTTTGAATTT CGGAGAAGAG
TCCACGGTTT TGTCCAAACA AGCTCTCCAG GGTATGGAGG AAGCAATTGA AGCGCACGTA
CCCGACGTTT TGAACCCGTA CGGACGGTAC CGCCAAACCG GCGTTTCTGC ATTTCTGGAT
CCGGAGAATG CGGACGAAAC CGTTGAACAG GCCAGTGCAG AAACCTTACA ATTGGCCTCG
TCACACATGG TGACCCGCTC GTACGCCCAC GAACCGTTGA ACCAGCTCGT AGCCGTACTC
GTCAATGCCG GTGAGTACGC TTCGGCCCTG TACGCGGCTG GTGCCGTGAC AGCGATTTTG
GAAAAGGATG GCATTAGCAC GTTGAGTTTG AGTCGTGCGT ACGCCGTCGA AACAGTCTTG
ACACTAGCCT TGGGCGATCC CGTCATGGCG GAACAATCCT TCTTGTCCCG TCACGTTCAG
TCCACGCCGT ACTTGGCCTC ACGTGAATGC AAGCTGGCCG AAGACTTGTT CCGGGCCGTC
AAAACACGCG ATCTGGATGC CTTGGAGGAG GCCCGCGCGG TTACCGGTAG CAATCGGGCC
GCCCTGGCCA ATCTGGATCC GGCCGTTAGG GAATTGGTGC CCCTTCTGCG CTTGACCGGT
GTCGCGCGAA AGAATGTGGC TAGCAATGCC ATCCCCGTGG CATCGACCTC TGCGAACAGC
CGGCGTGGCG GGAAGAATGA ACCGGATCGG TTGCAGAAAA ATGAGATACC GGAGGCGACG
ACGGAACCGG CAACCTTACA AGAACTGAGT AAAATGAAGA CTGGATACGA AAAGGAGGTC
GCCGAAGGAG CACATTTGGA TGGGAATGCG TTGGCTAACG AATTGGATGA TTTGGATTTT
GGTGCTTTGG ATAGTGATCA CGAGGATGAC GGTGATGGCT TGGGAGGGGT GGGCGATGAT
TCCGACTTGG AGGATGACGA TGACGTTGAC TTGCGATAG
 
Protein sequence
MSYKKTASAD RDALFGGVPA ATEGGRDKKK TTNRTARLSS STTETARPKP TAVPTNQGYR 
PQRGTAGGTG KKSQPSLSAE TQAQKRAEAE DYKAKANKCM QRSFFGKPDP VAASTFFKRA
ADCYQILQET RLEQLYRVES AECNRIVQAW ASCASDYTRA AELMLELIDT TDNGTDASQK
RRDASKFHKQ AAGAWTEMGE KSKAAASQVQ AAIALNFGEE STVLSKQALQ GMEEAIEAHV
PDVLNPYGRY RQTGVSAFLD PENADETVEQ ASAETLQLAS SHMVTRSYAH EPLNQLVAVL
VNAGEYASAL YAAGAVTAIL EKDGISTLSL SRAYAVETVL TLALGDPVMA EQSFLSRHVQ
STPYLASREC KLAEDLFRAV KTRDLDALEE ARAVTGSNRA ALANLDPAVR ELVPLLRLTG
VARKNVASNA IPVASTSANS RRGGKNEPDR LQKNEIPEAT TEPATLQELS KMKTGYEKEV
AEGAHLDGNA LANELDDLDF GALDSDHEDD GDGLGGVGDD SDLEDDDDVD LR