Gene PHATRDRAFT_38848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38848 
Symbol 
ID7203595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp347991 
End bp349646 
Gene Length1656 bp 
Protein Length552 aa 
Translation table 
GC content57% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182948 
Protein GI219125354 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGTCC AGGGCTTGTG GCGTTTGCTG TTGCCCATCG GTCGTCGCAT CTCAATTGAA 
ACGTTGGAAG GCCGCGTGCT CGCGGTGGAC GCCTCCATTT GGCTCACGCA GTTCCTCAAG
GCCATGCGCG ATCCGGACAC GGGCAAGGTC CAACCCGCCG CGCACTTGAT TGGCTTTATC
CGTCGGCTAT GTCGTCTCCG CTTTCACGGG ATCCGCGCCG TCCTCGTCTT TGACGGACCC
ACGCCCGCCA TCAAACGACG TGAAATAATG CGACGACGCA AGCAACGCGA ACAGTTCGCT
ACCTTGGGAC CAGCCGGAGT CCAACGACTC GCGCGACGGC TACTAGCCCA AACCCTACAG
CAGCAACAGC AAAAAAAGCC AACAGTCCCA GAGCCTCACG GTCACGACGC AGCCTTCCAC
CCAACGTCGT CGACCGGAAC GGCACAGTCT TTGGCGCCCG GTTTCAACCC AGGGGGGCGG
GACGGCAATC CGAAAGACGG ATCGACCACG AACGCTGCAA CTTTGGCTAC CGCTGCTTTA
TCGAATGACC CCACGAATTC ATCAGAAGAG CCCGCAGCAT ATTCGGCTAC ATCCGACTCA
CTCGGTACGA AAGCTGCGAC CGATTCTGCG CCTCCGGAAC AACCTGCCGC CGCATACCCT
CCGGATACCG GCAGTGACGC TGCTATTGCC GCGGCGTTGG AATTCGGTTC GGACAATGAG
AACAACACGA ACGACCCCCA CGTCCGCGAC GACGATGGTG ATGATCCAGT CAACGACTGG
GATTTACCGC TCGACCACGA GGCGACTGGG AACGAGTCCA GCAATAGCGA CGACCCCGTA
ACAACCAATA GCAGCTCCCT CGGCTTTCCT CGTTCCAATA AACGCCAACG CCGTTTGTGG
GATGAGCGTC GCGGCACCAT GGACGTGGCG CAGATAGCTG CCTTACCACC CGGCCAACGT
AGGGATGCTA TTGAAGCCGC CAAACGAACG CAGCGACTCC TTTCGCGACG GGAATTCATG
CCGGCCGCCG CCAACCCCGA CGCCTTTTCG TCCGTCCAGG TCACCAACTT TTTGCGGTCG
ATCCGTCTCA ATCAATCCAT ACACGCCATG GCGCTCCGTG TCGTACACGA CGAGGAAAAG
GCGTTTGCAT CCCAACCGGG TGAATTTATG GCGTCGGATC GGAACACCAG AGTATCCCTG
ATACGGGAAG ATGATCCCGA CGATAACGAC CGTACCACAC CACCAGACGC GCCAAGGGAG
CGACCGTCGG CGGCTCTGCG GGCACGGCAA CAGCAAAACC GACGGAACAA TGGTACCCAT
CGATTTTCGC ACCGAGATTC ATCATCCGAC GAGGATTCAC AGTCAGTCGG AATCGGAAAA
TGTGCCAACC CAGCCTTTGC TGCTGCAGGG AAAAAACGAC GACGGGCCAT TTTGGATGAG
GAGGATGACT ACGGCGATTC TTCAAAAGAA GAGGATCGCG TCAATCAAAA GTCTACGCTT
TCGACAAACA GAGCATGGTA CAAGGATCGC CCGCAAGCGA CATCGCATTT ACAACCTCTG
GAATTGGACG ACAGTAGCGA AGACGATGAC AGCGTCCCAA ACGCAGAACT TAACAAAATG
GAATTAGACT ACAGCCAACA ACGGCTTGAC TCGTCG
 
Protein sequence
MGVQGLWRLL LPIGRRISIE TLEGRVLAVD ASIWLTQFLK AMRDPDTGKV QPAAHLIGFI 
RRLCRLRFHG IRAVLVFDGP TPAIKRREIM RRRKQREQFA TLGPAGVQRL ARRLLAQTLQ
QQQQKKPTVP EPHGHDAAFH PTSSTGTAQS LAPGFNPGGR DGNPKDGSTT NAATLATAAL
SNDPTNSSEE PAAYSATSDS LGTKAATDSA PPEQPAAAYP PDTGSDAAIA AALEFGSDNE
NNTNDPHVRD DDGDDPVNDW DLPLDHEATG NESSNSDDPV TTNSSSLGFP RSNKRQRRLW
DERRGTMDVA QIAALPPGQR RDAIEAAKRT QRLLSRREFM PAAANPDAFS SVQVTNFLRS
IRLNQSIHAM ALRVVHDEEK AFASQPGEFM ASDRNTRVSL IREDDPDDND RTTPPDAPRE
RPSAALRARQ QQNRRNNGTH RFSHRDSSSD EDSQSVGIGK CANPAFAAAG KKRRRAILDE
EDDYGDSSKE EDRVNQKSTL STNRAWYKDR PQATSHLQPL ELDDSSEDDD SVPNAELNKM
ELDYSQQRLD SS