Gene PHATRDRAFT_29223 details

Gene Information

Locus tag: PHATRDRAFT_29223
Symbol:
ID: 7203003
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 679358
End bp: 681445
Gene Length: 2088 bp
Protein Length: 439 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002182441
Protein GI: 219124292
COG category:
COG ID:
TIGRFAM ID:
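
The Gene Length value above agrees with the Start bp and End bp coordinates if both are read as 1-based and inclusive. That convention is an assumption here, since the record does not state it. A minimal sketch of the arithmetic, with the helper name gene_length chosen only for illustration:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Return the feature length in bp.

    Assumes 1-based, inclusive coordinates (an assumption; the record
    itself does not state which convention it uses).
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(679358, 681445)
print(length)
```

Under this convention the result matches the 2088 bp listed for the gene.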


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCTCCGGGGA GTTTAGTGGG CAAAATTGAC GATCTTGCCA GACCACTCGA CGTGCAAATC 
TTCCTTCATT TGGTTGGTAC GACTGAGTTG GGTTTATTAA AGGCTTTACC GACACAACGA
GAGCTACTCG TGCGGGGGAA GCCTGCCTTT CTTGACGGAC GGATCCAGTA TACGTGACAA
AATCTTGGAG GATCTCGTCG TAGAAGTCCT ACGAGAAGCT ACACGACAAA ACGTGGACCT
CGAGTGCCGT GCCGAAGAAA GCCTGCCGAT TTTCGTCATA TCTAACCAGC CCCTTCCTCA
GGAACAACGG TTCATTTACT GGTTGGAGCA AGCTTTGGAG GTTCATTCTA GTTTCTCACT
CCACGAAACG CTACCGTTTC GAGAAGCCGT TTCGAGAAGA ATTTATCCGC ACGAAGAACG
AACGCTGTCG TGAATTGTCG CATTTGGATC GTTGCTATTC CCAAACTTTG GCTTGCTTTT
CATGTCCACA TCGTTCTCGG CGGCTCTCAC ATCTTCAACG ACGACAGCGA TGGAGGCTTC
GGCAACTAGT CCCCGCAAAG GACACGCGAA TAGTATGGTG GAAGCATCCG GTTCCTCGGC
GACATCCGCT CCGGCAGCAA ACGCGCAAAG TGACAACACC CGTAGCGGTC CCTTGTCGGA
ACAGCAGCAA ACAGCATCCA CCGCCGCGAG TAACAATATA TCCTATTCAG CGGAGCGTAT
TATCGGAAAC GGGTCGTTTG GTGTAGTTTT CGAAGCAAAA GTAGTGGGGA CCGGGGAAGT
CGTTGCGATC AAAAAGGTTT TACAGGATAA GAGGTTCAAG AATCGTGAAC TCCAGATTAT
GAAACAGCTA GTCAGAGATC CTCATACCAA TATCGTAGGG CTTAAGCATT GCTTTTACTC
ACAGGTACGT TCGAACTGAG GAGAAATAAC TCTTCATCAT CTCTCCATTA TGTGCTCAAC
CAATTTTCCT TCCACAGGGC GAAAAACCGG ACGAGCTGTA CTTGAATTTG GTGCTTGAAT
TTGTACCGGA AACCGTATAC TCTATCAGTC GAAGACATCA GAAGCATTCG ATGCAACTGC
CACTGATGAG CGTCAAACTC TATCTTTACC AGCTCAGCAG GGCCCTAGCT CATATCCATT
GTTTGGGAAT TTGTCACCGA GATATCAAAC CGCAGAACTT ATTGGTGCAC CCGCAAACTC
AGCAACTAAA ATTATGTGAT TTTGGTTCTG CCAAGGCGCT CATTCAGGGC GAACCTAACG
TATCCTATAT TTGCTCACGA TACTACCGTG CACCGGAACT GATTTTTGGA TCGACGGATT
ACACCACCGC GATTGACATT TGGTCGCAGG GTTGCGTGGG CGCAGAATTA CTGCTTGGAC
AACCCCTATT TCCGGGAGAT TCGGGTGTCG ACCAGCTCGT AGAAATCATC AAGGTACTGG
GGACACCAAC TAAGGAGGAG ATACGATCCA TGAATTCGAA CTATATGGAA TTCAAATTTC
CACAAATCAA AGGTTGTCAG TGGAAAAAGA TTTTTCGTAA CAAGACACCG CAGGACGCCA
TGGACTTTAT CGCGGCGACC TTGGCTTACA CGCCGTCGGA ACGGATCTTG CCGCTCGAAG
GATGCGCGCA CGAATTTTTT GACGAACTGC GACAGGAGTC GACTGTACTG TCAAACGGAG
GCGGCAAGCT CCCGCCTCTA TTTGATTTTA CAACTCACGA GTTAGCAAAA TCGCCCCAAC
TTTTGACAAA GTTAATACCG CCGCATTTGA AAGGATCGTT CGAGATTCCA TCGGTAGAAA
CTGATGACGT CGCTTCCGCA ACAACTCCTA TACCATCGTC ACTGGATCGG AAGCAGGAAG
CTACCCTTCG ATGAGCTTAA TGCACAACAA AAAGACATAA TCAAGCCGTA CCGTACGCAC
TATCTATGTC CATGTCCACT GGCGCTCGTC TGGGCATTGG TCAGCGTGCG GATACGAAGA
GGGACGGTAG AGTCGTCCTT TTCCGTGTCG GCATGGTTTT CCACGAGCAA TCCCACCCAC
AAATTTCCCA AACTTAACGG AAAAACTAAT TCCTTCTGTT TTCCTATA
 
Protein sequence
MSTSFSAALT SSTTTAMEAS ATSPRKGHAN SMVEASGSSA TSAPAANAQS DNTRSGPLSE 
QQQTASTAAS NNISYSAERI IGNGSFGVVF EAKVVGTGEV VAIKKVLQDK RFKNRELQIM
KQLVRDPHTN IVGLKHCFYS QGEKPDELYL NLVLEFVPET VYSISRRHQK HSMQLPLMSV
KLYLYQLSRA LAHIHCLGIC HRDIKPQNLL VHPQTQQLKL CDFGSAKALI QGEPNVSYIC
SRYYRAPELI FGSTDYTTAI DIWSQGCVGA ELLLGQPLFP GDSGVDQLVE IIKVLGTPTK
EEIRSMNSNY MEFKFPQIKG CQWKKIFRNK TPQDAMDFIA ATLAYTPSER ILPLEGCAHE
FFDELRQEST VLSNGGGKLP PLFDFTTHEL AKSPQLLTKL IPPHLKGSFE IPSVETDDVA
SATTPIPSSL DRKQEATLR
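
Both sequences are printed in space-separated blocks of 10 residues, so they need to be joined before they can be measured or passed to other tools. The sketch below is one way to do that. The function names join_blocks and gc_percent are hypothetical, and the GC calculation (G plus C over all bases) is an assumed definition that may differ from the one used to produce the GC content field.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Assumed definition: (G + C) / total length * 100.
    """
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the protein sequence, as printed in the record.
tail = join_blocks("SATTPIPSSL DRKQEATLR")
print(len(tail))

# Small illustrative nucleotide string (not taken from the record).
print(gc_percent(join_blocks("ATGC ATGC")))
```

Applying join_blocks to the full gene and protein sequences and taking len() of each result gives values that can be compared with the Gene Length and Protein Length fields.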