Gene PHATRDRAFT_39571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_39571 
Symbol 
ID7195241 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011688 
Strand
Start bp160215 
End bp161581 
Gene Length1367 bp 
Protein Length404 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002183561 
Protein GI219126642 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGCGAA TTCTACTTTT TGCTATGATT CTTCTGGATG CAGCCAGCCT TGCTCAGCCT 
AATGACGAGG CGCAAATCCA TAAAAGCGTA TTTGAGAAGC GAGAGCTGAT GAATAATTAC
ATAGAACTGA GCATGAGTAT GCCAGTCATT GGGAAGGGCA AGGGTAAAGG TAAAGGAAAG
GGAATCGACT ACAGCCTATC ATCGAAATCC TCAAAGGCCA GCTGGTCCAA ATCGTCCAAG
TCAAAGGGAA AAGGAAGCAG CAAAAAGAGC GATAAAAGTG GAAAGGGTGG TGGCAAAGAA
ATTTCGCTCG TTCCCACAGT TCCGACCCCT ACTACGGCTC CAATTGGTAA GTTGCGCTTG
TCTTACATTA AAAAGGAAAA ACCCAAATTG AAAGATCCTA ACCGACTTAA CATTCAGTGG
CACCTCCTAC TATCACTACA ACCTCCCCGT TTGGTAGTAC AATGAGTGAG TCACGTAAGT
ACAGCGAAGC AGAACCTATT TGCCCGAAAT TTAATAAACT AGATTTTTCA CTAACTGTAC
CTTCCCGTCC ATCAGCAAAT TCGACGCCTG CGCCCACGTT GACACCAGTA TCTATTTCTC
CATTCACGAT TCTGTATACC ATCGAACAAT TCCGCCTCCC TCTCGCAAGC GAGTATGTTT
CGGTCGCAAA TCTCACAGCG AACTATTTGA ATGAGTACTT TCGAGCAAAT TTCCAAGAGA
CGAGTCTTGT AGACTTTGTG ATTGCCGACA CGATGATGAC TGACAACAAC TTTCAGTTTG
GACAGCCTGT CGAGATTGAT TACAGAACAC AGCTTACGTT TGCGTCGGCT TCCTTTATTC
CTTCTACAGA GGAATTCGAC GAGTTGTTGG CCAGTGCGTT CCGCGACGAC AACTTGGCAA
TCTACATTTC TCTTTTGAAC AGTTTGCCAA TCAGCAACAT CTTCCAAACA ACGTCGCTTG
TTACTTTTGA AGGATCTACT TCCGCTACCG TACCCGCGAC TAGTGCAGAC TCCGCCGCAA
GAGCAGCCGG AATCGCGGTA GCAGCCGGTG CTGGAGCCCT TATCTTTATC ATCGCTGGTG
TAGTTATGTA TCGCCGAAAG GAAAGAGAGG AAGTTGGCAA GCGCCTCGAT GAAGATGGGC
AGATGACAGT TGCTGGTGAC ACGTGTGGAG GCTCGTCAAT GGATTCCCAA TCGGTGGTTA
ATCAAACACA TGCAATGAAC GACGCAGATG GCTCCTCGGT ATCGGAATTG GGCGACTTTC
AGGTCGCTCC ATCAAATCAT CCCATTCTCG AAGAAGGAGC TGAAGAGGAA ACGGACGACG
AGTGCGACTT TGAAGAAAGA GGCCAACTTT CGGAAGTCCA ACTGTAA
 
Protein sequence
MMRILLFAMI LLDAASLAQP NDEAQIHKSV FEKRELMNNY IELSMSMPVI GKGKGKGKGK 
GIDYSLSSKS SKASWSKSSK SKGKGSSKKS DKSGKGGGKE ISLVPTVPTP TTAPIVAPPT
ITTTSPFGST MSESPNSTPA PTLTPVSISP FTILYTIEQF RLPLASEYVS VANLTANYLN
EYFRANFQET SLVDFVIADT MMTDNNFQFG QPVEIDYRTQ LTFASASFIP STEEFDELLA
SAFRDDNLAI YISLLNSLPI SNIFQTTSLV TFEGSTSATV PATSADSAAR AAGIAVAAGA
GALIFIIAGV VMYRRKEREE VGKRLDEDGQ MTVAGDTCGG SSMDSQSVVN QTHAMNDADG
SSVSELGDFQ VAPSNHPILE EGAEEETDDE CDFEERGQLS EVQL