Gene PHATRDRAFT_36284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36284 
Symbol 
ID7201897 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp65431 
End bp68358 
Gene Length2928 bp 
Protein Length924 aa 
Translation table 
GC content54% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180746 
Protein GI219119995 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0182118 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGACGAG GACAACGACA ACTCCCCCCC CAAAAGGCCT GGTTTATCAT TCCTTCCTTG 
TCTCGTACGT AGTAAGAAGA ACAACTGTAC TCAACAACCA GAGATACAAT TCTTCGTAGA
ATGGAGGATC CTTCGCTTCG CACGACTGTC CGTGTCGGGA ACGACTATCG CGTGAACGAC
GACGTCGACC TGGACGAAAA AGACGGAGGG GACGCTCACC TCCGCGACGA TGACGACGAC
TCCACAAACA GCCCTCGTCG TCGCGGTGTA CCGCCGTTAC TCCCCGCGGG TCGTCGTCGG
ATGATTCTGA CCGACGAAGC GGACGAAGAG ACGTCGTCTA CCACTGTATC GAGCCTCACA
ACCAAAGTCG GGTCATCCTC GACACGACAC CAGCCCCCCC GACCTTCCGT CCGTTCCGTC
CGACACAGCG TACTCCCCAC CCACGCGCCG GGACGTCGCC GACGTCGCTC GTCGGCACGC
TTTTTGCGTC TCAGTGGTCA GCATCATTCC CGTCACAGTC GCAACGATGG CACAATGACA
ACGACGACAT CTTCGTCCGC GGCACAACTC GGCGAACTCT ACAAACAAGC CATTCGAATG
AACGCCGAGA ATAGGATCAA CGCCAGCAAT AGTTGGAATC TAGCACTCAT TGAGAATATT
GATCAATTCC TGCTCCTTGA AGAAGAAGAA GAAGAAGAAC AGCACGAGGA CCTCCCTCGC
GATCACCGCC GTCCCGAAAA CGACAAGAAT CGACTCACTC TCAACAACGC GCAACCCACG
CCAACGCCAC GCCGTCAGCG CGTTAACTTT ACCAAGGCAT CTTGTACACT TGACGCATCG
GTCAAGATTT ATTCCTACCG GGTCGACGAC GTACACCTTT CCAGTTACAA GGTGCTCGCC
AATCTCAACC GCAACGACCA AAACGCCAAC CACAAGAACG CCGATAGTGA CAAAGACAAA
AACACCCATC CGGATCCCGA TCACCACAAT GCTGGCAATC ACAAAAAATC CACTCACTCG
TCCCACGCAT CCACCTTGGA AACCAATATG GGTGCGTAGC CAACCTTCCT GTAGATGTAC
GCCCTTTGTC CCCCTCTCAC ACCTTGCTAC TTTCCCATTC ATTCGTTCCA TCCACATATC
CATTCTTCTT GCGCCTACGG AACAGCCAAC ATTAACCTCA ACAAGCTCGA TGCCGCATTC
GATATTGATC CACTCTTTCA CAAAATGTCC AAGACTTTTG ATGAAGGGGG TGCCAAGGGA
CTGCTCCTCG CAAATCTGGG CGTCAGCAGT CACGGTTGCA ACGTCGTCTT TGACAGTACC
AGTAATGACT CCAATCCAGT CGCCGAAGAA AAAACAGAAG ATGACCGTCA CGTTAACGAG
ACAACTTACG CTCCCGTTGA CATTTCCTCA CTTCGCGCCA AACTCGAAGC TCTCGTCAAC
ACAAACGACC CCGACGGCAA CACCGGTGGT ACCTTTTTGG AAGATTTGGC GCTCGTCCCG
CAGCTTACAT CCCTCCGTGC CGAACACGAT CGTCTCGCCG CGGAAGGATT CGTTCTCGAC
GACAACCCGA CGATGACGAG CACCAAAAAT CGTGGTTCGC AACGCTACGC CCCCACAGCG
GACGAAGAAA CGCAGGCCGA CCAAAGCATT CACCAAGAAG CCTTGGAACG AAGTCGCCGG
ACCAACAAGT CGTTTCTATC CGAAACGGAC CAAGAATATG AAGAAATCAT CGGTTCGAGT
ACATCACAGC AGCCTCCGCA ATCATTATCG ATTGGATACG ACGACGCCGA CGACTTTGGT
GGTGGTTTTG ACGATGGGGA CGACGATGAT GCGGGTTTTG ACGATTTTCT CCAACGCGAC
CAGCAGGGGG CTCGGTATTC CTCCATATCC TTCTCCGGAT CGGTGCAAAA CTTTCAAGCC
CAGGAAGGCG CATCCGACAC CGACGTACCG ACATCCACCG CATTGTTGGA GGCGTTGCTG
GGGTCGCAAG CGTTGACGGA CCAGGATCAG TACCGTTATT TCGACGCCGA GTTGTTGTCG
TCCGCCGTTC ACGCGAATAA TGCTTGGGCC GGAGCCACGC ACTGGAAACG CACGCCCAAA
GTTGCAACCA CCACGGGACC TTCCGTCGCC AAGACCAAGT CTCAACGCAA GAAACCCCGT
GCTTTGGTTG ATTTGACTGC CACGGCCTGT CTGGACGATG TACTTCGTTC ACCACCGAAG
ACGTCGAGTT TGTCCTGGAG TCAAGCCATT GTGCAAAAGT ACACGAATGC GGAGCATTCC
AACCTGTTGC CTCCCGACGC CGAAATGGAC GTGGAGACTT TGTCGACCCT CTTTTTGCGG
CCTCAGAGTG TCTGTCGTGG TCTATCGGTC GCCGGGGGAG GGGACAAGGT ATCTACGCCC
AAGGCGGTCG GCTTTAATAT GGGTGGCGTC GAAACCTTTG GTTGGGATGA TGGTCACGAT
GACGACGGTG AAGGTGGCGG CTACGACTTT GGTGGTGACG ACGATGACGA TATGAGTTTC
GTTGCACCGC TCGAAGACAT CCGCAAGGTG GACAAGGTTC ACGTGGGCTA CGCCACGGTC
GCCAAAAAGG TGGACGTCAA GCGACTCAAG AAAGACTTGT GGATCGAGCT GGAAGCAAAA
CTGGCCGAGC CAGCCAAGCT CGGCGAACAC AAGGATCATG ACGCGGACGA CAGCTCCATG
TCATTGAGCG ATGCTGTGAC CCCTTCTAAA CCATCACTGC CACTATCCTT TCAAAAGGCC
GTGCAAGACC TGGAAGCCAC CAAAACGCAA GCAGACGTGA CCTTGCCGTT TTACTTTATT
TGTATTTTGC ACTTGGCCAA CGAAAAGGGA CTGCGACTGG ATAGTCACGG TTTGGAAGAT
TTCGGTATTG TCTATGATGC AGCCGGGGTA CCGTTGGCGG GGGTGTAG
 
Protein sequence
MGRGQRQLPP QKAWFIIPSL SHTILRRMED PSLRTTVRVG NDYRVNDDVD LDEKDGGDAH 
LRDDDDDSTN SPRRRGVPPL LPAGRRRMIL TDEADEETSS TTVSSLTTKV GSSSTRHQPP
RPSVRSVRHS VLPTHAPGRR RRRSSARFLR LSGQHHSRHS RNDGTMTTTT SSSAAQLGEL
YKQAIRMNAE NRINASNSWN LALIENIDQF LLLEEEEEEE QHEDLPRDHR RPENDKNRLT
LNNAQPTPTP RRQRVNFTKA SCTLDASVKI YSYRVDDVHL SSYKVLANLN RNDQNANHKN
ADSDKDKNTH PDPDHHNAGN HKKSTHSSHA STLETNMANI NLNKLDAAFD IDPLFHKMSK
TFDEGGAKGL LLANLGVSSH GCNVVFDSTS NDSNPVAEEK TEDDRHVNET TYAPVDISSL
RAKLEALVNT NDPDGNTGGT FLEDLALVPQ LTSLRAEHDR LAAEGFVLDD NPTMTSTKNR
GSQRYAPTAD EETQADQSIH QEALERSRRT NKSFLSETDQ EYEEIIGSST SQQPPQSLSI
GYDDADDFGG GFDDGDDDDA GFDDFLQRDQ QGARYSSISF SGSVQNFQAQ EGASDTDVPT
STALLEALLG SQALTDQDQY RYFDAELLSS AVHANNAWAG ATHWKRTPKV ATTTGPSVAK
TKSQRKKPRA LVDLTATACL DDVLRSPPKT SSLSWSQAIV QKYTNAEHSN LLPPDAEMDV
ETLSTLFLRP QSVCRGLSVA GGGDKVSTPK AVGFNMGGVE TFGWDDGHDD DGEGGGYDFG
GDDDDDMSFV APLEDIRKVD KVHVGYATVA KKVDVKRLKK DLWIELEAKL AEPAKLGEHK
DHDADDSSMS LSDAVTPSKP SLPLSFQKAV QDLEATKTQA DVTLPFYFIC ILHLANEKGL
RLDSHGLEDF GIVYDAAGVP LAGV