Gene PHATRDRAFT_54883 details


Gene Information

Locus tag: PHATRDRAFT_54883
Symbol:
ID: 7203722
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011685
Strand:
Start bp: 11991
End bp: 15092
Gene Length: 3102 bp
Protein Length: 980 aa
Translation table:
GC content: 60%
IMG OID:
Product: predicted protein
Protein accession: XP_002182883
Protein GI: 219125219
COG category:
COG ID:
TIGRFAM ID:
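
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the listed gene length is consistent with it). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 11991
END_BP = 15092
GENE_LENGTH_BP = 3102
PROTEIN_LENGTH_AA = 980


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def min_coding_length(protein_length_aa: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_length_aa + 1)


span = span_length(START_BP, END_BP)
coding = min_coding_length(PROTEIN_LENGTH_AA)

# True if the coordinates agree with the listed gene length.
print(span == GENE_LENGTH_BP)
# Bases in the span beyond the minimal coding length (159 here).
print(span - coding)
```

The surplus of 159 bases is compatible with the gene being marked as spliced, but the record itself does not say how those bases are divided between introns and other non-coding sequence.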


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0342548
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAACTGTGAT CTCCATCGTC GTTTTGTGTT CTTCAATAGA ACTACGAAGC AACCCCACCG 
TCTTTACGCT CCTAACCCCG CCCTACCGCC CATCCGCAAC CTCTTCGGTC CCGATGTCGA
CCTCGGCTCA TTTCAAACTG AGCGACTTTC CTCACAAAGT CCTCGACCCG ATCGCCACCC
TCACCGTCCC ACCGACCTAC GCGACCATCA AGCGTGCCCA ACGCCAGCTC ATGACTAACG
CCGCCGCCAT TCCCACACTC AACGGTGGTG GCGCCCATGG CCATATGGCC CTGACCTTGA
CCGCCCTTGC CTACGCCGAC ATCAGCAACG TCCCGTTCGT CATTCCCGTC GCCCCTCCGG
CCAATCCGCC TCCTGGTGCC ACGCAACCGC AAATCACCGA AAACAACCGC ATTCATCAAC
ACGATGCCGA CATCTACAAC CTTTATGTCG CCGTCAACAA CGCGCTTCGC CAGCAACTTC
TCGACGCGGT TCCCCGCATT TATGTCCGCG CCCTCGCCCA TCCCATGTTC GAGTTTAGCA
ACGTCACGTG CCTCGACTTG CTCTCGCACC TCTGGACCAA ATACGGTACC ATCAAGCCCG
CCGAGCTCCA GAAAAATTTC CAGTCCATGT ACACCCCTTG GAACACGACC GAGCCGCTTG
AATCAGTTTT TCTTCAGCTC GACGAGGCCA TCGCTTTCTC TGTTGACGGT AACGACCCCA
TCTCGGAAGC TGCTGCTGTT CGCGCAGGCT ACGAAGTCAT TGCGCACTCG GGCCTGCTCC
CCCTGGACTG CAAAGAATGG CGCAAATTGC CTACTGCTGC TCACACCCTT GCCCATTTCC
AGCAGCACTT TTCCCTGGCC GACGAAGACC GGCGCCTCAC GGCAACCACC GGTTCCCTCG
GATACGCCAA CGTGCTTGCT GCTGCCCCCT CTCTCGCTCT TGCCACGACC TCCGACACTC
TTAGCCTTCC TTTCTCCGCG CTCTCTGTGT CCCAGACTTC TGTCTCTTCG CCGGACATGA
CCTACTGCTG GACCCATGGT ACCAGCAAAA ACCGACGCCA TACAAGCGCC ACGTGCAAGA
ACAAGGCCCC TGGCCACCGC GACGACGCGA CCGCCACCAA CACTCTCGGC GGCTCCACCA
AGGTTTGGAC GGCTCCCAAG CCCCCTGAAT AGGAAAGAGG GACGGCTACG CCGATGGTTA
ACTCTAGTAA TACCGATTAT TTAAATCATA TTACTAGTCT TAATTCATCT GTAGTCCCCT
CCCCGCCTAG TCCCCATACC TCGGCCATTG CCGACACCGG TTGCACCGGC CATTACATCA
CCATCAACTG CCCCCACACC GACAAACGTC CTGCGAATCC CAGCCTTGCC GTCCGTGTCC
CTAACGGCGC CGTCCTCCGC TCAAGCCACA TTGCCACCCT GGCCCTCCCT GGCTTCTCCC
CTTCCGCTTG CCAGGCCCAC ATCTTCCCCG GGCTTGCCTC GCACCCACTC ATTTCGATTG
GGCAACTTTC CGACGACGGC TGCACTGCCA CTTTCTCAGC CACTAGCCTT GAGATCCACC
GCGACACCAC ACTACTCCTC TCCGGCACTC GTGCACCCAC AACCGGCCTC TGGCACCTCG
ATCTTACCCC CGCCAAGCCT CCCAACACGG CCCATGCGCT TGTTCCGCAC ACACCCCTTG
CCGACCGCAT CGCTTTTGTC CATGCCTCGC TCTTCTCCCC GGCTCTCTCC ACATGGTGCC
AGGCCCTCGA CTCCGGCCAC CTTGCAACTT TCCCCGACCT TTCCTCCCGC CAAATCCGCA
AGTATCCACC TAGTTCCCCT GCCATGGTCA AAGGTCACCT TGACCAACAA CGCGCCAACC
TTCGCTCCAC CAAGCTTCCC CCTGTCTGTC CCCCCACCAC GACGGAACCC CCAGCCGCCG
CTGTGCCCGA CTTTGATCCT CCCGACGCCC ACCCTATCGC ACGCACACAC CATGTCTTTG
TTGCCCACCA ACGGGTCACC GGTCAAATCT ACACGGACCA ACCGGGCCGT TTCCTCACTC
CCTCAAGTGC CGGACACAAC GACATGCTTG TGCTCTACGA TTTTGATAGC AATGCCATCC
ATGTCGAGGT CATGAAGAAC AAGTCCGGCC CCGAGATTCT TGCCGCCTAC AAACGCGCAC
ACTCTCTCTT TACCCAACGC GGCCTCCGTC CCCAGCTCCA ACGCCTCGAC AACGAAGCCT
CTACAGCCCT CCAATCCTTC ATGACCTCGG AACACGTCGA CTTTCAGCTG GCACCTCCCC
ATCTGCACCG TCGTAATGCC GCCGAACGGG CCATACGTAC CTTCAAAAAC CACTTTATTG
CTGGCCTCTG TACCACTAAC CCGGATTTTC CCCTCCATCT TTGGGACCAC CTCCTCCCAC
AGGCCCTTAT CACCCTAAAT CTTCTTCGTC GCTCCCGCAT CAATCCCAAG CTGTCCGCCC
ACGCCCAGCT TCATGGTGCT TTCGATTACA ACCGCACCCC GCTTACTCCT CCCGGGACTC
GCGTCCTAGT CCACGTCAAG CCGTCCGTCC GCGAAACTTG GGCCCCCCAT GCTGTCGAAG
GTTGGTACCT CGGCCCCGCC CTGAACCATT ACCGTTGCCA CCGCGTCTGG ATCACGGAAA
CACGTGCCGA ACGTGTTGCT GACACCCTTT CCTGGTTCCC GACCCGCATT CCCATGCCTG
CCGCTTCGTC CACCGACCGC GCCCTGGCCG CCGCCCGTGA CCTAGTCCAT GCCCTCCAGA
ATCCTTCCCC TGCGTCTCCG TTCGCCCCCC TCGATGCCAC CCAGCACAAG GCACTCACCG
ACCTTGCCAA TCTCTTTGCC ACCGTGGCCG CCCCGGCCGC CGACGTCCCT GCACCTGAAC
CTGTGCCTCC GGTCCGTCCT CCTACCCCAG CACCTCCCCC TGCTCAGGTC CGTTTTGCCG
TTCCTCTTGT CACGGCCGAA CATGCCCCTG CACTTCCGAG GGTGCCCATT CCGGCCACAG
CACTTCCGAG GGTGCCCACC ACGGCCACCT ATCACTCTCG CACCCGCAAC CCCGGCCGCC
GCCGCCGCAC AGCACGCAAC CAACCGGTAA CCCCAACCCT AG
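
The gene sequence is printed in blocks of ten bases, six blocks per line, so the spaces and line breaks have to be removed before any computation. The sketch below shows one way to do that and to recompute a GC fraction that could be compared with the GC content listed in the table. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks that group bases into blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in seq; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used as sample input.
first_line = "CAACTGTGAT CTCCATCGTC GTTTTGTGTT CTTCAATAGA ACTACGAAGC AACCCCACCG "
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on that line
print(round(gc_fraction(seq), 2))  # GC fraction of that line only
```

To check the whole gene, the full block of sequence lines would be passed to `clean_sequence` as a single string; the cleaned length should then match the gene length given in the table.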
 
Protein sequence
MSTSAHFKLS DFPHKVLDPI ATLTVPPTYA TIKRAQRQLM TNAAAIPTLN GGGAHGHMAL 
TLTALAYADI SNVPFVIPVA PPANPPPGAT QPQITENNRI HQHDADIYNL YVAVNNALRQ
QLLDAVPRIY VRALAHPMFE FSNVTCLDLL SHLWTKYGTI KPAELQKNFQ SMYTPWNTTE
PLESVFLQLD EAIAFSVDGN DPISEAAAVR AGYEVIAHSG LLPLDCKEWR KLPTAAHTLA
HFQQHFSLAD EDRRLTATTG SLGYANVLAA APSLALATTS DTLSLPFSAL SVSQTSVSSP
DMTYCWTHGT SKNRRHTSAT CKNKAPGHRD DATATNTLGG STKERGTATP MVNSSNTDYL
NHITSLNSSV VPSPPSPHTS AIADTGCTGH YITINCPHTD KRPANPSLAV RVPNGAVLRS
SHIATLALPG FSPSACQAHI FPGLASHPLI SIGQLSDDGC TATFSATSLE IHRDTTLLLS
GTRAPTTGLW HLDLTPAKPP NTAHALVPHT PLADRIAFVH ASLFSPALST WCQALDSGHL
ATFPDLSSRQ IRKYPPSSPA MVKGHLDQQR ANLRSTKLPP VCPPTTTEPP AAAVPDFDPP
DAHPIARTHH VFVAHQRVTG QIYTDQPGRF LTPSSAGHND MLVLYDFDSN AIHVEVMKNK
SGPEILAAYK RAHSLFTQRG LRPQLQRLDN EASTALQSFM TSEHVDFQLA PPHLHRRNAA
ERAIRTFKNH FIAGLCTTNP DFPLHLWDHL LPQALITLNL LRRSRINPKL SAHAQLHGAF
DYNRTPLTPP GTRVLVHVKP SVRETWAPHA VEGWYLGPAL NHYRCHRVWI TETRAERVAD
TLSWFPTRIP MPAASSTDRA LAAARDLVHA LQNPSPASPF APLDATQHKA LTDLANLFAT
VAAPAADVPA PEPVPPVRPP TPAPPPAQVR FAVPLVTAEH APALPRVPSE GAHHGHLSLS
HPQPRPPPPH STQPTGNPNP