Gene PHATRDRAFT_36476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_36476 
Symbol 
ID7201589 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp551376 
End bp553239 
Gene Length1864 bp 
Protein Length520 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181026 
Protein GI219120581 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGCAGC AGAGAGATGC CTTAATGACA GCACTCAATA GTGGATTCAA GAATTTTTTT 
AAAAATGTAT CCATTCCTGA CGAGAATCGG GACCCGCTAG GACGTCCCGT ACACTACGCC
CACGCTTGGG CCGAGTATCG TCGGCAGGCA CGGGAAATTC AACGCCGATT TGGTGGAGAG
ACTGGGGACG TGTTAGGCTT CGGACAAGAT GATTGCGGCC AAATTGGCTA CGTCCTTTCT
AAGGACGAAG ACAAGCCAAC AACCTACTTG CCCTTTGTCA TCAAGTCGCT CGTTCGCAAG
GATGTTCGTC AGATTTCGGC CGGTGGTGTA CATTCGCTTG CCGTCACGGC GGACGGTGAC
GCCTACTCAT GGGGTACTGA CGATGACGGC ACGCTTGGCC GTAAAAACGA AGCAGACACA
GCGATTGATG CCACCACGCC GAGTCCGGTC GTCGGTTTTC GTACGGTGGA CGGAATCAAC
GAAGATAGGC GTAAGTGAAT TTGCCGAAAC GAAACCTGCT TCGTGGGAAT GCGGCACGCA
TCACCTCACT ACGACTTTCG TTCATCATTC AGAAATTGTT CAAGTATGCG CCGGTGCGAG
CCATTCCCTC TTTCTATCCT ACAGTGGAAA TGTATATTCT TCAGGGTACG TTGTCTCGTA
CTGTTTAAGC ATTCCGCCAT AATTTTTCGA TTGTGTTTGA CGCTCACAAT ACCCCAACCC
GGCAGAATGA TGAAAGACAT GGATTCGGGT AAATTCCGTG ACATACAAAC CGTCGAAGAT
GACCCTGCTG GCTATAATGA GAAACCGGTG CATGTGGCTC TGATGCCTAA AAAGGTCACT
TTTATTTCCA CGACGACCGC ATTTTCTGCA GCCATTTTGG AGGATGGAAC AATGGTGACC
TGGGGTGAGT AAAACAAAAG AAACGGTCGG TTCTATGCAA TTCAATCCTC TCTTTTCCTC
ACGTTCTGAG ATTGCTCGAA AGGATTTGGA AACCATGGGG AGCTGGCTCG TACGGCCACC
ATGGGCGCAA AGAAAAACAA GGAAGGAAGA CCCGATTTAG GGCAAGGCTT CTTCTACACG
ACAAAGCAAG AAGACGGGGA CGGCAATGTC CGATTTGTGG CCACTCCTTC GTTGGTTCGT
GAACACTTTT TGACTCCCAA GCCCCCCATC TGGTCCTTTG GGAGTCCACA AAAGAAAGTG
ATCAATGTAG CATGTGGTTC GTATCACCTG TTGGCGGTTG CACGGGAACC TGACGATGCA
AAGTTGCGGG TGTACTCAAG TGGCATCAAC AATTATGGCC AGTTGGGACA AGGTGACTTT
GGAGTAGAGA CCGAGCGCCA TGAGCTCACA ATGGTACGTA AGAAATGCTT TGTTTGGTTA
TGTGCGTAAA CAATATCTCA ATCAATTAGT GGGCTTAATC TTGAAGATTA AGGCATTGGA
AGATGAAAAT ATAGTCAAGG TTGCATGCGG TGAATTTCAT TCCTTGGCTC TCAATCTAAT
TGGTACGAAA GTATTCGCGT TCGGCCGTGC GGACTACGGT CAGCTCGGGA CCAAACTTTT
CGACTTTGGT GAATGCGGGG CAACTCCCGA ACAGGTCGCC TTTCCCAGTG AGGAACGTGT
CATAATAGCA GATATTGATG CTGGAAGTTC TCACTCGATG GCGATTACCA TCGATGACGA
AGTGTATTCT TGGGGCTTCG GAGATGGCAA CACTGGATTT GGCGATGTTC AAAGTGATGT
TGTATATCCA CGAAAGCTAA CACTCACGGC CAAGCAAATC AATGCCAAGG GCCGAGTTCT
TGCCACCAGC GGTGGTGGAC AGCATGGGCT TATGCTAGTC AAGCGATACG CATTTCAGAC
GTAA
 
Protein sequence
MKQQRDALMT ALNSGFKNFF KNVSIPDENR DPLGRPVHYA HAWAEYRRQA REIQRRFGGE 
TGDVLGFGQD DCGQIGYVLS KDEDKPTTYL PFVIKSLVRK DVRQISAGGV HSLAVTADGD
AYSWGTDDDG TLGRKNEADT AIDATTPSPV VGFRTVDGIN EDRQIVQVCA GASHSLFLSY
SGNVYSSGMM KDMDSGKFRD IQTVEDDPAG YNEKPVHVAL MPKKVTFIST TTAFSAAILE
DGTMVTWDCS KGFGNHGELA RTATMGAKKN KEGRPDLGQG FFYTTKQEDG DGNVRFVATP
SLVREHFLTP KPPIWSFGSP QKKVINVACG SYHLLAVARE PDDAKLRVYS SGINNYGQLG
QGDFGVETER HELTMIKALE DENIVKVACG EFHSLALNLI GTKVFAFGRA DYGQLGTKLF
DFGECGATPE QVAFPSEERV IIADIDAGSS HSMAITIDDE VYSWGFGDGN TGFGDVQSDV
VYPRKLTLTA KQINAKGRVL ATSGGGQHGL MLVKRYAFQT