Gene PHATRDRAFT_42307 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42307 
Symbol 
ID7198187 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011691 
Strand
Start bp45215 
End bp46930 
Gene Length1716 bp 
Protein Length453 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002184291 
Protein GI219128169 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000154334 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTGCGG ACGCTACGAC GGAGGCCCAC GGCCAAGGTC CGCAACAACC CCATCTGGCC 
AAACAAGACG CGGCAACGGT AGATCCCGCA CAACTCACAG CCCTGTCTCC CGAAGTGGTA
CGTACCGGTA CGGTATAACG CAGCGTAAGG TTGTTTGCGT TCGAGGGATG CTGTTGGTAG
GGTTGGATCA GACTGGGTAG GGTTGGGTAG GAAACTATGT TCGTCTCGTC TCTAGCGACG
TATTGTTACT GTCTGTTATT GCGAAATGAA TCGAGGCATA GCGACTGCAC ACAGTCTCAC
TTTAATATAT CTCTCTATCT GCTTTGTTAG ATCTCTCGCC AGGCCACCAT TAATGTGGGA
ACCATTGGAC ACGTGGCGCA CGGCAAGTCC ACCGTTGTCA AGGCCATTTC CGGAGTGCAA
ACGGTCCGCT TCAAGAACGA ACTCGAGCGG AACATTACCA TCAAACTCGG TTACGCCAAC
GCCAAGATAT ACCAAGGACA GCCCGTGGTA TCGGAGGAGA ACCTCCACGA TAACGAAGAC
GCCAGCACCA CAACCGCGGA GACTCCTCTG GACGCGGACG GCACCGTCCA CAACACCACC
ATTGCCACAG GGCCCCTCTA CACTTCCCGA GGCTCTTCAC ACGCCGATAT CTTCACCGAA
GGAGGTCGAA CCTACCATCT GCGCCGTCAC GTGTCCTTCG TCGATTGCCC CGGACACGAC
ATTCTCATGG CGACCATGCT GAACGGGGCC GCCGTCATGG ACGCCGCCCT CCTGCTCATT
GCCGGAAACG AGACTTGCCC CCAACCGCAG ACTTCCGAAC ATTTGGCCGC CGTAGAAATC
ATGCGCCTCG AACATATCCT CATTCTGCAA AACAAGGTCG ATCTCGTCAA ACCCGATGCC
GCCGTCGCGC AACAGGAACA GATTCGCAAG TTCGTCGCTG GAACCGTGGC CGACGCCGCA
CCCATTCTCC CCATTTCCGC CGTACTCCGA TACAATATGG ACGTACTCTG TGAATACCTC
ATTCGACGAA TCCCTCTACC GGTACGGGAC TTTACCTCCC AACCCCGCCT CATTGTTATT
CGATCATTCG ACGTCAACCG CCCCGGACAA GACGTTTCCA AACTACAAGG CGGCGTCGCT
GGAGGAAGTA TTCTACAGGG TGTCCTCCGT GTTGGTGACG AAATCGAAGT CCGACCCGGA
ATCGTACACA AGCAGGACGA TAAAATTGTT TGCGTACCCA TTTTCAGCAA GATTTCCTCC
CTTTACGCCG AACAGAACGA TCTCCAGTTT GCCGTCCCCG GAGGCTTGAT CGGCGTCGGG
ACCAAAATCG ATCCCACCCT TACCCGCGCT GATCGTCTCG TCGGACAAGT CCTCGGTCTC
AAGGGACAGC TGCCGGATGT CTTTTCAGAA ATCGAAATCT CCTACTATCT GCTTCGCCGA
TTACTCGGAG TCAAAACATC CGACGGTGGC AAACAGGCCA AGGTACAGAA ACTCACCAAG
AACGAAATTC TCATGGTGAA CATTGGTTCC ACCGCCACCG GTGGACGAGT GTCCGCCGTC
AAGGGAGAAT TGGCCAAAAT AACCCTCACA CAACCCGTGT GTACCGAAGA AGGTGAGAAG
ATTGCCCTCT CCCGACGCGT CGACAAGCAC TGGCGGTTGA TTGGTTGGGG ACAAATTCGC
AAGGGAAACG TTGTGGAAAT AGCCGAGTCG GCGTAA
 
Protein sequence
MRADATTEAH GQGPQQPHLA KQDAATVDPA QLTALSPEVI SRQATINVGT IGHVAHGKST 
VVKAISGVQT VRFKNELERN ITIKLGPLYT SRGSSHADIF TEGGRTYHLR RHVSFVDCPG
HDILMATMLN GAAVMDAALL LIAGNETCPQ PQTSEHLAAV EIMRLEHILI LQNKVDLVKP
DAAVAQQEQI RKFVAGTVAD AAPILPISAV LRYNMDVLCE YLIRRIPLPV RDFTSQPRLI
VIRSFDVNRP GQDVSKLQGG VAGGSILQGV LRVGDEIEVR PGIVHKQDDK IVCVPIFSKI
SSLYAEQNDL QFAVPGGLIG VGTKIDPTLT RADRLVGQVL GLKGQLPDVF SEIEISYYLL
RRLLGVKTSD GGKQAKVQKL TKNEILMVNI GSTATGGRVS AVKGELAKIT LTQPVCTEEG
EKIALSRRVD KHWRLIGWGQ IRKGNVVEIA ESA