Gene PHATRDRAFT_21983 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_21983 
Symbol 
ID7203088 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp300399 
End bp302134 
Gene Length1736 bp 
Protein Length484 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182364 
Protein GI219124130 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.383606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCAGTTCCTA TCCAACAAGT AGTACAAAAT CTGATCCGCG TTCGAAGTGT AAAGATATTC 
CTTAGTTTCA TTATGCAAGG AGGTCAACAA CACAATGCTG CTCAAACGGA GGTGCTCAGT
TTGACGGCCG CGAGTCGAGC CCAGCAAGAA GTCCACCAAG CCATGCTACT GGATCTGGAG
GCTAAGAAAA TTGCGGCTTC CTTGGATGTG CCCACGCTGC CGGAACAAGT GCGCGCCGCT
TTGCGAGAAA TGGGACAGCC CGTCCGTTTG TTCGGAGAAA ATTTGGCCGA TGTGCGCCAG
CGCTTGCGCG AAGCCATGGC GCATCAAAAA GTCTCACTGG ATGCGGCGTC GCTCTTTAAG
GAAGAAGATC TGACTGGGCA ACGCGGAGCA AATGAAAGAT ACGAAGAAGA GGTGACAAAA
TACACCCGTG CGGAGCAGGA GTTGATTGAG GCTCGTCAAG CGATTGCCAA TTTTTCATTG
AAACGGGCGG GGGCGCGACT GGAACGGGAA CGCCGACTCC GCTTGCAAGC GAACCGGCGC
AAGCGTAAAA TTGACGAAAA ACCCGGCACC AGTACGGAAG TGGACGTGCT CGATGAATCC
TGTCACAAAA TGTACCAATC CATTCAGATG ATGGCCCTGC AGGGTTCTCA GTATGGTGAT
AGCCGTGTTG TGAGCTGCAT CAGCGCACAA ACTTTGGATG GGATTCCCGT CGTTGCAACT
GGGGGTTGGA CTGGAAGCGT TCAGTTATGG GATGGAAGTT CCTCCGCGCT TGAGATCTTA
GGGGGCAAGA CCATGTGCCA CGAAGACCGG ATTATGGGCT TGGATACAAT GAAAGTAAAC
GAAGACCTGG CAATTATGGC AACAACGTCC ATCGATTTGA CTGCTAAGTT GTACCGTGTG
CAGAGCGCTC ACACTGTCAT GTTGGACGAT GCAGGAGCTG TCGATAGCAC CGAGCGCTTT
GCCGTAACTG AGCAAGCGGT CTTACACGGT CATCAATCCC GATTATGCCG AGCAGCTTTT
CATCCGATGC AACGACATGT CGCAACCACC AGTTTTGATC ATACCTGGCG TTTATGGGAT
ATTGAAACTA GCCAGAATAT TCTGCTCCAA GATGGTCATT GGAAGGAGTG TTACGGTGTT
GGTTTCCATC CAGACGGCAG TCTATGTGCC ACGACTGATT TCGGCGGAAT CGTACAGGTC
TGGGATTTGA GGACTGGCAA GTCTATTAAA CACTTTCTGG GGCATGCGAA GCGTGTGCTA
AACGCTATTT TTCACCCGAA CGGCTTTCAA TTGGCCACGG CCGGTGACGA CGGTACGATC
AAGATTTGGG ATCTGCGAAG GCGGAAACTG GCTGCATCCT TGCCAGCGCA TTCCAACGTG
GTAACCAAAC TACAGTTTGA TGCGTCCGGT GAATATCTGG CGTCGTCTTC CTATGATGGG
ACGGCACGGT TGTGGGGCTG CCGTGATTGG AAAATGTTGC GCCAGCTGCA GGCCCATGAA
GGCAAGCTAT CGGGAATAGA AATTCTAGGC AGCAACTCCA TTCTAACCTG TGGATTTGAC
AAGACGCTCA AACTGTGGCA GTAAAAGTGG ACTTGTGAAA ATTGAACTGA GATTGTTCTA
GATCCATTCC TATGTTTATT TCAATGTAAA GCAATCTACT GCCAATGAAT CGGATTTAAT
GTAATGTAAA GTAGAAAGCG GCCAGAGGGA GGGGCAATTC CTTTCAGTTA CATAAC
 
Protein sequence
MQGGQQHNAA QTEVLSLTAA SRAQQEVHQA MLLDLEAKKI AASLDVPTLP EQVRAALREM 
GQPVRLFGEN LADVRQRLRE AMAHQKVSLD AASLFKEEDL TGQRGANERY EEEVTKYTRA
EQELIEARQA IANFSLKRAG ARLERERRLR LQANRRKRKI DEKPGTSTEV DVLDESCHKM
YQSIQMMALQ GSQYGDSRVV SCISAQTLDG IPVVATGGWT GSVQLWDGSS SALEILGGKT
MCHEDRIMGL DTMKVNEDLA IMATTSIDLT AKLYRVQSAH TQAVLHGHQS RLCRAAFHPM
QRHVATTSFD HTWRLWDIET SQNILLQDGH WKECYGVGFH PDGSLCATTD FGGIVQVWDL
RTGKSIKHFL GHAKRVLNAI FHPNGFQLAT AGDDGTIKIW DLRRRKLAAS LPAHSNVVTK
LQFDASGEYL ASSSYDGTAR LWGCRDWKML RQLQAHEGKL SGIEILGSNS ILTCGFDKTL
KLWQ