Gene PHATRDRAFT_40874 details


Gene Information

Locus tag: PHATRDRAFT_40874
Symbol:
ID: 7198718
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011694
Strand:
Start bp: 162523
End bp: 164056
Gene Length: 1534 bp
Protein Length: 497 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002184834
Protein GI: 219129309
COG category:
COG ID:
TIGRFAM ID:
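
The Gene Length value agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the arithmetic, using a hypothetical helper name:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1

# Start bp and End bp as listed in this record.
gene_length = span_length(162523, 164056)
print(gene_length)  # 1534

# Coding bases implied by the protein length: 497 codons plus one stop codon.
coding_bp = 3 * (497 + 1)
print(coding_bp)  # 1494
```

The 40 bp difference between the two numbers is consistent with the record marking the gene as spliced, although the record itself does not list intron positions.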


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00530628
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGCAT CCCAAAAAGT CATAGCACAT TATGTATCGA AGCCCAACAC ACTTGCAAGG 
TTTGAAATCG CACGGCCACA CAAAGATAGT TTCACAGCAG ATTCAATTTA CAGGGTTGCA
AAGCCTTATC GTCGTAGCGC TGCTGCTAAA GTTTTCCAGG GATCTATCAT GAAAAGTAGC
TTGCTCGTCG AGCGTTCTTC GAGCAACAAA CTTGAGTCGA TCGCGAACGC TTCCTTTCGG
CGTACCAATA AACGTTCCGA TCGCAACAAA AGCATTGACA CGGAATTGAC CACAGATTTG
TCCTCCTCAG GTGCAAGCAA CAGTGTCAGT AGCAGCAGTG ACGACGTTGC AGTCTCCTCT
TCGTCTGAGG CTTCCTCTTC TGTGTCTTCT AATGTAAGCG GCTTCGACTC CAGTGAGAGC
GATTCTCAAA GCGAAGGAAG CTACAACAAA GATGACGATT CGACGGAAGC ACTCATAGAT
GACGAAAGTC TCTGCGATTC TGACTATGAC CGTCTAAATT CATCCGATAG CTCCCTCGAA
CTATTGATTC CGTCGACAAA GACGCAAGTA CTACCGGATG ACACACCGCA AGCAAACGAC
TCCTTGTCGC ACATTCCGTC GCTTCTGGAC AACGTATGTG GCGATGGAAG CGACGACGAA
GAGGACTTGT TCATCGATAT TCCTACTCCT TGTGAAGCGC AGTTGACGGA CATGCCTGAT
CAAATACCTA CGAGCAGTAA CGTCTGGACC GACGATGATG CCTCGTCGGG GGGTGATACT
TTTGCGTCGG AAGCCAAATC GGTAGAGTCT TTTATTAGCA TGGCTTCTAA ACGCAGCGTA
GACAGCGCTG ACCCTCCCCC TCCGCCTCGG CGCTCAAGAA AACCTCGGAA AAAGCAACGC
AAAAGAGTAA TCAACATTGA CGACAACGAA GATGCCTGTA AGAACTACGA TCTTGACAGC
TCTACCCACA GCACCAACGA AGCTCTCAGC AAAAGCCAGC GAATCAAAGA TGAAATGGAC
GCTTCCATTC ACAGCATCCA AGAAAGTCAA GCAGCCATTG AAGCTTTGGA AAAAGGTGTT
GAACAAGATT TTGAAGACGT TCAGAAAACG ATGAAGGATA TAGAATTGCA ATCTGTAGAG
GGCTCCGAAA GCTGGCGATG GAGCCCAATC TGTGATTCAA AAAAAAATGG CAGAAAAACA
AGCTCTACGC AAAGCTCGAA TGGAGAAAGT TCGCGCCCGA ATTGCTCGCG AAAAAGAAGA
AAAAGACAAG GCCGAACAAG AGGAGAAAGA CAAACAAAAA CTGAAGGATT CTCAAGCTGC
CAAACTTGAC TTCTCTGAAG AATCTCGTCG TGAACGGGCA TACACTTGGT ATACAAGGTC
GGGAATGCCC AATAAGAAAA ATTACAAAGA GCGTATCCAA GCAATGCCAC TATCATCTGG
CATCACAGAA GAAGACATAG ATCTATTGCC CTGGAATGCT CGCAATGATA TGGTCAACGT
TGCAAAAATG AATGCATACT TGTTTAAGCG CTAG
 
Protein sequence
MTASQKVIAH YVSKPNTLAR FEIARPHKDS FTADSIYRVA KPYRRSAAAK VFQGSIMKSS 
LLVERSSSNK LESIANASFR RTNKRSDRNK SIDTELTTDL SSSGASNSVS SSSDDVAVSS
SSEASSSVSS NVSGFDSSES DSQSEGSYNK DDDSTEALID DESLCDSDYD RLNSSDSSLE
LLIPSTKTQV LPDDTPQAND SLSHIPSLLD NVCGDGSDDE EDLFIDIPTP CEAQLTDMPD
QIPTSSNVWT DDDASSGGDT FASEAKSVES FISMASKRSV DSADPPPPPR RSRKPRKKQR
KRVINIDDNE DACKNYDLDS STHSTNEALS KSQRIKDEMD ASIHSIQESQ AAIEALEKGV
EQDFEDRAPK AGDGAQSVIQ KKMAEKQALR KARMEKVRAR IAREKEEKDK AEQEEKDKQK
LKDSQAAKLD FSEESRRERA YTWYTRSGMP NKKNYKERIQ AMPLSSGITE EDIDLLPWNA
RNDMVNVAKM NAYLFKR
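
Both sequences above are printed in space-separated groups of ten characters, so the grouping has to be removed before lengths or base composition can be checked. The following Python sketch shows one way to do that. The function names are hypothetical. The record does not say how its GC content field was computed, so this is not claimed to reproduce that value.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one uppercase string."""
    return "".join(block.split()).upper()

def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string; 0.0 for empty input."""
    seq = clean_sequence(dna)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)

# Final line of the protein sequence block in this record.
last_protein_line = "RNDMVNVAKM NAYLFKR"
print(len(clean_sequence(last_protein_line)))  # 17
```

Passing a whole block, line breaks included, to `clean_sequence` and taking `len` of the result gives a count that can be compared against the Gene Length and Protein Length fields.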