Gene PHATRDRAFT_43494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43494 
Symbol 
ID7197546 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp626125 
End bp627660 
Gene Length1536 bp 
Protein Length456 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177971 
Protein GI219112439 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0119061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGATCAATCG CGTTTTTTCG GCTTCAAAAT GAAAGTTGCT TTTGCCTTCC TATGCGCCTT 
CGCAGCGAAA GCCTCCGCGG ACAATTTCCT TGAAGGTAAG TGATGGTAAT CTAGGAAGTT
TTCTCAGGAA AGGGATAAGT TGTTACAGTT AACTGTCCAT GCCGATCGCA GACTGGAAGA
ATAAGGGAAA CCTCGCACCT TCTGACACCT GCAACATCCC TTCTATCTGC AGCTGGACTT
CGAAAGCTCG AGGGAGCAAC CGAAAAGTGC TCCGATAAGA TAGTGAATTT CGAGGGTTTC
TCTGCGGGAG AGAAAGTCAC TACTGAGAAG TTGAAAAGCT TGGGATTTAA AACGCTCTCC
ATCAAGGTGG AAGGCAAGGG AAAGTGCGTC GACGGCGAAG CTCGAATCTT CAACTCCGGC
AGTGTCACCT GCGGCGACAC TGACTTGTCC TCTAGCGATG GCATGGTGGC CATTATTCAG
GAGAAGAATG AGAACATCTG CGCTCCCAAT GATTGTGCCG CTGGAGGCAC AATGACTTTT
GAGTGGACTA ACAAGGTTCA AATCAAGTCC ATCCGTCTCT TGGACAATGA CCAGCCCGTC
AAGGTCGTTC TTACAACGTC CACTGGTACG ATTACAAAAC CACTGGTTCT TGTTACAGAT
AAATCCAAGG ATGGGAAACA CGAAACTTTT TCCATCGGAG TAGATGATGT CTCGAAGATG
GATGTGGTTA TGAACGGATC TGGTGCCGTG GCAGAAGTGG TGTACAGAAC TTGTGGAACT
GGCGCTAGCG GAGACCCGCA TTTTAGCACC TGGACTGGAC ACAAGTTCGA CTACCACGGG
CAGTGCGACC TTGTGCTTGT CAAGGCCCCG GTTTTTGAAG GTAAAGGCCT CGACATTCAC
GTACGCACCG AGCAGCGCTA CTTCTACTCC TTCATCAAAA CAATCGCGAT GAAGATTGGA
GATGATTTGC TGGAGTTTGG ATTCAATCAA GTGCTCCTGA ACGGTGCCAC TGCCAACAAA
GGCTTGACTT CGGGAAGCAG CTTCGACTTC GCGGGATATC CTGTGTCGTT CCACGACGAG
CCCATGCCCA ACGGACGTCC CCGTAAGCTA TACCGTGTTC AGACCCCGCA CGAGGCTATC
GTCATCAAGG TTTTCAAGCA GCTGATGGCT ATTGAAATTG AAGATGCGTC TCACGTTAAT
TTTGCAGATG CCGTTGGTAT CACGGGCGAC TACAACTCTG GTCTGATGCT GGGCCGTGAC
GGTGTCACCA TATTGCCGGA TCCCAGCGAC TTTGGTCCGG AGTGGCAAGT CACCAGCGAC
GACCCTAGCT TGTTCAGTTC CGTGCAGGCT CCCCAGTTCC CCGAGAAGTG CTGGGAAGCC
CCGGCTATTG ACAAGGTCCG CCATTTGCGC AACGGAGTGT CACAGGCTCA AGCGGAAGAG
GCCTGTGCGA TCTTGGGAGA AGACGCTGAT ATTGAAGACT GTGTGTTTGA CATTATGGCC
ACCGGAGATA TCGAGATGGT CGGCGCGCAC CTCTAA
 
Protein sequence
MKVAFAFLCA FAAKASADNF LEAGLRKLEG ATEKCSDKIV NFEGFSAGEK VTTEKLKSLG 
FKTLSIKVEG KGKCVDGEAR IFNSGSVTCG DTDLSSSDGM VAIIQEKNEN ICAPNDCAAG
GTMTFEWTNK VQIKSIRLLD NDQPVKVVLT TSTGTITKPL VLVTDKSKDG KHETFSIGVD
DVSKMDVVMN GSGAVAEVVY RTCGTGASGD PHFSTWTGHK FDYHGQCDLV LVKAPVFEGK
GLDIHVRTEQ RYFYSFIKTI AMKIGDDLLE FGFNQVLLNG ATANKGLTSG SSFDFAGYPV
SFHDEPMPNG RPRKLYRVQT PHEAIVIKVF KQLMAIEIED ASHVNFADAV GITGDYNSGL
MLGRDGVTIL PDPSDFGPEW QVTSDDPSLF SSVQAPQFPE KCWEAPAIDK VRHLRNGVSQ
AQAEEACAIL GEDADIEDCV FDIMATGDIE MVGAHL