Gene PHATRDRAFT_46220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_46220 
Symbol 
ID7201187 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp645451 
End bp646638 
Gene Length1188 bp 
Protein Length356 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180477 
Protein GI219119433 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0715961 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGTCCCTT GGAAGCCAGT ATAGTGTACA CAGCTTGGTT TTTTGCCCAA GCGAGGAACG 
ATGGGAAGCG TATCCGGCAA GAACGATAAG AGTCACGCCG TATGTGCCAG TGACCAATAC
ATTCGGCGTG CCCATCACGC TGGTTCGTGG TACGAAAACG ATCCCGTTAC CCTCCGTACA
ACGTTGCAGC AGTACCTTGA TCACGCTGCC TCGGACAGCA GTACCAAGAA CGGCCGTGTT
AATACAAGCG GTGCCGATGG GCGAATATTT CTGCGAGGTT TGATCGTCCC ACACGCGGGC
TATTCGTATT CCGGACCGAC TGCGGCCTAC GCATATCAAC CCCTTTTCCA GGAACTATCG
CGAGTCGACT GTCCCATTCA AATTTTGCTC GTGTTGCACC CATCCCATCA TGTTTACCTA
GACGGATGTG CTATATCCAA CTCCCACACA ATCAACACAC CGGTAGGGAA CTTAGCGACC
GACGATGGGA TTAGGGAAGA ATTACTCCTC CTCAACCACA ACAATAAATC TATTTTTACC
GTCATGTCAC AAAAGGAAGA CGAGGAGGAA CATTCTGGTG AAATGCAGTA TCCGTATATA
GCCCACATTC TTCAAGCATG TGGAAAACTG CACAACAATG GCAGTAACAA ACCAATTCGT
GTGCTGCCAA TCATGTGCGG AGCTCTATCG AACCAACAAG AAGCAAGCTA CGGGCACTTG
TTGCAACGTG TTATTGCTCG AGAGGATGTT TTGACAATCG TTTCGAGCGA TTTTTGTCAT
TGGGGTCCAC GGTTCCGGTA CCAACCAATT CCTACCAAAG AAAAAAGTTA CAAAGATTCG
ATGCCTCTTC ACGAATTTAT CAAATCCCTG GATCGCCAGG GCATGGATGC CATCGAGGCG
CAGCAACCGG GGGCGTTTGC AAATTACTTG GCACGCACAC GCAACACCAT TTGCGGCCGA
CACGCCATTG CCGTATGGAT GCAAGCCATT GCTGCATCCG AAACTACTAT TGGCAACAAG
GACGACACCG ATCCAACCGG TGAATTGCTG CGAGTGCGAT TTGTGCGCTA TGCACAGAGC
AGCCCTGCTG AAAGCCTACG GGATAACAGC GTGAGTTATG CCGCAGCTTT AGCCACAAGG
ACAATTGCAG CAAAACCAAA TGATGAGTCT GCTCTTTATG CTCTTTAA
 
Protein sequence
MGSVSGKNDK SHAVCASDQY IRRAHHAGSW YENDPVTLRT TLQQYLDHAA SDSSLIVPHA 
GYSYSGPTAA YAYQPLFQEL SRVDCPIQIL LVLHPSHHVY LDGCAISNSH TINTPVGNLA
TDDGIREELL LLNHNNKSIF TVMSQKEDEE EHSGEMQYPY IAHILQACGK LHNNGSNKPI
RVLPIMCGAL SNQQEASYGH LLQRVIARED VLTIVSSDFC HWGPRFRYQP IPTKEKSYKD
SMPLHEFIKS LDRQGMDAIE AQQPGAFANY LARTRNTICG RHAIAVWMQA IAASETTIGN
KDDTDPTGEL LRVRFVRYAQ SSPAESLRDN SVSYAAALAT RTIAAKPNDE SALYAL