Gene PHATRDRAFT_39760 details

Gene Information

Locus tag: PHATRDRAFT_39760
Symbol:
ID: 7195338
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011688
Strand:
Start bp: 636166
End bp: 637667
Gene Length: 1502 bp
Protein Length: 467 aa
Translation table:
GC content: 52%
IMG OID:
Product: predicted protein
Protein accession: XP_002183785
Protein GI: 219127108
COG category:
COG ID:
TIGRFAM ID:
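
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database: the variable names are hypothetical, and it assumes that the start and end coordinates are inclusive and that the gene span runs from the start codon through the stop codon with no untranslated regions.

```python
# Values copied from the gene record.
start_bp = 636166
end_bp = 637667
protein_length_aa = 467

# Assumption: coordinates are inclusive, so length = end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the span covers start codon through stop codon (no UTRs),
# so the coding portion is three bases per residue plus one stop codon.
coding_bp = protein_length_aa * 3 + 3

# Bases inside the gene span that are not accounted for by the coding portion.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)
```

Under these assumptions the computed span matches the reported gene length of 1502 bp, and the bases left over after the coding portion are what one would expect for a gene marked as spliced.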


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0199514
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCGCC GAACACTTGA ACAAACGCAA GCTCAGCAGC CTCAGCCCCA GCCCCAGCCC 
CCACGGCCTC GTCAGGGTCG CATTCCGGCG GCCTTACAGT CCGTTTTCTT GGCGTCTAAC
CGAGGAAAGA CGGTGGATCG CCCTCCTCCG GGCAAAGTTG CCTACAAGGT ATTCTTTCGC
AAGATATTCT TCCAAAAGAA GCCGGAAGCG GAAAAGAATA GCGGCTACAA GAATGGCTAC
AAGAAGTGGA ACAAGAAAGG CAACAAAAAC AGCGGCAAAT TCGTCAAGAA CAACTTTTAC
AAAAATAACA ACAACAACAA CAAGCAATCA CGCTTCTCGG CACCTCCGCA GAGCGTCGAC
AAGTCGAGTA AGATTTTGTC CAGCACGTCG ATTAGTTTTG GAAAGGCTAG TAACGCCAAA
AGCAAACCAA CTCCGTCAAC TCGTGCAGCA GCTGTGGTAC GTGAAGTAGT CAATCATTGG
TTGATCGAGG ACACTCGTGT TCACGTTTCC ATTGTATACG TAACTCACGC TCTGAGCTTT
CTTCTCTTTC GCAGATTGAA GAAAAGCAGG CTATTTCGCC CCAATCCTAC TTGGATGATA
TGATCGCGGC ACGCGGTTAC TCCACGGAAA AGTTTAAAAC CTTGCAAACG GCCTACTATA
ACAAGCCCAC TGCGTTGCAG CAAGCTTCCT ACGACGTCTA CCTCATTGAT CTCGTCAAGA
AAAACGGAGT GGAAACCCTT CGGAATATCT TCAAGTCCGG TGTTTCGCCC AACCCCTGTA
ACACTTTTGG GGAAAGTCTC TTGCACATGA TCTGCCGTCG CGGTGACGTC GATTTGTTAA
AGGTACTTTT GGAGTGCGGT ACCAACCTGC AGGTGGCCGA TGATTACGGC CGAACGCCGC
TACACGACGC TTGCTGGGCG GCGAAACCGG CCTTTGCGGT TGTCGACTTG ATCCTCGAAC
GCGATCCTCG TTTGCTGTAC ATGTCCGACT GTCGAGGCGC CTTGCCGCTC TCTTACGTGC
GCAAGGAACA CTGGTGTGAG TGGGTCCCGT ACCTTGAGGC GAGGAAGCAC ACGTATTGGC
CGGTGCTGAC CAACAACACC GACACGGATA GTCAAGTTAA GGCAGAAGCA CCCCCGCTGC
TGTGCACACA AGGAGCCAAT ACCCGACCGT TGCGCGACCC CAAGGACGCT TTGACTTGCG
AAATGGCCAA AATGGTAGTT TCCGGTAAGA TGCAACCGGA CGAAGCTCAG TTTTTGCAAT
ACGATGTGAC CGATGAAGAC GACGAGTCTC GTTCTAGTAG CGAGGCGGAG GAAGACGCCG
AAGAATCTGG TGACGAGAGC GGTAGTGATG AAGGAATTGG CAGTGGTACT GAGAGCGACA
GCGATGATGA CAGCGAGTAC GATAGTGAAG ACGATGCGAG CGATTTCAGC TTGGACGAAG
ACGAGATGGC AAGTATTCTG AATACATTGG CACCCCGGGC AGCGTCAGTC GAGAAACAGT
AG
 
Protein sequence
MVRRTLEQTQ AQQPQPQPQP PRPRQGRIPA ALQSVFLASN RGKTVDRPPP GKVAYKVFFR 
KIFFQKKPEA EKNSGYKNGY KKWNKKGNKN SGKFVKNNFY KNNNNNNKQS RFSAPPQSVD
KSSKILSSTS ISFGKASNAK SKPTPSTRAA AVIEEKQAIS PQSYLDDMIA ARGYSTEKFK
TLQTAYYNKP TALQQASYDV YLIDLVKKNG VETLRNIFKS GVSPNPCNTF GESLLHMICR
RGDVDLLKVL LECGTNLQVA DDYGRTPLHD ACWAAKPAFA VVDLILERDP RLLYMSDCRG
ALPLSYVRKE HWCEWVPYLE ARKHTYWPVL TNNTDTDSQV KAEAPPLLCT QGANTRPLRD
PKDALTCEMA KMVVSGKMQP DEAQFLQYDV TDEDDESRSS SEAEEDAEES GDESGSDEGI
GSGTESDSDD DSEYDSEDDA SDFSLDEDEM ASILNTLAPR AASVEKQ