Gene PHATR_10544 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATR_10544 
Symbol 
ID7204274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011671 
Strand
Start bp204927 
End bp206417 
Gene Length1491 bp 
Protein Length496 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002186299 
Protein GI219113431 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.0977616 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTAGCGCTGC ACGCTCTACA GCACGTACTG CAATCACGCG CTACAATTTA TCAACACAAT 
CAACGTATTC AGGCTTTGGA AGACCAAGAA GACGAGCAAC AAAAAGTCTC TGACAATGAT
AACGATGAGG AAGAAGAATC GTGGCGTGAT CAAGGATTTA CCCGACCAAC GGTGTTGATT
CTCCTACCTA CCCGAAGTAC GTGTCATGAT TTTGTCAAGA CACTCCTGTC TCTACTACAG
CAGGATCAGC CTCAGCAGCC GAATTCCAGG GATGACGATG CTCGAGACCG GTTTGATCAA
GAATATGGTC CCTTGCAAGT GGACGATGCC GACATGGACG AGAATGCCCG TGTATATCGC
CAGAAGGCTA TTCAAAGTAA GGGCTCGGAT TGGCACGAAT TATTTGGAGA GACTGCAAAC
GATGACGACG ACTTCAAACT CGGGTTATCC CTTCACCCCA AACGCAGAAA CAAAAAGAGC
GTCACGGAAT CTACGTGTGA TATCAAGCTC TATTCCGATT TCTACAAGAG CGATATTATC
GTGGCATCGC CGCTAGGACT GAAAATATCT GTCACACCGG AGTCGATTTC GGAAACTTCA
GACAACGACG CAGATTTCCT TTCCAGCATC GAGATGTGCA TTGTGCACCG CTCCGACGTG
CTGTTGATGC AAAATTGGGA TCATGTGATG GATTTGCTGC CATTGCTGAA CCAGCAGCCA
AAAAAGACCA ACGATACGGA TTTCTCGCGC GTACGACCGT ACCTTCTAGC CGGACAGGCT
GCACAATGGC GACAACTCAT CATGACAAGC CAATTTTTGG ATCCCTTGAT TTTGTCTACA
TTCAAGCGTT TCTCCGAGAA TCGACAGGGT CAAGTCCGTA TTCGCCGCAA GACACCCGCG
GAAGAAGCCA ATGTCACCAG TGTACTGTTG CCCGTTCGTC AAGTTTTTCA GCGTGTGTCC
TGCAGCACCA TAGCGAATCA AGGAGCCGAT CGTGTACGCT ATTTTGTCGA CAGCGTACTC
CCTCAGATTC AAAGGCACAA GCAGCATCAC ACCATGATTT TCATTCCGTC TTATTTTGAC
TTTATATCGC TCCGCAACAT CTTGTTGAAG AAGGAAGTTG AATTCGTATC CGTGACCGAG
TACGCCCGGA CTAGCGAAGT GAGCCGGGGT CGGGCTCGTT TCCTGCAGGG CCGCAAACCG
ATCATGCTGT ACACGGGTCG AGCACATTAC TTTTTGCGGC ACCAGATCAA GGGAATCCGA
CACCTAATTT TTCTAGGTGT ACCGGAAGAG GCATCTTTCT ACGCGGACCA CGTGAATCTT
CTCAATGAAG GGTTGGAGAA GAGGGACGAT ATAATTATGG ATGACGGATT GGCAAGTTGT
TTGGTGTTGT ACACCAAGTA TGACTCGTAC GCTTTGGAAC GGATAGTTGG AACGGCCAAC
TGTAGTCGTA TGGTAAGGGG AGAAAAGTCG AGCTTTATCT TCGCCTCGTA A
 
Protein sequence
LALHALQHVL QSRATIYQHN QRIQALEDQE DEQQKVSDND NDEEEESWRD QGFTRPTVLI 
LLPTRSTCHD FVKTLLSLLQ QDQPQQPNSR DDDARDRFDQ EYGPLQVDDA DMDENARVYR
QKAIQSKGSD WHELFGETAN DDDDFKLGLS LHPKRRNKKS VTESTCDIKL YSDFYKSDII
VASPLGLKIS VTPESISETS DNDADFLSSI EMCIVHRSDV LLMQNWDHVM DLLPLLNQQP
KKTNDTDFSR VRPYLLAGQA AQWRQLIMTS QFLDPLILST FKRFSENRQG QVRIRRKTPA
EEANVTSVLL PVRQVFQRVS CSTIANQGAD RVRYFVDSVL PQIQRHKQHH TMIFIPSYFD
FISLRNILLK KEVEFVSVTE YARTSEVSRG RARFLQGRKP IMLYTGRAHY FLRHQIKGIR
HLIFLGVPEE ASFYADHVNL LNEGLEKRDD IIMDDGLASC LVLYTKYDSY ALERIVGTAN
CSRMVRGEKS SFIFAS