Gene PHATRDRAFT_49253 details

Gene Information

Locus tag: PHATRDRAFT_49253
Symbol:
ID: 7195548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011689
Strand:
Start bp: 380396
End bp: 382396
Gene Length: 2001 bp
Protein Length: 566 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002183862
Protein GI: 219127271
COG category:
COG ID:
TIGRFAM ID:
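
The Start bp, End bp, and Gene Length fields agree with each other if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the check follows; `span_length` is a hypothetical helper name, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span whose start and end are both inclusive (assumed convention)."""
    return end_bp - start_bp + 1

# Values copied from the Gene Information fields of this record.
START_BP, END_BP = 380396, 382396
PROTEIN_LENGTH_AA = 566

gene_length = span_length(START_BP, END_BP)
# Nucleotides needed to encode the protein plus one stop codon.
coding_nt = (PROTEIN_LENGTH_AA + 1) * 3

print(gene_length)  # 2001
print(coding_nt)    # 1701
```

Under this reading the listed span would be 300 bp longer than the coding region plus stop codon, which may indicate that the displayed gene sequence includes some flanking sequence.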


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.836677
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CAATATTTGA GTTGAATCAT CAACAAAAGA CTAACTGTAA AAACATCTAC ATTACCTTCC 
GAACAAATCA TGAAGATAGT GATTGTCGGA GGAGTCGCTG CCGGGGCTAG TGCCGCTGCC
CGTGCTCGCC GTTTGGACGA ACACGCCGAG ATTCTTCTCT TGCAGTCTGG ACCGGACGTA
TCCTTCGCAT CATGCGGCAT GCCTTACTTG ATCGGAAACG AAATAACCGA CCGCGCCTCT
ATGGCCGTTC AAACACCGCA ATCCTTGAAC GCTCGTCTCA ATATCATTGT TCGAGTCAAT
ACCAAGGTCA ATGAAATCAA TACAACCGAC CAGACTGTTG TCGCTCGGAA TGAAACCACC
GGGGATATCT ACACCGAACC CTACGATGAA CTTGTTCTGG CAGTGGGTGC GGCTCCCTTC
AAGCCTCCGA TCCCTGGTAT TGACCGTCCT GGATTGTTCA CGCTCCGCAA TCTTCAGGAA
ATGGATGCCA TCGTTCAGTG GCTCAATGTC AAAACTGAAA CCAAGAAGCC CGCTGACATG
CACTGTGTGG TTGCTGGGGC GGGATTTATC GGACTCGAAA TGGTAGAACA GCTACATCAT
CGCGGCATGA ACGTGACTCT GGTCGAAATG ATGCCACAGA TCCTGGCCCC TATGGATCAG
GAAATGGCAG CTATGCTACA CAAAGATCTC GAAGACCACG ATGTTAACGT GATTGTTGGA
GACGCCATCA AAGAATTTGC CGCCTACGAG AAGGATGCAG ATAGCTCGGT GCTGACTTTA
CAATCCGGAC GTGTCCTTCC TCCTGCCCAG TTGACGATTC TTGGCCTTGG TGTCCGCCCC
GACACGGGCG TAGTCAAGGC AGCGGGCATC GAGCTTTCGC CTAGAGGCCA CATTCTCGTT
GATGAACACT TGCACACATC CGCAGCAAAT GTTTGGGCTG CTGGTGACGC CGTTGAAATC
ATTAACCCAA TTTTGCCGGA TGAAAAGTGG GCCGTCCCCT TAGCAGGTCC AGCGAATCGC
CAAGGTCGCA TGATTGCCGA CAACATATAC GGCAAAAAAC GCTCTTTTCG GGGAACGTAC
GCGGTCAGTG TAGTACGGTC CTTTGATCTA TACGCAGCCT GCGTCGGTCT TAATGAAAAG
TTTCTCAAAG CCAAGAATGT TCCCTATAAT GTCGTGCATG TACATCCCAA CAGCCATGCC
GGATATTATC CTGGCGCCGA AAAGATCCAT CTCAAACTGG TCTTTGACAA GGAGTCTGGA
AAAATTTACG GCGCTCAGGC CGTAGGCAAG GACGGCGTTG AAAAACGGAT TGACGTGATT
GGGACTGCCA TGCAGGGTAA AATGACGGTG TCGGACTTGG CCGAGTTGGA GCTCTGCTAT
GCCCCTCCGG TTGGTTCGGC AAAAGATCCA GTCAACTTTG CCGGCATGGC GGCTCAAAAC
ATTATGGACG GGCTCATTTC CAATGTGGAA TGGTACGAAA TGGATGATCT CGTCAAAAAT
CCTGACGTAT TTGTTTTGGA TGTTCGTGGA GGAGCCGAAA TCGAAAAGAC TGGTAAGCTT
GCAGAAAAGG CCGTCAATAT CCCCGTCGAT GATTTGCGTG CCCGACTCTC CGAGGTACCG
AAGGACAAGC GTATTGTTGT GTCGTGTGCT TCAGGGCAAC GGTCGTATTA TGCTTGCCGT
ATTTTGAAGC AAAACGGGTA TGCCAACGTG GACAATTTGG ACGGTGCCTA CTTAACCTTT
CACGCCGCCC ATCCAGAACC GGCTGCGTAA GGTTGCTTTC TCTTACGCCA ACCCATAGGA
TAGAATAAGT CCAAAATAAG GAAAGCCAAC AGTCTATACC TATAGCGGAC ACCATAAGTA
AAGACTCCAA TCCGAACTCT GTGCGTTTCT GCAGAGAAGT GCATGCAATC TGTATTGCCG
GTCCAAAGAA TCTCGGAATT GTTGCCATAA ACAGCTCAAT ACAGGTAGTT TGTATGTGAA
AATCGAGAGA CAATGGCCAG C
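
The gene sequence is listed in space-separated blocks of ten bases, so it has to be cleaned before its length or GC content can be recomputed. The sketch below shows one way to do that in Python. The helper names `clean` and `gc_fraction` are hypothetical, and whether the database computes its GC content field over exactly this displayed span is an assumption. Only the first line of the listing is used as demo input.

```python
def clean(seq_text: str) -> str:
    """Drop all whitespace from a block-formatted listing and upper-case it."""
    return "".join(seq_text.split()).upper()

def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)

# First line of the gene sequence above, used only as a short demo input.
first_line = "CAATATTTGA GTTGAATCAT CAACAAAAGA CTAACTGTAA AAACATCTAC ATTACCTTCC"
seq = clean(first_line)

# Prints the number of bases and the GC percentage of the demo line.
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full listing to `clean` in place of `first_line` would give the figures to compare against the Gene Length and GC content fields.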
 
Protein sequence
MKIVIVGGVA AGASAAARAR RLDEHAEILL LQSGPDVSFA SCGMPYLIGN EITDRASMAV 
QTPQSLNARL NIIVRVNTKV NEINTTDQTV VARNETTGDI YTEPYDELVL AVGAAPFKPP
IPGIDRPGLF TLRNLQEMDA IVQWLNVKTE TKKPADMHCV VAGAGFIGLE MVEQLHHRGM
NVTLVEMMPQ ILAPMDQEMA AMLHKDLEDH DVNVIVGDAI KEFAAYEKDA DSSVLTLQSG
RVLPPAQLTI LGLGVRPDTG VVKAAGIELS PRGHILVDEH LHTSAANVWA AGDAVEIINP
ILPDEKWAVP LAGPANRQGR MIADNIYGKK RSFRGTYAVS VVRSFDLYAA CVGLNEKFLK
AKNVPYNVVH VHPNSHAGYY PGAEKIHLKL VFDKESGKIY GAQAVGKDGV EKRIDVIGTA
MQGKMTVSDL AELELCYAPP VGSAKDPVNF AGMAAQNIMD GLISNVEWYE MDDLVKNPDV
FVLDVRGGAE IEKTGKLAEK AVNIPVDDLR ARLSEVPKDK RIVVSCASGQ RSYYACRILK
QNGYANVDNL DGAYLTFHAA HPEPAA
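
The protein sequence can be related back to the gene sequence by translation. The sketch below is a minimal Python illustration under two assumptions: the standard genetic code applies (the Translation table field of this record is empty), and the reading frame starts at the first ATG in the listing. `translate_from_first_atg` is a hypothetical helper, and only the first two lines of the gene sequence are used as input.

```python
# Standard genetic code in compact form, bases ordered T, C, A, G (assumed table).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate_from_first_atg(seq_text: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    seq = "".join(seq_text.split()).upper()
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for pos in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # X marks an unknown codon
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First two lines of the gene sequence in this record.
excerpt = """
CAATATTTGA GTTGAATCAT CAACAAAAGA CTAACTGTAA AAACATCTAC ATTACCTTCC
GAACAAATCA TGAAGATAGT GATTGTCGGA GGAGTCGCTG CCGGGGCTAG TGCCGCTGCC
"""
print(translate_from_first_atg(excerpt))  # MKIVIVGGVAAGASAAA
```

The printed residues match the first 17 residues of the protein sequence above, which is consistent with the coding region starting at that ATG.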