Gene PHATRDRAFT_38005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_38005 
Symbol 
ID7202717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011682 
Strand
Start bp637916 
End bp640933 
Gene Length3018 bp 
Protein Length1005 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182106 
Protein GI219123591 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGCGTG AAGTGAGACT ACTCTTGCGG CCAATGCGAG GGCTTTGTTT CCTGGTGTCG 
GTGGTATACG TGGTCGCCGT CTCCGGCCCC TCGTCGTCGA CGACGACACC GGGGTATGTT
CCTCCTGATC GCCTTTGGTT GGACGTGGAC AACACGTTGT ACAGTGAATC CATCCTATCG
GGCATCGGAA GAGGCATCGA AGCGCAAATC GTACGCGGCG TTCACGAATT TTATGAGCGT
TTCGAATCGG ACGATACCGA CAAGGTCGCT GCTTCAGCAT CACCACAAGA ACGAGCCGAC
GCCCTTCATC AGCAGTACGG GTCCACAATT GAAGGCTTGC GACAAACACG GTGGAAGAAT
CTTTTGCCAA ACGAGTTATC GGCAAAGATG CGGAACTTTT ACGAACGTGT CTACCAAGAC
GTGGACGTAA CCGCACTACT CGACACCGAC CGTTCCCGTC ACTCTCACGG AGGAGCATCT
TCCACGGGTT ACTCACACAA CGCTGTGGCC CAGCAACAAA CCTTGTTACG CGATGCGTTG
CGATACTCCC CCGTACCCGT TGGCTTGGCC TCTAATTCGC CCAAGCGCCA TATTTCTAAA
GTCATCCAAG CTCTTGGACT GACCCGTATT CCATGGCACG CCGTCTGCAC ACCCGACTGC
GCCGACGCCC CGTCTCTGGG AGCGAATATT CCTTTCCGTA GTGACACTGG CTGCAAAGAG
GATTTTCCCA CAAAGTTGTC ACCAAACTTT TTCCCGAACG TGCGGGGTAT CTGTGAAGTA
TTGCTGGACG ATTCACCAAC CATCCTGCGG TCTGTGGAGG CTTTGCAGAA GAATTTACAG
GGGATTCTCG TGTCGGAAAA ATCTTCGTTG CTGCAGGGAC TAGGGCAGGC AATTGGTTGG
ACAGATCCGG CCTTCGAATT TTCCCAAATG GAGTATTTGC GGGCCAAAAA CGTCGTCGAT
ATGGAATCGA TTCACGAAGG AACCTGGCAA CGACTTGGAA TGGAGCTGCG AGCACAACGC
AAGGAGGAAT TCAACAACGC GAAAGGTACT CTAAGTACAG GCTCTCCGCT ATGTGTAGTT
GACGTAGGGG CCGGGCTTCT CTCCATGTTG CGACTGATAC TGCACGGACA CGGAGCCCGG
TTACCTTCGT TGGTACACCT TTTACGAGAA AATCAACATC CTGGCGTCTC CTCTCTGGAA
TACTACGCCT ACGAACCAAA CCGTGAGTTA GGATATGCAG CTACCGTAGA GTTGGAGCGT
CTGGGTTTTG GTTTGCAACA AACGTTGCAA TGGGAAGATT CGAGTAGTCC GACGCCACAG
TCTTGTCAAG AATTTATCAT GGTCAAACCC GCGAACACGA CCGACGATCA GCCAAAGGTC
ACTGTTTATC TTCGTTTTTG GGACTATCAA CGCGAAATGC ACCGACCGCA ACCGACGCCA
CACGTCATAG TTGGTTGCTG CTTTGCCGAT CTCATGGATC CGTACGAATT GTCCAGATCC
CTCCTTCGAC GATTTCTAGC ACCGCCATCT TTGAGTCATT TTGACCATAC CCTAGTCTAC
TTCCCCATAA CCTTTTGCGG TGTCACTCAG TTCTTGCCAC CCCAACCAAT GGAATGGATT
GCAAACATAC CGTCGGATAC AACTGCATTC GCACTCTACG CCAAAGCGCT CCGGGAGATT
CACGGACACA GCTTGGATCC GTATAGTTTG GAACAGGCAT TGGGGGACTA CGGCGCGACA
TGTTTGGCAA GAGGTCAATC TGACTGGCAA ATCGATCCTT CTCGTGACGA ATACTTGTGG
GAAACTATGC TGTATTTTTT CGGAACCGTC ACCTCGAGTG TACTCGAGAA GGCGGCATGG
AATGCTTTGG GTTGGTTGGA ACGGACACGG GGCCTCAGAC CTTCGATTCA AGTTTCCAAC
ACGGACTTGC TCTTTCGCTT TCCGCATGTG GGAAGTTGGC AGGTAAAATC TGAGCAGTCG
AGTGATACAT CGCGAAATCA GACACACACG TTCCAGGAAA TCCAATTTAC GGCTCCCTTC
AAAGTGAAAG CAATATCAAG AAAGCTCGTC GCCCTTGGAC CCAATCAAGT CCGCATTCGA
GCCATACACT CACTAATTAG TTCTGGGACG GAATTGAAGA TATTCAAGGG GCTATTTGAA
GATGCTGCTC TGGACCTTAA CATAGAGGGA ATGACAGAGG AGCGCATGTC TTATCCCCTT
TCATATGGCT ATTGCTCCGT CGGTCGTGTT GTGGAGTGCG GCATGGATAT TTCCAATCCA
GGGGACATTT TGGGCAAGCT GGTATTTACT TTTTCGTCTC ACGCCTCGGA GGTAGTAACG
GATAGAGATG CCATACAGAT AGTACCTGAC GGCATCGGCG CTCTTGACGC AATATTTATG
CCGTCGGTAG AAACAGCCTT GTCGATTGTC CACGACGCTC ATATTCGTAT GGGAGAAAAC
GTGGCTGTTT TCGGCCAAGG TCTTATTGGT CTCTTGGTGA CGGCGCTGTT TTCTAAGCAA
GGCTTTGATA CTTCGGGACG ATTGCGAGCG TTAACGGTCT TTGACATGCT TCCCGATCGT
CTTGCGATGT CAGCACTGAT GGGAGCGACC CAGGCGCTTT TGCCATCTGA AGTGAAGACG
GCGGGCCCTT TTGACGTGGC AATTGAAGTC AGCGGTAACG GCCGCGCTCT CCAAGCAGCG
ATCGACAACG TGAAAGAAGG GGGGCGTATT GTCATCGCGT CATGGTACGG AAGCACTGCT
GTAGATCTTA ACCTTGGTAT TGAGTTCCAC CGCAGCCACA AGATTTTAAA AACTTCACAA
GTGAGCAAAA TCCCTGCCGA GCTTGGATCG ACATGGACCA AGGAGAGAAG ATTTGCGCTA
GCGTGGGAGC TAGTGCGAGA ATATCGTCCG TCGCGACTCG TAACAAAAAG GACGAAGCTA
GAAGACGCTC AAGAAGCTTA TGACGCCCTA GAGAACGGCT CCGAAATTGC GATTGCTTTT
GATTATGATT TAGCATGA
 
Protein sequence
MRREVRLLLR PMRGLCFLVS VVYVVAVSGP SSSTTTPGYV PPDRLWLDVD NTLYSESILS 
GIGRGIEAQI VRGVHEFYER FESDDTDKVA ASASPQERAD ALHQQYGSTI EGLRQTRWKN
LLPNELSAKM RNFYERVYQD VDVTALLDTD RSRHSHGGAS STGYSHNAVA QQQTLLRDAL
RYSPVPVGLA SNSPKRHISK VIQALGLTRI PWHAVCTPDC ADAPSLGANI PFRSDTGCKE
DFPTKLSPNF FPNVRGICEV LLDDSPTILR SVEALQKNLQ GILVSEKSSL LQGLGQAIGW
TDPAFEFSQM EYLRAKNVVD MESIHEGTWQ RLGMELRAQR KEEFNNAKGT LSTGSPLCVV
DVGAGLLSML RLILHGHGAR LPSLVHLLRE NQHPGVSSLE YYAYEPNREL GYAATVELER
LGFGLQQTLQ WEDSSSPTPQ SCQEFIMVKP ANTTDDQPKV TVYLRFWDYQ REMHRPQPTP
HVIVGCCFAD LMDPYELSRS LLRRFLAPPS LSHFDHTLVY FPITFCGVTQ FLPPQPMEWI
ANIPSDTTAF ALYAKALREI HGHSLDPYSL EQALGDYGAT CLARGQSDWQ IDPSRDEYLW
ETMLYFFGTV TSSVLEKAAW NALGWLERTR GLRPSIQVSN TDLLFRFPHV GSWQVKSEQS
SDTSRNQTHT FQEIQFTAPF KVKAISRKLV ALGPNQVRIR AIHSLISSGT ELKIFKGLFE
DAALDLNIEG MTEERMSYPL SYGYCSVGRV VECGMDISNP GDILGKLVFT FSSHASEVVT
DRDAIQIVPD GIGALDAIFM PSVETALSIV HDAHIRMGEN VAVFGQGLIG LLVTALFSKQ
GFDTSGRLRA LTVFDMLPDR LAMSALMGAT QALLPSEVKT AGPFDVAIEV SGNGRALQAA
IDNVKEGGRI VIASWYGSTA VDLNLGIEFH RSHKILKTSQ VSKIPAELGS TWTKERRFAL
AWELVREYRP SRLVTKRTKL EDAQEAYDAL ENGSEIAIAF DYDLA