Gene PHATRDRAFT_43116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43116 
Symbol 
ID7196733 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2075113 
End bp2077657 
Gene Length2545 bp 
Protein Length726 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177439 
Protein GI219111375 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00100373 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAGAGGTTGT GCGTCTCACG TTCCAATTCA TTCTACCATT GCTTCCTTTG CGCAGAACCT 
ACTTTGCATT CACGATCGAC GTTCCTTTAC CATGAGTACT GCAAAGGCTA ACGCAGTGGA
AGAGTTGGAA TCCGGTATAC GAGTCGCCGA TCCAATCCGG GCTGACGACG ATCACCGCAA
CGATGAGATC GTGGACGACC AGGTGGCGAC GCCCATTGCT GCGGTGGAAG TCCAACCCCG
TCTCACGAAA ACTTCCGAAG ATGTGGTGCT TTTGGAAAAA CAGCACGTCA ATGATCAAGG
AGAACCGATT GATGTAGCGC GAGCCGAACA AGACCGGAAC CTTACCAGCG AATCCGACTT
GTCGGATACG AATCGTAAGA TATGGGTCGT TACGACTGCC GCCATGCCGT GGAGGACGGG
AACTTCGCTT AATCCCTTAA TGCGAGCCCT ATATTTGACG CGGGGACGTC CCAAGCATAG
TATCACTTTG GTAATTCCTT GGCTGGAAGA CATCAAATCC CGGAAAAAGT TATACGGAGA
TGCCTTGTGT TTTGACGATG GGGGCAAGCA AGCCCAAGAG CAATGGATTC GAGAGTACTG
TCGGGAACGT TGCAAGTGCG AAGGTACGTA CGGATTGCCC CGGCAAGTCC TTTCTATGTT
CGTGTCTCTC ACTATGCTGT TTCGTGACTC GTCAAGAGGA GGAACAGAAT TTACGCATCA
TGTTTTGGAG AGGACGGTAT CACGACGGAT TTGGCTCCAT CTTTCCAGTG GAAGATATCT
GTAGCCTAAT ACCTAAGAAA GAAGCAGACG TGGCCATTCT AGAAGAGCCA GAGCATTTGA
ACTGGTTCCG ATTACCAACA AAAGTGGGAA AGAACGAAGA AAACCAGGAC GTGGACCGAC
TTGGCTGGGC TCACAAATTC AAACATGTGG TTGGGGTGGT GAGTTGTGTT GCGGAGTATA
CATACACCAA GCACCACATT TATACTTAAC TAACAGGTTT TCTTTTTGTT GCAGCTCCAC
ACAAACTACG GTGCCTACAT TCGTCAATAT GGAATGGGAA CTTCTTTCGT GACGGCACCT
GCGTTGGACG CGCTCAGTTC CTTGGTGGTG CGAGCCTACT GTCACCGCCT TGTTCGCCTG
AGTGCCACGT TACCTTCACT AGATTCAGAT ATTGAAGTCA CCAGCAATGT TCACGGGGTT
CGTTCCGAGT TTCTATCACC TCCCCAACGA AAGTCCGAAA CTACTAAGCC ACATGCTCCT
GTTTACTTTG TTGGCAAATT AATCTGGGCC AAGGGTTTTG ATAAAGTGCT CGAAGTACAG
GAAGCCTATC ATGAAGTGGC GGGGGAGTAT TTTGCAATGG ATATTTACGG TGGTGGTGAC
GACATGAAAG CGATCCAAAG AGGTTTCTTT GGGCGGCACA AATCAAATTC TAACAGGAGC
GACGACTCTT CGGACTCGTT GTCGCAAATC GAATCTACCG ATTCCATGGA TGACTCGCAA
GCTGCCGACG TTTTCGGTAA GAGCGAGTCA CTTAGAGAAC AGATTCTTAC CAGAAACAGA
GCTCATCATG TAAATTTAGC AGGAAAAGAA AAATACAAGT CAGATGATGA AGCCAATTCT
GAAATAAGTG GAGAAGAGGA CATTTTGGAA GGAGCCGACG ATAACGCACC CCTGGATATC
CTAGGCGATG TGTCCGGTAA GGCTTTGAGT ACCGGTGCCG AGACGGCCAG TGCAGCCATG
AAAATGATCG AATCGATCAT GTCGGCCGGA TTTGGTGCGT TTGGTGGCGG TAGCGAGAGC
AACTCGGAAA ACAAGTCAAC CGACGAAATC GGAAGCAAGC GGAGCAGGAG TAATGTACCT
TCATTTATGT TTGGACCGGC CCGCTCTCGT TTCAAGTGGC GCCGAACGCC AATTCCGGCT
CGTTTTCTTG GTGTCGAAGA TCATATTGTC GTCAGAGATA TTCCGGACCA TAAGATTTTT
CTGAATATGT CAATAACAGA GGTTTTGTGT ACAACATCAG CGGAGGCGCT GGCTATGGGC
AAATTTGTGA TTTTACCGAA ACATTGTACG TTTTGTGATA TAAACTTTTT GGCATCTCGA
GTGCGGCACT TTTCTCACCA ATGCTCTCTG CTGTCATAAA GCTTCCAATG AGTTCTTTTA
CTGCTTTCCA AATTGTTTGG CGTTCGAAGA CATGGATGAC TGCGTACGTA AGATACAGTA
CGCACTGACC AACAAACCGG AACCATTGAC GGACAAGTTT GTACGAATGC TATCTTGGGA
AGGCGCCACT GATCGCCTGT ACGATTCATC AGGGATGACG CGGGACGAAG CAGATAAGTT
AAAAGAGGCC GGTAGAGTGG AAAAACGCGA GAAGGCTGCT CGCTTTCATG TAGATAGCGC
CCAGAAAAGC CATTTTGTGA CGAGCTTACT CAGCGGAAAG ATGTTGAAGA AAGCGCTATC
TTCCCATTCC TTCAATTAGC ACCAGAAATT TGCAAACCAG CGAATTCCTT CTATTCAAAA
ACGGTAGTAA ATGATGAGTT CTTAA
 
Protein sequence
MSTAKANAVE ELESGIRVAD PIRADDDHRN DEIVDDQVAT PIAAVEVQPR LTKTSEDVVL 
LEKQHVNDQG EPIDVARAEQ DRNLTSESDL SDTNRKIWVV TTAAMPWRTG TSLNPLMRAL
YLTRGRPKHS ITLVIPWLED IKSRKKLYGD ALCFDDGGKQ AQEQWIREYC RERCKCEEEE
QNLRIMFWRG RYHDGFGSIF PVEDICSLIP KKEADVAILE EPEHLNWFRL PTKVGKNEEN
QDVDRLGWAH KFKHVVGVVF FLLQLHTNYG AYIRQYGMGT SFVTAPALDA LSSLVVRAYC
HRLVRLSATL PSLDSDIEVT SNVHGVRSEF LSPPQRKSET TKPHAPVYFV GKLIWAKGFD
KVLEVQEAYH EVAGEYFAMD IYGGGDDMKA IQRGFFGRHK SNSNRSDDSS DSLSQIESTD
SMDDSQAADV FGKSESLREQ ILTRNRAHHV NLAGKEKYKS DDEANSEISG EEDILEGADD
NAPLDILGDV SGKALSTGAE TASAAMKMIE SIMSAGFGAF GGGSESNSEN KSTDEIGSKR
SRSNVPSFMF GPARSRFKWR RTPIPARFLG VEDHIVVRDI PDHKIFLNMS ITEVLCTTSA
EALAMGKFVI LPKHSSNEFF YCFPNCLAFE DMDDCVRKIQ YALTNKPEPL TDKFVRMLSW
EGATDRLYDS SGMTRDEADK LKEAGRVEKR EKAARFHVDS AQKSHFVTSL LSGKMLKKAL
SSHSFN