Gene PHATRDRAFT_43471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43471 
Symbol 
ID7197176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011670 
Strand
Start bp551445 
End bp553979 
Gene Length2535 bp 
Protein Length785 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177638 
Protein GI219111773 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGAAATATCA GTGGACGGTC GCACCCACAT GCACACAATC GGTCTTTTTC ATTCTGCCGT 
CCTCATCTCC GCTCTCCACC TTTTTTTAAC CCAGGAGAAC GAAGAATAGA TGTCGAGCGG
ATTTATTGAC CCTATTTCGT ATCGTCGAAT CTGATCAGAA AACCATAAAG CTCAACCATG
AGGCACAGCG GAGTCGCAAC AACTGCGCTC GGTTTGTTTG CTTTACCCGG GGCACGTCCG
TTTTCTTTTT CTTCAACAGG CACTTATTCT CCGCAAACGT TTGCTCCTTG TTGCGCTGCC
ACTAGTCGCA GGAATTCTGC AACTTGCAAC AATGAAAGCT TTCAAGCAGA CGACATCTCA
CTCCTCGACT CTATGCCAGA TCGAGAATTT AAGACCTCAA CCCCGTCGAA GGGAGATATT
TCAGAGTTAT ATCTTTCCTT GTCGCATTTT ATTGATATGA TGGATAAAGC TCCACCTGGA
ACACTGCAAA CGGAAGAAAT AAGGCTGCTT AGAGAAATGA TGGCTTCGAT TGCGGAAGAG
AATGAAGGCC ACGTGGATAC TATGGAGCGG CTGCTCTACC GTATGATAAG CGAATGGGGG
GCAGCTACTC AGAATATGAG AGACGATCGA ATCATTTTGC TAGAACCCAA GCTGGAAGAT
TTTTTGCTTG TTTTTAAGGC GTGGCAAGCC ACTTTGGAAA CGACACTCAG GTCGAAAACA
AGTTCCGGGA AACTTAATAC AAAGGGGCTC TCAGTCACGG TTGAGCGTGT GTGGCGGCTT
TTTGCGACAC AAAAATCCCT TTTCGAGAAT GGACTGGAAT CTGTCAAACC TGACCAAGAA
ACCTTTCGTA CCGTGCTTTC AGTTTTCTCG GTGTCACGGG ATCGCGGAAT GGACCGCAAG
GTGTGGTCGT TGTTTGAAGC GATGCAATCT TCCTACGATG TGGAACCAGA CGAATCAATT
TACAATTTCG TTATCCGCTC CTTAGCCAAA AGCCGACAAC GCGGGGCCGC CGAACGTGCT
GAAAATCTGC TTCGAGAAGC AGTGAAACAG TATCCTCCTT GGATGGACGT GAAAGGTCGT
GTCCGCGGGA TGAGCGTCGA AAGCTTCAAT GTAGTTTTGA CGGCGTGGGC AAAAAGCGGA
CTCGATTACG GAGCTGAGCG TGCCGAAAAA TTGATCATTT TTATGGACGA AGTGGATTCA
GCGAACGGTA GTCCAGGGAC CGTGAAGCCC AACATTTCGA GCTTTACCTC ACTCATCGAC
GCGTACGCCC AGCAGAGCGA GTGGGATGGC GTTGCAGCTG CCGAACGTAT TCTCAACCGA
ATTTTGGACT TTTATTTGGA AGGCGAAGCC GAATTTCAAC CTAGTATTGC TACTTGGACC
ATTGTCTTGA GTGGATGGTC CCGTCTCTCG AGAAAAGGAA ATCGCGGAGC TGCTTCTCGC
GCTGACCGGC TTTTGAAAAG AATCGAGTCA TTGCACCAAG AAGGTCGGAT CGAGTTCGAG
CCGGATGCGA TTGCTTATGT CACGGTGATG AACGCTTTTT GCTCGTCAAA GACTCCAGAC
GGGCCACCTC GCGGGGAAGA TATTCTGGAC GAGATGAATG AGCGCTTCAT GGACGGCGAC
GATTCGATGC GACCCAGTGC TAAATCCGTA CGAATGGTAA TTGACGCATG GGTTAAAAGC
AATGGCCCTG GTTGCATGGA GAAAGCCGAA CGTGTATTGG ACCGATATGA AGATCATTTA
GCTTCCTCAG GCCCACCAGA CGAGCCACAT GGAACTTTTG AAGATACCAC CGAAATTTAT
AAAATCTTGT TGTTCGGATG GGCCAAGAAT GGGGATCCCG AGCGCGCGCA GGACTACTTA
TTGGACATGG TGGACAAAAA CATGCAGCTC GACTCGTTTT GTTTCGACAA AGTGATTGAG
GCTAACACAT TTCTTAACGG ACAGGACTCC TTGCAACGCT CGACACAAGT TTTCGAGCTA
TTGGAGAAAG CCCGCAAAAC AGGAACAATC AAGCCGAACG AACGTGTGTA CACATCATAT
ATTCGCGCAA TTGCTAAGGC TCGAGTACCA GATGTGGCTG AAAAAGCCGA TGCTGTTCTG
CAACGTATGC AGGACCTGTT TGCCGAGGGA AACAGGGGGA TTGAACCTAC GGTATTTACG
TACAACGCTG TTCTTATGGC GTGCTCCGAA ACCCCTAATA CGGAGAAAGC CAGCAACGCT
TCTGCTTTCA AAATTGCGCT GCGCATCTTC AACGAAGTTC GAGGCCAACG AAGAGGACCC
GATCACGTGA CTTTTGGCAA CATGCTGAGA TGCACGAACC TGGTCCCGAA AGGCGATCAA
AAGGACAAAC TGGTCAAGGC CACCTTTCAG CTTTGCTGCA AGACTGGCTG GGTGAACTCG
TTCGTCCTAC GGGACTTACG CGACGCTGCT TCAGCAGAAT TATTGGAATC GCTTTTCTCA
CAATCTTTGG ACAGTCTTGA TGTGGAGCAG CTACCGGCAA GCTGGCAAAG GCAATTTGCA
AACAAACGAA GGTAG
 
Protein sequence
MRHSGVATTA LGLFALPGAR PFSFSSTGTY SPQTFAPCCA ATSRRNSATC NNESFQADDI 
SLLDSMPDRE FKTSTPSKGD ISELYLSLSH FIDMMDKAPP GTLQTEEIRL LREMMASIAE
ENEGHVDTME RLLYRMISEW GAATQNMRDD RIILLEPKLE DFLLVFKAWQ ATLETTLRSK
TSSGKLNTKG LSVTVERVWR LFATQKSLFE NGLESVKPDQ ETFRTVLSVF SVSRDRGMDR
KVWSLFEAMQ SSYDVEPDES IYNFVIRSLA KSRQRGAAER AENLLREAVK QYPPWMDVKG
RVRGMSVESF NVVLTAWAKS GLDYGAERAE KLIIFMDEVD SANGSPGTVK PNISSFTSLI
DAYAQQSEWD GVAAAERILN RILDFYLEGE AEFQPSIATW TIVLSGWSRL SRKGNRGAAS
RADRLLKRIE SLHQEGRIEF EPDAIAYVTV MNAFCSSKTP DGPPRGEDIL DEMNERFMDG
DDSMRPSAKS VRMVIDAWVK SNGPGCMEKA ERVLDRYEDH LASSGPPDEP HGTFEDTTEI
YKILLFGWAK NGDPERAQDY LLDMVDKNMQ LDSFCFDKVI EANTFLNGQD SLQRSTQVFE
LLEKARKTGT IKPNERVYTS YIRAIAKARV PDVAEKADAV LQRMQDLFAE GNRGIEPTVF
TYNAVLMACS ETPNTEKASN ASAFKIALRI FNEVRGQRRG PDHVTFGNML RCTNLVPKGD
QKDKLVKATF QLCCKTGWVN SFVLRDLRDA ASAELLESLF SQSLDSLDVE QLPASWQRQF
ANKRR