Gene PHATRDRAFT_47805 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_47805 
Symbol 
ID7203049 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011683 
Strand
Start bp99172 
End bp101302 
Gene Length2131 bp 
Protein Length683 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182324 
Protein GI219124047 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.950548 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAAACC GATTGTGTAC GGTTCTTTTA GCTGCCGGAG TTTTCTCCGA ATCGCATGTG 
ACCTTCGCTA GCCAAAACAC AGCGTCGGTT AGTGACTTCC ACTCGGTTCC TGCTTTCACG
GTGGAAGAAC TATCTTTTGG TCACCGAACC AACGATCTTG CTGAAGCTCT TCGTTCTACA
GGGATGATTG CTGTGAAGCT TGATCATCAT GTGAAGCACA ACGATTTTGA TATTGCCGAA
TCTCGAAGAA TTGCTTTGGG TGGCTTGTGC GGCTGCGCTA GCAGCGACGC TTCGTCGTTG
TTCCAATCTG TTTCGGGAGC TGACACTTCT CTACTCTCTG ACGGATTGAC GACGCGCAGC
ACATTTGCTA CGGCTACAGT AGGAGGCACT CCTTTGCAGC TTCCAGTCGA AGGTCTCGAA
AAGGCATGCG GAAGGGAGAC GACCGCAGCC ATGGAATCCT TGCGCGACAA CGTTGCAGTT
GCTTCCCGAA GCTTCGTGTC TGCCGTAGAC CGTCTACGCC GGGATTCTTC AGCCCTACCC
AACAGTCTTG CCCCTATTCT GCAAAATATT CAAGGTGGAA CTTACTCATC GCTCGCGTCC
ATTGTCAAAG CTTCTACGCA CCTTGAGCAC TTCCATCTCT ATTCCAAACA ACAAAGACAA
GCTCCCAGCG GGATGTCCCA AGCATCGGAT ATGGAGGAAG TCCTCCAGGT CCACACTGAT
GCAGGCCTCT TTTTGTCTTT TGTTCCGGGT CTCGCCTGTG GGAATTCTCA CACCAGCGAA
AGCGAGCACT TTTCGGTTCT CATTCATGGG AAGCTCCATC GCGCTGTCTT TCCTCCTGAC
TCGGTTGGTA TCATGCTCGG AGCTGGAGCG CAGCATTGGC TGAAGAACGT GGATTCTTTG
GAACTCCATG CCACTCACCA TGCAGTTCAA ATGCCCAGTG GTTCGGCACG TGCCTGGTAC
GGAATGAGTA AGTCAATCAG TCTATACGGG AATAGAACGA GAAATGAATT CACCTCACAA
GCACCTATAC TTTTTCTTCA TTCAAGTGCA TCTGGTCCCA CAGAACGCGA TAATCCAAGA
AGACCCCGAG CCAAAGACCT TTGCTGATAT GCGAAAGGCA ATGGTGCTTT CCGGCTCCAA
GACACACAAG TACGGCGGTG GCTCGCTCCA GACCGACGAA GCCCTTTCTA TTGGTTGTGG
TGCGGGGTCT CAAAAAATCA ACGATCACAA TTTCGGATCA TTCGGCGTGT CAGCAAGTCG
TCGACATCGC CGTCGTCTTC AGATGGTTGA TGATGCAAGC GTCTGTAACA ACTCGACAAA
TTTCTATTGC TGGATGAGCT GCCTTGACAT TCCAGAAGCG CAAAATGCCC AAGGTTACCT
TAACGAAGGA TACTCGTTGT ATTGCCTGGA CCCTGCAACC TTGTCAGCTT CTGGAAATCG
AGCATCAAAA GCCATGGAAC CTTGCATCGA GGATGGAGTG GTGGGTCTCG CCATGAATGA
AAATTGCATG GGAAGCTGGC AGCCAACTGC TCCTGGGGTT GTTGCTCAAG AGCTTGCCCT
CAACTTTACC ACCTCCGAAG AAGAGCCGTT CTGCTACGGT GGGACGTCCA TGTACATGGA
CGGTTTCCAC TGGCTAGACT CAACCTGCGT CATCTATCTC TTCCCCGAAT GGGTTCTCAG
TACACCTGGA AAGTTGGTTG CGGCATGTGT AGGATCGATT TTCTTTGGAA TGTCTCTTGA
GGGTGTCATT CGAGGTCGTC GTGATCTGGT TCAGTCTATT GCTGTTGGCT GGAAGCGCCT
CTTCATTTCG TCCGGCATCT ACGGTCTGCA GCTTACAATG GGATACTTCA TCATGCTCGT
TGTCATGACC TATTCAGGAC CACTTTTCAT GTGCGTCGTT CTTGGCCTTA TGTTCGGTCA
TATTGCCTTC AACGCAAAAG ACGTTTTGAA AGCCAAAAAG CAAGAAACTA AAGAAGGTCA
AGAAGAGTGT TGCGGAAAAG AAGATGAAAG CGAAATTACT GACGAAGAAT TGTCGCCTTG
CTGCCAGGGT GGCATGGAGA AGAACAATGT TACAAAGGTT TCCGCGAATG TTCCGGAAGG
CAGTACACCG TGCTGTCAGA ATTTATTGTA G
 
Protein sequence
MTNRLCTVLL AAGVFSESHV TFASQNTASV SDFHSVPAFT VEELSFGHRT NDLAEALRST 
GMIAVKLDHH VKHNDFDIAE SRRIALGGLC GCASSDASSL FQSVSGADTS LLSDGLTTRS
TFATATVGGT PLQLPVEGLE KACGRETTAA MESLRDNVAV ASRSFVSAVD RLRRDSSALP
NSLAPILQNI QGGTYSSLAS IVKASTHLEH FHLYSKQQRQ APSGMSQASD MEEVLQVHTD
AGLFLSFVPG LACGNSHTSE SEHFSVLIHG KLHRAVFPPD SVGIMLGAGA QHWLKNVDSL
ELHATHHAVQ MPSGSARAWY GMMHLVPQNA IIQEDPEPKT FADMRKAMVL SGSKTHKYGG
GSLQTDEALS IGCGAGSQKI NDHNFGSFGV SASRRHRRRL QMVDDASVCN NSTNFYCWMS
CLDIPEAQNA QGYLNEGYSL YCLDPATLSA SGNRASKAME PCIEDGVVGL AMNENCMGSW
QPTAPGVVAQ ELALNFTTSE EEPFCYGGTS MYMDGFHWLD STCVIYLFPE WVLSTPGKLV
AACVGSIFFG MSLEGVIRGR RDLVQSIAVG WKRLFISSGI YGLQLTMGYF IMLVVMTYSG
PLFMCVVLGL MFGHIAFNAK DVLKAKKQET KEGQEECCGK EDESEITDEE LSPCCQGGME
KNNVTKVSAN VPEGSTPCCQ NLL