Gene PHATRDRAFT_47949 details


Gene Information

Locus tag: PHATRDRAFT_47949
Symbol:
ID: 7203198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011683
Strand:
Start bp: 529665
End bp: 531300
Gene Length: 1636 bp
Protein Length: 523 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002182410
Protein GI: 219124227
COG category:
COG ID:
TIGRFAM ID:
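
The Start bp, End bp, and Gene Length fields agree if the coordinates are read as 1-based and inclusive. The following is a minimal sketch of that arithmetic under that assumption; the function name `span_length` is hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record (assumed inclusive).
gene_length = span_length(529665, 531300)
print(gene_length)  # 1636

# A 523 aa protein needs 523 codons plus one stop codon (assumed to be
# included in the gene span).
coding_length = 523 * 3 + 3
print(coding_length)                # 1572
print(gene_length - coding_length)  # 64
```

Under these assumptions, 64 bp of the listed gene span lie outside the coding region.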


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCGGAAAGC CATCGTCGTC AGGATGGACT TTGCTCGATG GATGTCCTTC CTCGGTGGTT 
GGGACTGCGT CGAGGTAGAA GAGAAAGATT CCAGTAAAAC CTGGACCACG CGCTTGCAAG
ACGACCTTGA GGCTGTATGT TTCCCCCACA AAAATTTGGA CTCTCCCGAA CGTATAAAAA
TTCCAAACCG AACTGTGTCG GAAGAGGAGT CTCCTCGTTT GCGGCCGGCA GAGGTCCGGA
TACATTCATC GGAAATAGTG AACGAGGAAT GTCCACGCTA CGGGGACAGT GTACGAAAGC
CGGAGACGCA CGTACTCACG GTGGACCCGA AAGTGGCAAC TCACACCAGT TCTCAACCGC
AAAGAACCTT TCCCTCCGCG ACACTGCCAA CGCCACCCCA AAACAGGCAA AATATTCCGG
ATAACACCGC CAACAACCTG CCTTTGCGCA AAGAGGACCT TGCGACTCCT CCCAGACCAC
CTGTATTTTC TCGCGCACAT AGTTTCCAAT CGCACACAAA ACACCAAACG AGAGAACGAC
TCGGAAATCG GGATAGAATT CTTTCAGAGA TGCCGGCCGT CAAGAATTTC GACGACGATA
TAAACAGGCT TGTCGCTACG AGACATGGCA GCAAATATGG CTCTGGTGTC GAGGCAGCCA
ACCTTTCGGA AGATTCTTAC GTGGCCACGG AAACGAGTAT GGACTCGGGA GATCTCCGCA
CCCTGGAGAG GGGCGACCGT AGTCCGGATC TTTGCGTTGG TGTAATGACA GGTTCGCCAA
ATGTCGCATC AGGTACAGAG CAACAAGATC GATTCAGCGG TGATGCTTTC GAGGGCGATT
CCCGATCTAC CGACAGAGCT TTGCCCTTGG GCAAAATAAA CTATACACGA AAAGAAAGGC
ACTTTCGCTG GTTCGCTCAT GGAAAAATGT GGACGTCTAT TGCCGTTCTG CTTTCCATTT
GTGGGTCGTT GATGTCTGTC CTTTCCAGAC GAAGTACGAG GTTTGTTGTA CTAAGCGAGC
CACTCAATAT TGCTCCCGTC TATAACGCTG TGGACAAGAT CGGAATGATC AGAATGGAGC
TCTGCTACAA CACCTCTGTA GTCAGTGAAT CTGGCTGTAC GGTCATTCCC TTGACAACTG
AGGATGTTGA CGATAACATG TTTGAGTTGG CGCGGATCTT TTTGACGCTG TCGGCACTGT
CGGGCGTGTT CTTCACCATC TTCTTGTGCT CCGCTGTTTA CTGGCAATCA ATCAATCTGA
AGCCCATTGG AATTGGCTTC ATCGTAACCT ACTTCTTTCA ATCTTTTTCG ATGATTTTCT
TTGATTCCAT GATATGCTCT GACAAAAACT GTCGAGTAGG GTCTGGTAGT CTCTTAAGCA
TATTTGCCAG TCTCTGCTGG ATTGGAGCTT GCCTTGCGAC GGCAAAAATG GATGCCTTCA
AAATCATTGC TCAACGCAGA CGCCGACGTC ACGCTCGGCG TCTCGCAAAA ACGAGGAAGA
TGGTGAGAAA GGCTTCTTCG GAAACTGTCA AGACTACCTC ATCAAGGGAG AGCGATGGTA
GCAATTCCGT TATCGATCTG GAAGTAAATG GATAGGGGAA AAGTAGCTAC TTAAGTTTTG
ATAGTCGACA CTGTGT
 
Protein sequence
MDFARWMSFL GGWDCVEVEE KDSSKTWTTR LQDDLEAVCF PHKNLDSPER IKIPNRTVSE 
EESPRLRPAE VRIHSSEIVN EECPRYGDSV RKPETHVLTV DPKVATHTSS QPQRTFPSAT
LPTPPQNRQN IPDNTANNLP LRKEDLATPP RPPVFSRAHS FQSHTKHQTR ERLGNRDRIL
SEMPAVKNFD DDINRLVATR HGSKYGSGVE AANLSEDSYV ATETSMDSGD LRTLERGDRS
PDLCVGVMTG SPNVASGTEQ QDRFSGDAFE GDSRSTDRAL PLGKINYTRK ERHFRWFAHG
KMWTSIAVLL SICGSLMSVL SRRSTRFVVL SEPLNIAPVY NAVDKIGMIR MELCYNTSVV
SESGCTVIPL TTEDVDDNMF ELARIFLTLS ALSGVFFTIF LCSAVYWQSI NLKPIGIGFI
VTYFFQSFSM IFFDSMICSD KNCRVGSGSL LSIFASLCWI GACLATAKMD AFKIIAQRRR
RRHARRLAKT RKMVRKASSE TVKTTSSRES DGSNSVIDLE VNG
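
The start of the protein sequence can be reproduced from the gene sequence by translating from its first ATG. The sketch below does this for the first 60 bases only. The record leaves the Translation table field blank, so the standard genetic code is an assumption here, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (an assumption: the record's Translation table
# field is blank). Codons are enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA from its first base, stopping at the first stop codon.

    Whitespace is ignored and a trailing partial codon is dropped.
    Raises KeyError on characters other than A, C, G, T.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed in the record.
first_line = "CCCGGAAAGC CATCGTCGTC AGGATGGACT TTGCTCGATG GATGTCCTTC CTCGGTGGTT"
seq = "".join(first_line.split())

start = seq.find("ATG")        # 0-based offset of the first ATG
print(start)                   # 23
print(translate(seq[start:]))  # MDFARWMSFLGG
```

The output matches the first twelve residues of the protein sequence as listed (MDFARWMSFL GG).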