Gene PHATRDRAFT_42426 details


Gene Information

Locus tag: PHATRDRAFT_42426
Symbol:
ID: 7196637
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 22835
End bp: 24639
Gene Length: 1805 bp
Protein Length: 473 aa
Translation table:
GC content: 49%
IMG OID:
Product: predicted protein
Protein accession: XP_002176505
Protein GI: 219109501
COG category:
COG ID:
TIGRFAM ID:
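
The Gene Length value agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the check, with a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates.

    The inclusive convention is an assumption; the record itself
    does not say how Start bp and End bp are defined.
    """
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(22835, 24639)
print(length)  # 1805, matching the Gene Length field
```

Because the gene is marked as spliced, this span includes intronic sequence, so it is longer than three times the protein length.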


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CATCAACCAT GGACAAGGAA CAGGAACTTG GAGATAGTAG CTTTCATCCA CAACCCTTTC 
CGTCCGTGGA ATACAACACT TGCACTCTAT TTTAACAACG GACACCATGA TGCGCATGTC
CATCATCTTC ATCTTCCTAG CCCTACTGGT GCAGTCAGCG ACGCCCAAAG TGCACCGCAG
CGTTAATCGC AGCATTCAAA TCGTAAACGA GTCCGCCTCG AAAATTGAGA TATTCTGGGT
TCATCCAGAG ACGAGAGAAC CCTCGCTCAT GTCGAATCCG TTCATTGTCC CAGGGGCCGA
CTTTTCGCTG AATAGCTTTG TGGGTCATGA ATTTCTAGTT AAAGAAATGC CGGGCAAGAA
TGGATGCCAA GTAGACTCGT GCAAAACTGA AAACTTCAAG GTTTCGCCCA ATGATGAGCA
AGTGATTCGG GTCAGTCCAG AGATCACGGT AACCTTTGTG GACAACAAAA TCCGAGCTCG
AAAAGAAGCT GACGAGCTCA TCAAAGCCTG TCAGGTGGAC GCTCGCAAGC GCGTAGAGCT
AGCAGGGCAA GACAAGGCTG CCGCTCTGGA CGCTATGGAC GATCTTGTCA ACTGCGTTCA
AGGAGGAGTT TCCTCTCGTC TCGAGACAGT CAACGAAGAA ATTGCCTTCC AAGCTTCGGT
GCGGACGGAC ATTGCCGCTT TGTTGGAAAA TTACACTTGC ACGGACGACT CCCTAAATTC
TTCGAAAGAC ATTACGACTC AGCAGTGGAA GCAAGCTGAC CTCACACGCA CGGTGCATAT
CAAACACGAA CGACCCACCT CGAGAATTCA CGTGATTGAG AATTTTATTT CCGATGATGA
GTGTGACGCT ATGGAAGCTG CCGCGCAAAA ATCTCTACAC CGGGCCACTG TCGCCGATGG
GAAAGGAGGC TCTCGCCTCA GTGACAATCG CAAAGCCATG CAGGCTGGTA TCAAAGTTCC
TTGGAAAGAC GAAGCCTCGG GTAATGCCAT AGCTCGCTTA AGCCGCCGTG TCTATGACTA
CACAAATCAT GTCCTTGGAT TAGGAATCGA GGAGCACGGC CAGGAAGATC TCATGTCGAT
CCAGTATTTT GGAAGGGGCA AGAACGATAC TGAACCTGAT CGCTACACTC CTCACTGTGA
TGGCGACTGC ACTGGCCTCC CTCACAAACA CGGTACACGA ATGGCTACCA TGGTGATGTA
TTGCGATGTG GCGGACCTTG GTGGGCGTAA GTTGCAGTGG TTCCAAAAAG TAGGCGACAC
GTTCTATGAG TGTGTTGATT AACCCTGTGG TTTCTCTTTG TCTGCAGATA CGAATTTTCG
CAACGCCGGT GTCCACGTCA AACCAGAACG AGGCTCGGGC ATCTTTTTTA GTTACATCGA
TCCCGAAAAT CGTGTCATGG ATACTGGATT TACGGAACAT TCAGGTTGCC CGGTGTTCGA
GGGCGAAAAG AAGATCGTTA CGCAGTGGAT TAGGCTAGGT GTTGACACTG AGAATCCTTG
GGACAGTTTT AACACTCTCG GAATCAAGAA GTCCGAAATG GAAGATTTCG AATCGGACGG
CGAGGAAGAA ATTGATGAGA CCGAAGATAC TTCTTCGGAT GAGTTGTGAA GTGCTTTCAT
CTCGCAAAGT CTTTATTGAC ACCTGCACAT TTGCGACCAA GCACAGTTTA CCATATGTCT
GTTACTCCTA CGGTTTTGTT AATGCAAGCG AAATTACGAC TGCACCCTTT TGCAGGGACC
TGCACTTGCC TGGACAGACC ATAACATCAT ATTAGTTTAG CATGAGTCAT CAATCGCGCT
TTTCA
 
Protein sequence
MMRMSIIFIF LALLVQSATP KVHRSVNRSI QIVNESASKI EIFWVHPETR EPSLMSNPFI 
VPGADFSLNS FVGHEFLVKE MPGKNGCQVD SCKTENFKVS PNDEQVIRVS PEITVTFVDN
KIRARKEADE LIKACQVDAR KRVELAGQDK AAALDAMDDL VNCVQGGVSS RLETVNEEIA
FQASVRTDIA ALLENYTCTD DSLNSSKDIT TQQWKQADLT RTVHIKHERP TSRIHVIENF
ISDDECDAME AAAQKSLHRA TVADGKGGSR LSDNRKAMQA GIKVPWKDEA SGNAIARLSR
RVYDYTNHVL GLGIEEHGQE DLMSIQYFGR GKNDTEPDRY TPHCDGDCTG LPHKHGTRMA
TMVMYCDVAD LGGHTNFRNA GVHVKPERGS GIFFSYIDPE NRVMDTGFTE HSGCPVFEGE
KKIVTQWIRL GVDTENPWDS FNTLGIKKSE MEDFESDGEE EIDETEDTSS DEL
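
The sequences above are displayed in space-separated groups of ten characters, so the whitespace has to be stripped before the text is used for length or composition checks. The sketch below shows one way to do that and to compute a GC fraction. The function names are hypothetical, and how the record's own GC content figure was derived is not stated, so this is illustrative only.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence after cleaning."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First displayed line of the gene sequence, copied as shown in the record.
first_line = "CATCAACCAT GGACAAGGAA CAGGAACTTG GAGATAGTAG CTTTCATCCA CAACCCTTTC "
print(len(clean_sequence(first_line)))
print(round(gc_fraction(first_line), 3))
```

Applying `clean_sequence` to the full Gene sequence block and comparing its length with the Gene Length field is a quick way to confirm that nothing was lost in copying.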