Gene PHATRDRAFT_31973 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31973 
Symbol 
ID7196453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp1378558 
End bp1380522 
Gene Length1965 bp 
Protein Length654 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002177281 
Protein GI219111059 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTCCGG ATGAGAACTA TAAACCAAAC TTGGTGGAAG AGGAAAAGAC GTCGATTATG 
CAGGTCATCG CGCCGGACCC AAACTATCAG AAAAGGCTGC AAGAAGAACA AAATTCGAAG
AGAATTCCTC CTTCCGAAGC TTTTCAACAA CGTGTGCAAT GCATACATTG TCATAAGTAC
ACACCCGGAA TGGTAGTCCA GCAGCCTGCA AAGGCTTCAT CGGAGGCTGC GAGCACCCAG
TCGAACGCTC GCCGTCTTAC ACCGACAACA CCTGGCTTTC ATACTCCTTC TCCTACGACG
CCACTCACTC AATCATCCGA TCATGTTAAT GGTTCGGAAG TTCATTTAGT GGAGCATCGT
GTCCTGCTGG CCGTCCGAGA TACAGCGTTT GATTTAGCGT CACCGGAGTT TGCCAAGAAA
TCATCCAAAG ATGTCTTTCG CTGGCTCGCT AGTGCACAAC CCATCAACGA CAAGCTTCAA
AAGCGATTGG AACGCCGCAT TAGTGCAGAC CCATCGATAT GTAAAGCACG TTCTCATGGC
ATGGAACCCC TCTGTCCCGA TGGACTTACT CCGTTTTTGC TGGCGGCCCA TTCCAACCAA
GTTGCGGCAG CCAAAATTTT ACTCCTATTG GGACCCCGAA CCGAGCAATT GCAGACCGTT
AATTTACAAG GCAAGTCAGC ATATCACCTT GCCGCAACTC GCGGCAATCT CGAATTTCTT
GATTACATCA AGACCGTATA CGAAGATCCG CAAAATGGGA CTCTCTTCTC TTCCCCGACA
CCCGTTGACT TGCTGGGACG TACACCGCTC GGGGCCGCCT TAACGAGTCC TGAACCCCAG
GCCAAGCGAA ATAAACAAAC AATGATGGAC AGGTTGTTCT CACCCCAAGA CATGTCGATT
TTGGGTAGTC CCGCCCCTGT AACGCAGAGA ACAGCGACAC TTTCTGAACT ACAGCTTGCG
TACGGATCTT CCCACATGCC TGGCAAGCGT ATTATGAATG AAGACGCAAT TCTGACAACC
AAGATCCTTC TCAGCGACGA TTCTACTGTA GGAGTCTTCG GCGTCTTTGA TGGCCATAGC
GATGCTGGAA AGGTTTCGAA CTTTATTGCG TCTCAGATTC CACACGCTCT ACGCGATGCG
ATGCAACAAG CAGGCGACTG GAACAGCTGG TGTCGACATG CATGTCTCGA AATTGATGCG
AATTTGAAAA AAAGCAACAT TGCAGGTGGT TCAACCGCCG TCTTTGCCAT AATTACGCTG
GATCAAATTG TTGTTGCCAA CGTGGGTGAC AGTCGCTGTA TTCTAGTACA ACACGATTCA
GTTAGTGTGA GCAATGTCGC GGAGGGTGTG GAAAGGCTAT CCATTTCTGA AACGGTATAC
CCGCAAACAG GGACCGAAAC GAATACCTTA AGTGGCGCGT TTTTAGTAAA GGCGTTGTCA
GAGGATCACA AACCCGAGGC TTCGGCGGAA CATGCCCGTA TTCAAGCCGC AGGCATGACC
ATCACGGAGG AACGGTTCGA AGAAGATGGC GAAGAAGTTG TCATTCACAA GGTCCGGTTG
TCGGACGGCA ATCGCATGGC ATGTTCCCGC TCCTTTGGAG ATTTTGAATA CAAAGCCAAC
GAGACTTTAG AGGCCGAATC GCAAGCTATA GTTGCTGTTC CTGACGTTGT CGTTCACGAA
CGCAGTCATG CTGACTGCTA TCTTGTTTTG GCGTGTGACG GCATTTGGGA TGTCATGAGT
AGCGATGAAG TGGGACAATT TGTAGTGGAG CACATCAAAT CATGTGGCGA AACAGAAGGC
GTTTTGCCCG AGGTTGGTGA CCGGCTGCTG GCGGAATGTT TGCAGCGCGG CTCTGGAGAT
AACCTCAGTG CCGTCGTGGT AGCCCTCTCA AACTCAGCCG AGCATTTGTC TTCTGGACAA
GTGTTGAAAG GCAAGGCACT GGATTTTTCG GGAACGCCAC CGTGA
 
Protein sequence
MAPDENYKPN LVEEEKTSIM QVIAPDPNYQ KRLQEEQNSK RIPPSEAFQQ RVQCIHCHKY 
TPGMVVQQPA KASSEAASTQ SNARRLTPTT PGFHTPSPTT PLTQSSDHVN GSEVHLVEHR
VLLAVRDTAF DLASPEFAKK SSKDVFRWLA SAQPINDKLQ KRLERRISAD PSICKARSHG
MEPLCPDGLT PFLLAAHSNQ VAAAKILLLL GPRTEQLQTV NLQGKSAYHL AATRGNLEFL
DYIKTVYEDP QNGTLFSSPT PVDLLGRTPL GAALTSPEPQ AKRNKQTMMD RLFSPQDMSI
LGSPAPVTQR TATLSELQLA YGSSHMPGKR IMNEDAILTT KILLSDDSTV GVFGVFDGHS
DAGKVSNFIA SQIPHALRDA MQQAGDWNSW CRHACLEIDA NLKKSNIAGG STAVFAIITL
DQIVVANVGD SRCILVQHDS VSVSNVAEGV ERLSISETVY PQTGTETNTL SGAFLVKALS
EDHKPEASAE HARIQAAGMT ITEERFEEDG EEVVIHKVRL SDGNRMACSR SFGDFEYKAN
ETLEAESQAI VAVPDVVVHE RSHADCYLVL ACDGIWDVMS SDEVGQFVVE HIKSCGETEG
VLPEVGDRLL AECLQRGSGD NLSAVVVALS NSAEHLSSGQ VLKGKALDFS GTPP