Gene PHATRDRAFT_37043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37043 
Symbol 
ID7202213 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp29004 
End bp31002 
Gene Length1999 bp 
Protein Length643 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181288 
Protein GI219121886 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACACTG AAGAGAATGA GGAACCGCTC TGGGACCCCA TTCAACAAAT TTATACCGGA 
GGAGTACTAC CCCAAACAGT CGAAATCCAG GATTTGATTG CGGAGAACAA CGGCACCCTC
CGACTGTTCG GCTACGGGTC TCTGTGTTGG AACCCCGGCA CCGGAGCCCT TGCCGATCCG
TCAGTGCGAT ACGCGCCAGG CCAGGCACGA GGATACCGAC GCTGTTGGGC ACAAAAGTCG
ACCGATCATC GCGGCCTTCC TTGCTTCCCT GGAATTGTGT GTACTTTGCT GAAAGACCAA
GAATTTCGAG AATTTCTCTC GTCTGGCGTA GACGAGGAAA CGTTGACGGA AGGACTAATT
TTCGAAGTCC CTCCGCCTTT AGTTGAAGAA TGCTTAGCGG AGCTAGATTT TCGAGAAAAA
GGTGGCTACG CCAGAGACAT AATAGAAGTT GTCGAAGACA AGAGTGGTAA AGTGGTTCAG
GCTTTACTGT ATAGAGGCAC CCCACAGAAT CCGGCGTTTT GGCCAAGAGC ATTGCGAGAT
CTTCCGTTTG CAGCAGGCGA GTCCCTTCAG AGTGGCACTA CGTGCCAGTC AAAGCATTGA
TACATCTCGC TCACTATCAT AATTGTCTTT TTTTTTAGCA ATCATGGCTA CGGCGATCGG
TCCGAGTGGT GAGAATGAAG TTTATTTGAG CCGTTTGGAC CACTTTCTAG GAAAAGTAGC
TTCAGGTTCC ACACTGAAGA AATACGACGA TACACTGGTG CTTGCATCTA TGACGAAACA
GTTACGCAAT CAAAATTTGC ATTTCATGTT TGGCTCGGGG TCTAATCAGC GAAACCAGCT
TTTACTGCAA ACAGAAAATA ATGCGGCCGC TCTATTTAAC AACGAAGATG CCCATGAAAT
GAAAGAAATC GTTTTGTGCG CTGACCAAAC CGACAAAATC GAAGTGAATG AAAAAGTTAT
TTCTTTATTT GCAGGAGGAG GACACAGTGC GATCCTTATG CAAAGCGGAA GGCTATTCCT
ATTCGGGTCA AACGAACATA GTCAGTTGGG AGCAAGTGGA ATCACACAAT CCTCCTTTCC
GCTCCCTATC CTTACTTGTC TCCGTGATTT GTTTATTTCC TATTGCTCGC TTGGTTTTTC
TCATTCATTG GTGGTCGAAA AAGAGACAGG TCGTGTATAT TCCTTCGGGG ACAACGCAAG
AGGCCAAGCT GACCCTGATA ACTGCGCCTC CACCATTCCA TTGCCAACTG CTCTACCGCT
CAAGGAACAT ATTGTGGCCG TGTTCGCTGG AGTTTTCCAT TCTGCTGCCG TGAGCGAAGA
TGGCGAACTC ATAACCTGGG GCTGTGGTCG ATTTGGACAG TGCCTTCCTG TTGTACGGCA
TAGATTGTAC GGGCACTGGA AACCAGATGA CGGAAGCAGG GTGCTTGGTG TTGCTTGTGG
ACGTCGCCAC ACAGTTACGT TTGACGATCG TGGGCGAGTG TGGAGTTTTG GTGAAAATAA
ATACGGCCAG CTTGGACGCG ATCTTAAAGG TGAAAAATAC AGTAGGGTAC CATCGCTGGT
GGATGGCGAT TGGGGGCTCG ACAGCTTGTC AGTCACCGGA GTGCACTGCG GCTGGTCTCA
CACTATCCTT CAATTGGAAA ATGGCAAGGG AGAACTAATA TTATTTGGCT GGGGAAGGAA
TGACAAAGGC CAGCTTGGGG TTGGCACAAG CAGCATCGTC TTTAATCCTG TACGGTTGTA
TCCTTCGCAT AAAATTAGAC TTGTCGCTTG CGGATCCGAG TCTACTGCAA TCGTCGACAC
TGACGGCGAA ATATGGAGCT GCGGTTGGAA TGAGCACGGA AACCTGGGTT TAGGACATGA
CTTTGATGCA TTCGAACTAA CCAAAATCAA AGGGGCTCCG ATCACTTTGA CTCCAGGCTA
TTCAGAAAAG AGTAGTTTAG GACTCGCCTT GGGTGGAGCT CATATGATTG CTATGCGACT
CGCTAAAAAG ACAAGCTGA
 
Protein sequence
MDTEENEEPL WDPIQQIYTG GVLPQTVEIQ DLIAENNGTL RLFGYGSLCW NPGTGALADP 
SVRYAPGQAR GYRRCWAQKS TDHRGLPCFP GIVCTLLKDQ EFREFLSSGV DEETLTEGLI
FEVPPPLVEE CLAELDFREK GGYARDIIEV VEDKSGKVVQ ALLYRGTPQN PAFWPRALRD
LPFAAGESLQ TIMATAIGPS GENEVYLSRL DHFLGKVASG STLKKYDDTL VLASMTKQLR
NQNLHFMFGS GSNQRNQLLL QTENNAAALF NNEDAHEMKE IVLCADQTDK IEVNEKVISL
FAGGGHSAIL MQSGRLFLFG SNEHSQLGAS GITQSSFPLP ILTCLRDLFI SYCSLGFSHS
LVVEKETGRV YSFGDNARGQ ADPDNCASTI PLPTALPLKE HIVAVFAGVF HSAAVSEDGE
LITWGCGRFG QCLPVVRHRL YGHWKPDDGS RVLGVACGRR HTVTFDDRGR VWSFGENKYG
QLGRDLKGEK YSRVPSLVDG DWGLDSLSVT GVHCGWSHTI LQLENGKGEL ILFGWGRNDK
GQLGVGTSSI VFNPVRLYPS HKIRLVACGS ESTAIVDTDG EIWSCGWNEH GNLGLGHDFD
AFELTKIKGA PITLTPGYSE KSSLGLALGG AHMIAMRLAK KTS