Gene PHATRDRAFT_55073 details

Gene Information

Locus tag: PHATRDRAFT_55073
Symbol:
ID: 7198275
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011691
Strand:
Start bp: 190964
End bp: 193891
Gene Length: 2928 bp
Protein Length: 934 aa
Translation table:
GC content: 58%
IMG OID:
Product: predicted protein
Protein accession: XP_002184417
Protein GI: 219128432
COG category:
COG ID:
TIGRFAM ID:
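
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence consists of one codon per residue plus a stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 190964
end_bp = 193891
gene_length_bp = 2928
protein_length_aa = 934

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: coding sequence = one codon per residue plus one stop codon.
cds_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not coding; a nonzero value is
# consistent with the record marking the gene as spliced.
non_coding_bp = span_bp - cds_bp

print(span_bp, cds_bp, non_coding_bp)  # 2928 2805 123
```

Under these assumptions, the span matches the listed gene length, and the 123 bp difference matches the listed "Is gene spliced: Yes".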


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGTTGG AAGAGGAGGA CGAGGAGGTC CAGGCGCGAC CGACGAACGT GGACCCGCCG 
TCGACCCGTA CGGTCCCCCG AAACACCACC GTCCCGACTT ACGCGACGCC GGGAAAACCC
AGTACTGCTG TTCCCACTCC CGCCACTCAC ACGGCCCCAC CACCACCACC CTTCGTGACT
CCCGCCGCAC GCGGCGTCGC GTCGGACCTT CACGAGACGA CACCCTTGCT GACCGTCGTC
CCACCGATCG ACCCGCCCTT GCTCGATCCG AGACTGCCGG AATCCTCCCG GGACGGGGCC
AATCCGGAAA CCGACGCAAG TGTTGATAGT GACACGTCGG CGCCTAAGCA AGTCCGTTTC
GATGCGCACG TTTCGCCTGC ACCAGCCGAA GCGGCGCGTA CCGGATTGGT CTCGCGACGG
GGAGGTCGCA TGGCGTCGCC CGCGAGACGA CGGCCCATTC CACCCTCGTC CACTGGTGCC
GACGCGGCGG GAGCTGCCGG CTCTCCGTCG ACCGTGTCGT CGCCCCTCGC CGCCACAGTC
CTAGAGTATC AAACCTTTCG CAGCCCCTCC CGCAAAAATG ACGAAGTCCC GTCACTGTGC
TCCATCCATA GTGGTGGGGG ACCGTTGGGC AACCGCACGG AATCCTTTCC CAATCACCAA
GACAAGGCAC CGGCAGATTC CATTTCTTGT GATCCTTCGG AAGAAAATAC TACCAGCCCC
TTGGCAACCG ACTCGCGGAA AGACAGTGTC TCGTTTTCGC CCAAAATGAA CGACGAAACG
GTACGTAGTC TTGCCGACGA CAATGACGAC GGCAGCTTTC GGAACCTGGC GAGAATGTTC
TCACCCACCA CGTGTGTTTT TTTCTATTTC CTCCACGGCC CGTCCCACTC TGCATACTTC
TAGACTGCCG CGGATCACCG CTCGACCACG CCCACCGATT TCAGCGTACA GCGACTGCCC
AAGTTGGATA CCGACGCTTC GCAGCCCACC AAGACCCCAA GTGTCTTTCT GTCACCCAGC
CCGTCCTCGC GGCCGTTACC CGAAGCTTCC TCATTCGAAG AGAACAAGGG CAGCCACGCA
GCTGTCGAAG GAGGTGGTGA CGCCAAGCCG TCACCGCGTC ACGAACGCAA TTTGGCTACC
CCCACCGACT TTGCCTGGGA TTACGGTCGG GGACCACCAA CGTCGACCGG CTCCTTCGAC
CACTCCAACG TGCTAGCGTG GTTGCAGTCT CCCACCGCCA ATGGATTCTT TTCACCGGGC
GGATACGGCT CAATTGTAAA TACGCCACAT ACCGGAATCC CGCGGACTCC CGGGACGCCC
ACCGTCAGTA CGAGCTTTTT CTTTACGGAC GTCGCTACCC TGCCACGAGG CAACGATCTG
ACGCCCCGTA ACGGTGATGG CGGCGAAACA CCCGTGAGGG ACGGTTCACG GCGCTCGGCC
TCCCACGGTA TTTCGAGCAT TATCTGTATT TCCCCGTTGG CCTCAGCCAA GGTACGGGGA
TCAACCACGA GCACGCCCAT GAATTTGAAA GATGTCTTTG CCTCACCCCG AGAAAAGTCC
AGAATGCGGG GACTGCCGTT ACTGAACGAT ACCCCGGCGA AGCGGCAACA GTTACGCGTT
CCCCGGAGAA GCTCCAGTAA GGATCCCAGC GTGGATGCGG TACATTTGGC GGAACGTGAT
TTGATGGAAG ACGAAGATTT GAGTGTGCTG TTGCAACTGG CGTCCAACAC GCCACAACAC
CGAGAACCAG ACGGTGCGCA CGGGAATGGA GTCGATCCGG TCTTTCGGTC GCCTGACTAC
CGCAAGAATA TTGGGGACGA CGAAAGTGGT GAGAACCTTC CAACACTACA GCTGCCAATG
ATTGGTAACG GGCGGCACGA CGAGTTACTG TCAAGCAGAC TGGCACAAAA GTCGCGATCT
AGAGACCTCG GTGACGCCGA CGACTTTGCT CCTCCTCCTC ATCTCGGAAT GCGGTCAACC
TCTTCCAGCG GATCAAAAGA GGCTTATGCA AAAGTGTTGA GTCTACCCAA TAGGAGCGGA
AGCGGCAAGG TCAACAAAAG TGAAGGTCAA GCCGCCTCGA ATCAATACCC GACGCATCCA
TCGTACCCTC AACCGATTTC CAGCCAGGAT CATTCTTCCT ATTACTCAAT GCCCCACGGA
GTCCCCTCAG GTCCTTCGGG TAGTATGCGT ATCAGCATGG GTGGACCTCC ACCTACAGCA
GCTGGAAAAG GATCACCGTC TCGTCCACAT GGAGGAAGGT CGCCTCACGA TGCGCCGCAT
CCGTATCACG ACTACTCGAC ACACAATGGA ATGCAATACC CTTCTCAACA TCCCATGTAT
TCGCAATACG CGCCGTACGG TACATATCCT TATGCATATC CGCCTCTGAG GCAAATGCCA
ATGTACAGTG CGCAGCATCC GTCAGCGGGA CCCACAACGC CGCTCAAGAA GAGTGTGGTC
AAAATGAAAT CTGGCACCAA GAGACCTTTG ACGGAGAAGC TAGCACTGGG CTCAGCAAAG
AAACAAAGAA AATCTCCGAG CGCCAGCGCA AAGAAGAAAA ACAAGTCGCC ACAAATTACC
GACAAGGCGG AGCTGCAAAA AGCTGCTGAC GCCATTCGAG CGGTGAATGC AGCGAGCGGA
GGAAAGAACG ACAAGGCGGC GGCCTTGGCA GCAGCTATTT TGCGTGGTGT CACAATGCGA
CCTTCAGGAA AATGGCAAGC GCAACTGTAC TTTTCCGGCA AGTCGCGGTA TATCGGGGTG
TTTGATACAC GAGAAAAAGC GGCGTTGGCG TACGAGATTG CCCGAGAGAA ACTCAAGGCG
GGAGGCGGTG AAGGGGCCGG TAGTCAGAGT CCGAAAACAA CCGAAAACTT GGTGAACACA
GCTCGAAAGG CAGCCTTTGA TGGTGTCAAT GAAAAGCTTG CCAAGTAG
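
The gene sequence is printed in space-separated blocks of 10 bases, so the blocks have to be joined before any per-base statistic such as GC content can be computed. The following is a minimal sketch. The helper name `gc_fraction` is hypothetical, and the record does not state how its own GC content figure was calculated.

```python
def gc_fraction(blocked_sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace between blocks."""
    # Join the 10-base blocks (and any line breaks) into one uppercase string.
    bases = "".join(blocked_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First two blocks of the gene sequence above, used only as a short demo.
demo = "ATGGTGTTGG AAGAGGAGGA"
print(gc_fraction(demo))  # 0.5
```

Passing the full gene sequence, line breaks included, to the same function gives a value that can be compared with the GC content field in the Gene Information section.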
 
Protein sequence
MVLEEEDEEV QARPTNVDPP STRTVPRNTT VPTYATPGKP STAVPTPATH TAPPPPPFVT 
PAARGVASDL HETTPLLTVV PPIDPPLLDP RLPESSRDGA NPETDASVDS DTSAPKQVRF
DAHVSPAPAE AARTGLVSRR GGRMASPARR RPIPPSSTGA DAAGAAGSPS TVSSPLAATV
LEYQTFRSPS RKNDEVPSLC SIHSGGGPLG NRTESFPNHQ DKAPADSISC DPSEENTTSP
LATDSRKDSV SFSPKMNDET TAADHRSTTP TDFSVQRLPK LDTDASQPTK TPSVFLSPSP
SSRPLPEASS FEENKGSHAA VEGGGDAKPS PRHERNLATP TDFAWDYGRG PPTSTGSFDH
SNVLAWLQSP TANGFFSPGG YGSIVNTPHT GIPRTPGTPT VSTSFFFTDV ATLPRGNDLT
PRNGDGGETP VRDGSRRSAS HGISSIICIS PLASAKVRGS TTSTPMNLKD VFASPREKSR
MRGLPLLNDT PAKRQQLRVP RRSSSKDPSV DAVHLAERDL MEDEDLSVLL QLASNTPQHR
EPDGAHGNGV DPVFRSPDYR KNIGDDESGE NLPTLQLPMI GNGRHDELLS SRLAQKSRSR
DLGDADDFAP PPHLGMRSTS SSGSKEAYAK VLSLPNRSGS GKVNKSEGQA ASNQYPTHPS
YPQPISSQDH SSYYSMPHGV PSGPSGSMRI SMGGPPPTAA GKGSPSRPHG GRSPHDAPHP
YHDYSTHNGM QYPSQHPMYS QYAPYGTYPY AYPPLRQMPM YSAQHPSAGP TTPLKKSVVK
MKSGTKRPLT EKLALGSAKK QRKSPSASAK KKNKSPQITD KAELQKAADA IRAVNAASGG
KNDKAAALAA AILRGVTMRP SGKWQAQLYF SGKSRYIGVF DTREKAALAY EIAREKLKAG
GGEGAGSQSP KTTENLVNTA RKAAFDGVNE KLAK