Gene PHATRDRAFT_31544 details

Gene Information

Locus tag: PHATRDRAFT_31544
Symbol:
ID: 7196082
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 326727
End bp: 330132
Gene Length: 3406 bp
Protein Length: 1033 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002176564
Protein GI: 219109619
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCGTTTC ACACCAGGAT TGTCACGTCC ACCACCAATA GAAGGCTCCA TCTGCTAGGG 
CTAAGGCGTG GTGAGGAATT ACTTGGTATG GACAGTGGGT GGGAAAATAC GACAAAGCTT
TTTCGGCATT TGCAATGTCG GATTCTTTCT TCACAAAGGA CTTCAAGCCA ACCGAATCCC
CGATACGATC CCACTGCGAA GCGACCAAAC AAGGTTTGCG ATCCATACGG CCAAGGCGGA
AAAGCCATGC CTCTGTCAGA TATTCAGGCT TTTCAGGCGA CCATAGACGA TCAATGGAAA
GTGACGGAAG ACGGGACGGC TCTTGTCCGG GACTTTGTAC ACGCAGACTT TCTGACAGGC
GCTCGTTTTG TGCAAAAGAT TGCCGCCGTC TCTCAAATGA ACAGCCACTT TCCAATAATC
GGACTACAAA GGTGTATTGT CAAAAAAAAT TGGCAAGTGG TCACTCGTAT TGAATGTAGT
ACAATGGTTT TGGGGGGCCT GTCGGCTCAC GATTTCCATT TAGCTATGGT ACGTGCACAT
CACCTCTCGT GGTGTATCGC GAGATGCAAA TATGAACGGT AACGAATCTG ACATTTTGAG
ATTGTTTATG CTACTAGCTG ATTGATGTTG AAACCGCGCG ACCGGAGGTG CATGCTCTAC
TGGCGGCTGC GGACGACAAG TAATGAAGAG AGGACATTGT AGCATCTTAA AAGGTATCTA
TATGGACGTT GTGCTCTGCC AAAACACCTA TCTGACAAAG GTAGCGCTCT TGCTTTATAC
GAAAACAGCA CGTGTTACTG TTGGAAAATG TAAAGATAGC CTGACATATG TTCCCCAATT
GTGACATAGC TTTCGTGAAT GAATATCAGA AGGCTTTTAA GATCGATCGT CGACGTGTCT
CGGGCCGCTG GTTTTCCGTG TCAAATTTAT GTCAGTCAAC TCGAATCTCG TCTCCAACAT
CACGTGCTAA CAGCGGATAC AATTGCTTTT GTAGCGTCGA ACATCAAGCG CTGCAATATA
AACGCAAACA ATTCGTTTGA TGGCGAGGAT GAAGACGAAG TGTGTGGTAT GGGTATCTAA
TGCCATTGTA GCCGCGTTGG CATTTCAGCC GTCCTATCAG AACCTTCCCC TTAGACCTGC
GCAGCACCTG CGACTGACTA CATCCGGATC TAGCGATGAT GGTTTCAGCC AAGAAAACTC
GGAGACCGGT ATTTCTCTTT CTCTCCGAAA CACGGGTACG AATGAATATA CCAACCCCGT
GAAGCAGCGA ATGAGCTATC CTTCATCGAA ACGCCAAAGG AAACAACACC GGGACATTGT
GATCGTCGGC GGAGGTCTGG CAGGCTTGTC AGCAGCCCTG TACGTCTCGC AGATTGACCC
GACACGACAT GTAACTATTC TTGATAAGCA GGATCCGGAG TCACACCGTT CTAAAGATTC
TACTGTTGCT AGCTACGCAG CGGCGGGCAT GTTGGCCCCG AATTCAGAAC GCTTACCAAA
AGGTGATTTA CTTAATCTAT GTTTGGAAAG CAGAAACATG TATGGAGACT TTTGCGACAT
GGTCGAGTCT TTGGCTCAAG AATCTGGAGA AGAAGGAATG AAATATTTGG CAACATCCTC
ACAGAGCGCT GACGGATTAG AACCCTGGAG TATTGGATAC GTTGCTTCAG GTGGTTTCTT
GGCTCCAGCA TTTGCAGGTG ATTCAGTCGC CACGTGGGCC CCACCAGATG ACGGTGGGGC
AGCAACATGG CTGGACGCTA CCCAAGCACG AGAGCTAGAA CCTAACCTGC ACCCAGACGT
TGTCGGCGCC TACTGGTTTC CCGAAGACGC TAGTGTCGAT GCTCGGCGAT TGACGAATTC
TTTGCGGGCT GCCTGTGTAG CGGCAGGAGT CCAGATTCTG CACGGACCCT CCAACGAAGT
CACATCACTG GATCTCTCAG AAGGGATCTG CAAAGGTGTC CGGTTGCAAA GTGGACGTTA
TCTGAGTTGC AATTCGATTC TGGTCGCCAA TGGTGCATGG ATGCGCAATC TTTTACCGGT
TCCTATTGAG CCACACAAAG GCCAATCCTT GTCGCTTCGC ATGCCGAAAG ATCGTCCGCC
AATTCTTAAG CGCGTTCTCT TTGCCCAAGA TTCATATATT GTACCGAAGG CAGACGGTCG
CATTGTTGTC GGTGCGACTG TAGAAGCAGG AAGCTACGAT CCTAACGTGA CGCCTGGCGG
TCTTTTGCAC ATTTTGACAC ACGCATTGCA GCTGGTACCC GCATTGAAAG ACCTTCCCAT
TGAAGAAACA TGGGCGGGAC TTCGTCCAAC CACGCCGGAT AAAGGTCCAA TATTGGGAAA
AACACCGTGG GAAAACCTGT ATTTGGCTGG AGGGTACTGG CGAAATGGTG TCTTGTTGGC
TCCAAAAACT GGAGAACTAC TGGCTGCTCT CATGACCGGA CAAGAAATTG ACGAGCAGGA
TCAGGCGATG TTGGATGCGT TTGCTTGGGA TCGCTTCACG AACAAGGACG GTGGCGATCG
CCTCTCAGCC AACGCAAGGT ACGCCGCCTC GATGCACCCG ATACATAGTC GAAAGTCTGG
TGCTGGCGTC GCAGCCTCGG TTGGAACGGA ACTCGGAACT TACTCAAGCG CTCGTTCGGC
GAAAGAAGAA AGACAACAAG ATCGTAATTC ATTGTGGAAC GAAAATGGAG ACGGAGACGT
TGCTTTTGAG CGGGCAGCAA CAATGGGACG GAACGACGGG GCGGCGTACT CTTTTGGAGA
CGATGAATCT CCGTATGAAC GAAAATCCGT TTCACAGTCA ACAGCACAAA CAACTCCTTC
GTTTGAGGAT CCCAAAAGTT CAAAGAGGTC ACTGAAGGCT TCTGATACTG TGGATGCGTA
TACGGTAGGA GCGTCGGACG AGATTCAGGA CTCTCATTCT GCGGAGACAA AGGCTTCCGA
TTTGACTGAC ATGTATGAAA AAATTAGGGC AAACAAGGCA AAGAAAACTA CGACCTTAGG
TGAAAGCGAT GGCGACGAGG AGGTACGTCC CGATCCTGGC TTTCGAATAT TTTATAAAGA
TCCAGAAACA GGTGAACGGC ACGAAGTCCC TCCGTACACA TCGCCCGGAG TGTTCCAGCA
AAAACTGCAT GCGAGGAAAA AGTCAGAGCG ATCCGCGAAC GGAACCAGGA ATGATGTCCC
AATCAGTGAT GTTGTTGCAC CTTCGCCAGC GGCGAATGGC AACAAGGAAG CGCAGCAATA
CAGCGAAACC ACCTATGACG GTTACCAAGA GATTCAGTCG GCTAACTCAC GACAAACTCG
AGCAGAAGAA TTAGAAGCGA TGCGAATGGC ACGACAGAGT AATCGTGTTG GCCAAGAAAG
CATCAAAGAG TCGGATATTG GCGCCCAACC GATGGGCGAC GAGTAG
 
Protein sequence
MSFHTRIVTS TTNRRLHLLG LRRGEELLGM DSGWENTTKL FRHLQCRILS SQRTSSQPNP 
RYDPTAKRPN KVCDPYGQGG KAMPLSDIQA FQATIDDQWK VTEDGTALVR DFVHADFLTG
ARFVQKIAAV SQMNSHFPII GLQRCIVKKN WQVVTRIECS TMVLGGLSAH DFHLAMLIDV
ETARPEVHAL LAAADDKRLL RSIVDVSRAA GFPCQIYVSQ LESRLQHHVL TADTIAFVAS
NIKRCNINAN NSFDGEDEDE VCAALAFQPS YQNLPLRPAQ HLRLTTSGSS DDGFSQENSE
TGISLSLRNT GTNEYTNPVK QRMSYPSSKR QRKQHRDIVI VGGGLAGLSA ALYVSQIDPT
RHVTILDKQD PESHRSKDST VASYAAAGML APNSERLPKG DLLNLCLESR NMYGDFCDMV
ESLAQESGEE GMKYLATSSQ SADGLEPWSI GYVASGGFLA PAFAGDSVAT WAPPDDGGAA
TWLDATQARE LEPNLHPDVV GAYWFPEDAS VDARRLTNSL RAACVAAGVQ ILHGPSNEVT
SLDLSEGICK GVRLQSGRYL SCNSILVANG AWMRNLLPVP IEPHKGQSLS LRMPKDRPPI
LKRVLFAQDS YIVPKADGRI VVGATVEAGS YDPNVTPGGL LHILTHALQL VPALKDLPIE
ETWAGLRPTT PDKGPILGKT PWENLYLAGG YWRNGVLLAP KTGELLAALM TGQEIDEQDQ
AMLDAFAWDR FTNKDGGDRL SANARYAASM HPIHSRKSGA GVAASVGTEL GTYSSARSAK
EERQQDRNSL WNENGDGDVA FERAATMGRN DGAAYSFGDD ESPYERKSVS QSTAQTTPSF
EDPKSSKRSL KASDTVDAYT VGASDEIQDS HSAETKASDL TDMYEKIRAN KAKKTTTLGE
SDGDEEVRPD PGFRIFYKDP ETGERHEVPP YTSPGVFQQK LHARKKSERS ANGTRNDVPI
SDVVAPSPAA NGNKEAQQYS ETTYDGYQEI QSANSRQTRA EELEAMRMAR QSNRVGQESI
KESDIGAQPM GDE