Gene PHATRDRAFT_31400 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31400 
Symbol 
ID7196987 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp388 
End bp3359 
Gene Length2972 bp 
Protein Length787 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176501 
Protein GI219109493 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAG AGGAGAAGAT TCTTGAGGTG GCCAAGAAGC TTATGCCTTC TAAGCCAACA 
ACCCTCAAGG ATATGAGCAA GTGGCGTTCC TTCTTTGAAA ACTGGAACTT GTACATGAGT
CAGTGTCGCG GTGCTGCGGC TATCCCTCTT TCGTACGTTT ACCGTACCAA CGAGCAGCCC
GAGACCGCTT TGGTCGGAAC CTATGTGAAT ATGGATGCCT ATTTGGTTGC CCAGACAGTA
CTGTCTGGTT CCAACTTTGA GATCGATAAT CAATGGGTTT TTGACGAATT CAAGGAAGCA
ATCACTACAA CCGGACCTGG TTGGTCTTTC ATCAAGACGT ACAACCGAAG CAAGGACGGT
CGTGCTGCCA TTTTGAAATT AAAGGAACAG GCGGAAGGAA CATTAAACGA GTCCGTTCGC
CGTGATGATG CCATCAAGAT CCTGTCAACT ACGACATACA ATGGTCCGAG TTGTAACTGG
AATATTGATA TGCTGTTGCA GAAATTTCAG TATGCCATCT CGGAATTGGT CGAAATTGAC
GGAGTCGCGT TGCCGGATGG GCAGCTTGTG ACTTATTTGG TCCAGGCATT GAAGGACCCA
AGTCTGAGTT ATGTTCGTGA CACAATTCGC ACCAATGCCA CTTATCGGAA CAGTTTTCCG
GAAGCGCAGC TTTTTGTGAA GACTTTTGTG TCTTCGTCCA CGAGCAAATC CGAAAACACG
CCTCGACAGG TCAATGATGT GCAAACATCA GGTAGTGGGG CCTCCGGTGG GAGTAAGAAA
GGAGGTACCG GGAAAGGAGC CAGCAAGCAG ACTCCCTTCA AGGGTGCAGT CACGGCTCGC
AGTTATACTC CGGGAGAATG GAAAAGATTG TCCAAGGACC AACAGGAAAA AGTGCGATCG
CTGCGTAATA AAAAGAAGCA AGGAGGGAAA CCCGAGGAAT CAGAGAGGAG TGTTGACAGT
GTAGCACGGG ATGAGCCTGT GGACACTAAG GAAGTCCATA CCAGCAGTGA TATGGAACCG
ACTTCAGATG CGGCTGGCCT GCAATTTGGC CGTGGTGCGT ATAAGAAATC GGTCGGATTC
ACTGCGGACA CCGCTTCTCC TTCAGAAAAC GGAACGAAGA AGCAGAAAAC GCATCACGAT
GCGTGAAACG CGGCACCCAA TGCCAGTGTT TCGGGGACTA AGCAATGCAT TTTACCAGAT
CGAGTGATAT TGAGCCTCAC CTCTACACGC AGCATTTGTG ATCTCAACGC ATGCACTCAT
CTTGGTGAGG GCCGCTGCGA GTTGGATTCA CATGCAGACA CATGCGTGGC TGGGGCAAAC
ACTGTCTTGA TTGGTGAATC GCAGAAGTCC GTAACTGTGC GACCTTTCTC CGGTGAATAT
TCTGCACTGA AGAATATCCC CATTGGAACG GTTGCCACAG CTTACACAGT ACCAGAAGAC
GGGAGAGTGG TGCTTCTTAT TATTAATCAG GCCCTATTCT TTGGGGACAG ATTGAAAAAC
ACCCTATTGA CCCCCAACCA GATGCGAGAC TTTGGCATTG AAGTTGACGA TGCCCCTCGG
CAGTACGTCG CCAACTCCAA GCACTCTTTG TATGTTCCTG ACTCCCAACT TCGGATTCCG
CTGCAGCTGC GCGGTATATT CTCGTTTTTG GAGTCGCGGA AGCCCACGCA ACAGGAACTT
GACGAGTGTG AGCATATCAT ACTCACCTCT GATGTGCCGT GGGAGCCTTG CTCAACGGAC
TTTGCCCGTC GAGAAGAAGA GGCCGCTAAG AGAGACCGGA GCGTATCATT GGTAGACACA
ACGGGACTTT CCACTGGCCA CGCAATCCTA TCAGCACACC CATATGGTAT ACGAACTGTT
GCGGCTTCGC AGCAAATACT TGAGACTTTT CGTTCCTTGA CAGAGGTTGA ATTGTGCGAG
ACCAATCTGG CGGACCGCCT TATTGCCTGT GTTAATGTTG CGTCGGATGA TTACTGTGGA
GACGGGTTGG ACGGTAGAGC TGACTTGGAT GTGTACCCGG ACTCAGAAGA CTTCACTCGT
GTCGTCTCAG GTATGACATC AAGCGAAAGA CGGTCAGCGT TGACAGCTGA GGTTTTGTCG
AAGCGTTGGA ATATTGGCCT GGATTCGGCC AAGCGGACTC TGCAAGTAAC AACGCAGAAA
GGTGTGAGAA CGGTGATGCA TCCCTTGACC CGACGGTATC GTACTCGCCA ATCGCATTTA
CGATTTCCTA CCATTCGGAC CAAGGTTTAC ACCGACACCA TGTTTTCGTC CGTGATTTCC
ATCCGTCAGT ACAAGTGTGC CCAGGTTTTC ATAACCAACA CGGCCTATTC GCGTATTTAC
CCTCTGCAGA CCAAGCAGCA AGCTCCTGAT GCACTAATGA AGTGGATACA TGATGTTGGG
GTAATGAGTG ACCTAGTTTA TGATGGGTCT AAGGAGCAGG GAGGTGGCAA ACATTGGAGA
GAGATTGAGC AGCGTCACCA TATACATCGC CATGTAACGG AGCCACACAG CCAGTGGCAG
AATCGAGCTG AAGGAGAAAT TCGTGAAATT AAGAAGGCTG TTCGGCACCG ACTGCAGGTT
TCTCGTGCAC CACGGCGCCT ATGGTGTTTT TGTTGTGAAT GGGTGTCGGC TATCCGTCGA
TTAACTGCTC ATGACATTCC TGCGCTAAAC GGTCGAGTTG CCACGGAGCT TTTGGAAGGG
GACACCCCCG ATATTTCTGA GTACGCGCAA TTTGACTGGT ATGAGCCTGT CTGGTTCATC
GACCCAACTT CTGCTTTCCC TGAAATGAAG AAGAAATTGG GCCGATGGGT CGGAGTTGCA
TCAGATGTGG GACAGGCGAT GACTTTTTGG ATTCTTCCAA AGTCATGCAT CCCAATTGCA
CGTTCCTCTG TTGCTTGCGT CTTTCCAGAC GTAGCCGCTA CCGATGAATT TAAGGCTGAC
CTTGCTGAAC TTGATCTAGC CATCGAAAAT AG
 
Protein sequence
MDEEEKILEV AKKLMPSKPT TLKDMSKWRS FFENWNLYMS QCRGAAAIPL SYVYRTNEQP 
ETALVGTYVN MDAYLVAQTV LSGSNFEIDN QWVFDEFKEA ITTTGPGWSF IKTYNRSKDG
RAAILKLKEQ AEGTLNESVR RDDAIKILST TTYNGPSCNW NIDMLLQKFQ YAISELVEID
GVALPDGQLV TYLVQALKDP SLSYVRDTIR TNATYRNSFP EAQLFVKTFV SSSTSKSENT
PRQVNDVQTS GSGASGGSKK GGTGKGASKQ TPFKGAVTAR SYTPGEWKRL SKDQQEKVRS
LRNKKKQGGK PEESERSVDS VARDEPVDTK EVHTSSDMEP TSDAAGLQFG RDRVILSLTS
TRSICDLNAC THLGEGRCEL DSHADTCVAG ANTVLIGESQ KSVTVRPFSG EYSALKNIPI
GTVATAYTVP EDGRVVLLII NQALFFGDRL KNTLLTPNQM RDFGIEVDDA PRQYVANSKH
SLYVPDSQLR IPLQLRGIFS FLESRKPTQQ ELDECEHIIL TSDVPWEPCS TDFARREEEA
AKRDRSVSLV DTTGLSTGHA ILSAHPYGIR TVAASQQILE TFRSLTEVEL CETNLADRLI
ACVNVASDDY CGDGLDGRAD LDVYPDSEDF TRVVSVYDGS KEQGGGKHWR EIEQRHHIHR
HVTEPHSQWQ NRAEGEIREI KKAVRHRLQV SRAPRRLWCF CCEWVSAIRR LTAHDIPALN
GRVATELLEG DTPDISEYAQ FDWYEPVWFI DPTSAFPEMK KKLGRWVGVA SDVGQAMTFW
ILPNHRK