Gene PHATRDRAFT_31650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_31650 
Symbol 
ID7195984 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp616274 
End bp618192 
Gene Length1919 bp 
Protein Length590 aa 
Translation table 
GC content50% 
IMG OID 
Productsolute carrier 
Protein accessionXP_002177123 
Protein GI219110743 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.15984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACTG GCAACATCTC AAGCATTACC GGCGTTCCGC TCAGCCTCGA CGTGAGTGAC 
GAAGGAACGA GCGACGACAT TGACAAGTCA GGAAATAACT TCGTTTATGA GACACATGAG
GATCGCGCTA AAGCTAATGG TATGAAATAC ACCGTCTCGG ATGTACCACC TTTGCCTTTG
AGTATAATCC TAGGATGCCA ACACTTCCTT ACGATGCTGG GCGCGACGGT TCTCATTCCT
CTAATTGTGA CGCCCGCCAT GGGAGCAACG GCCAAGCAAA CAGCCGAAGT CATTTCAACT
ATTTTTGTGG TCTCTGGTGT CAATACATTG ATCCAAACGA CTCTAGGTGA TCGACTGCCG
ATTGTGCAAG GTGGCAGCTT CAGCTACCTC CCTCCAACTT TCTCCGTCAT TTTCAATCCT
TCTCTGCAGG CCATTGTCGG CGACAATGAG CGCTTCCTTG AAACTATGCA GGTTTTGTCC
GGAGCCATTT TTGTGGTAGG GATTGTGCAA ATGGCGCTTG GGTACTCTGG AGCGATTGTA
CCCATCCTCA AGTACCTTTC GCCCGTTACC ATTGCACCCG TCATCACGGC TATCGGACTC
GGTCTCTATT CTGTCGGCTT CACCAATGTA TCTACCTGCT TTTCTGTTGG CCTCATTCAA
ATGTTGTTGT CAATTATTTT TTCGCAATAC TTGAAAAAGT TCCTTATTGG TGGCTATCCT
GTCTTCGCAC TCTTTCCCAT CATTCTGGCG ATCGCAATTA CCTGGAGCTT TGCCGCCATT
CTGACGGCGT CTGACGTTTG GGGTGAAGAA AGTGCTTGCC GGACTGACAG TACGCGTGAC
TTACTCGACG ATATGCCCTG GTTCCGCTTC CCGTACCCTG GACAGTGGGG TCCACTAAAA
TCAAGTCTTT CGCCATCGTG CCTATGCTGG GTGGAATGCT GGCTGGCATG ATCGAATCGG
TCGGTGACTG CTACAGCTGT GCTAAATTAT GTGGAGCACC CCCGCCAACT CCCGGAATTA
TCAGGTTCGT GACTTGTGAA GCTTTTTGAT CTATGCTCTT TGTTTGAGAT TTTTGCAACT
GATTGAATTC TGCTCCTTTT GCCTATATCT GCAGTCGCGG CCTAGCTGGT GAAGGTATAG
GTGTGGTGAT TTCAGGGTTG TTCGGAGCTG GAGCAGGAAC CACGAGCTAC TCGGAGAACA
TTGGTGCCAT TTCCTTGACC CGCGTCGGTT CCCGCGCTGT CGTCCAATGC GGTGCAGTTG
CGATGATTAT TGTTGGTCTA TTCAGTAAAG TGGCGGCTCT TTTTGCCAGT CTCCCATCGG
CCTTGGTTGG TGGTATTTAC TGCGTAGTGT TTGGGCTAAT CGTTGCGGTT GGTCTGTCAA
ACTTGCAGTA CGTTGATCTG AACAGTGAGA GAAACCTTTT TATTATCGGC TTTTCAATTT
TCAACAGTCT TTCCATTGCT GGTCCAGCGG GATACTTTGC GGGTCAAAGC GAGAATCCGT
TTGGAGATTC AAACGCTGGC GAAATCGCAC TGGCGTTGTT CAGCTCCCCG ATGATTATCG
CACTGATTGC GGCCTTTGTT CTGGACAACA CCATTCCCGG TACACCAAAG GAGCGCGGTT
TGCTTGCGTG GGCGCACGTC CGGGACGCCG ACGTCAACAA CGATCCAGAG TACGTCAAAG
TTTACTCGCT TCCTCTCTTC TTTGCCAAGC TCTTCAAGAA CTGCGGCTAT TTAGAGTACG
TCAGCCGTGG CCGTATGCCA AATCCTCCGG CGAATGGCTA TCAACCAGGA CATGGCGATA
TTGGAGAGCT TTGCTGCGGC GGCTGTTTTG GTGGGCCGCC TTCCTTGCAA GACGACGTGG
AAGAAGTGGC TCCTCAGGAT TCAGTAGTAG ACGAAGAAAA CATTGCAACC GAGGCTTGA
 
Protein sequence
MATGNISSIT GVPLSLDVSD EGTSDDIDKS GNNFVYETHE DRAKANGMKY TVSDVPPLPL 
SIILGCQHFL TMLGATVLIP LIVTPAMGAT AKQTAEVIST IFVVSGVNTL IQTTLGDRLP
IVQGGSFSYL PPTFSVIFNP SLQAIVGDNE RFLETMQVLS GAIFVVGIVQ MALGYSGAIV
PILKYLSPVT IAPVITAIGL GLYSVGFTNV STCFSVGLIQ MLLSIIFSQY LKKFLIGGYP
VFALFPIILA IAITWSFAAI LTASDVWGEE SACRTDMGST KIKSFAIVPM LGGMLAGMIE
SVGDCYSCAK LCGAPPPTPG IISRGLAGEG IGVVISGLFG AGAGTTSYSE NIGAISLTRV
GSRAVVQCGA VAMIIVGLFS KVAALFASLP SALVGGIYCV VFGLIVAVGL SNLQYVDLNS
ERNLFIIGFS IFNSLSIAGP AGYFAGQSEN PFGDSNAGEI ALALFSSPMI IALIAAFVLD
NTIPGTPKER GLLAWAHVRD ADVNNDPEYV KVYSLPLFFA KLFKNCGYLE YVSRGRMPNP
PANGYQPGHG DIGELCCGGC FGGPPSLQDD VEEVAPQDSV VDEENIATEA