Gene PHATRDRAFT_35637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_35637 
Symbol 
ID7201083 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011676 
Strand
Start bp411973 
End bp413880 
Gene Length1908 bp 
Protein Length635 aa 
Translation table 
GC content55% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180229 
Protein GI219118925 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTCCT TAAATTCTGC GAGCGAAAAC AAAGCGCAAG CGATAGTGAT CATTACCGGG 
GGCGGTGGAT TCTTAGGGCA ATCGTTGGCG TCGGCCTTGC TGGAACGTCA AACGATCCAA
GGTAACGGGG TTGTGCTTTC ACTGGGCTTG CTCGTTTTGG CCGACGTTGT TTTCCCCGAA
ATCTTACAAC CAGTGCTTGA AACATCCAAG TGGGATAAGC TTGTCAAGCT TCAGGGAGAC
ATTTCCGACC CTACCTTTGT CGATAACTTG TTCGGCCTAA TCCCTTCCGA CGCCGCCCAT
GTATCGATTT TCCATCTGGG CGCCGTCATG AGCGGTGACG GGGAACGGGA CTTTGATTTA
TGCATGAATG TGAATCTATA CGGCTTTTTA CATCTGATAC AAGGAGCCCG TAAGTACGTG
TACGAGCGTC TCGGTTTCCC CGCCAAGTTC ATTCTGGCGT CGGCCGGAGC AACCATTGGA
TCCGGCGCAC CGACCGACTA CATTGGCAAG GACGACATTA TTTCGGACGC TACACGAGCA
ACACCGCACA CAACCTACGG GGCCACCAAG GCCTGTGCCG AATTACTTTT GAGCGATTAC
AGTCGCCGGG GATTCGTAGA CGCCCGCGGA CTACGACTTC CTACCATTGT TGTACGGGCC
GGTAAGCCCA ACGCCGCCAC GACGGGTTGT TTTTCCGGTG TTGTACGCGA ACCACTTGCT
GGGGTCGATA CGACCTTGCC CATTGCCAGG GATGTACTGC ATGCCGTTAC TGGTAAACGT
CACGCAATCG ACGCCATGCT GACACTCCAC AACGCAAGCC TGGAACAAAT CGAATCTGTT
TTGGGCTACG ATCGGACCGT ATTCTTGCCA GCCGTAGCTC TGAGTCTGGG AGATCTCGAA
GACGCCCTTT GGAAAACGGT CACACCTGAT ACGCAACACA AGTTGGGAAA GATCACGTAC
CAGGTGGATG CCCATTTATC GGCCGTGGTG GCAAGTTTTC CGACCAAAAT CGATGCCCGA
AGAGCTCGAC GGCTCGGCAT TCCGTCCGCG CCGGATGCGG ACACTTTGAT TCGCCAATAC
GTCGCCGACT TTTCCTCGGC TATCGCCTCG GGTATTGAGC TTGTTGCTCC ACAAAGTGGC
AACATTGCCG CTTTCCCCAA GGAAAGCAAA GTGGCCGTCA TTACAGGAGC CGGCAGTGGC
ATTGGACAGG CCGTCGCTCA AAGATTGTCT CGAGGTGGTT GGATTGTAGT CTTGGCGGGA
CGTCGCAAAA CGACGTTACG AGAAACAGCC AAGACTCTTG AAGGGCGCGC GTGTTTGTGC
GTCCCGACGG ATGTGACCAT CGAGTCGGAA GTAGAAGCGC TCTTTGAAAC CGTCCACACC
AACTACGGTA CGATTGATCT GTTGTTTAAC AACGCTGGTA TCAACAGCAC AGCGGCCAGT
TTCGCAGACG TGGAGTTTGC CGACTTTGAG CGTGTGCTAC GTACCAACGT GTGCGGCCCG
TTCTTGTGCG GCAAAGCGGC CATGAAACGC ATGGCCGCCA ATGGTGGCGG CCGAATCATC
AACAACGGTA GCCTGTCGGC GCAAACGCCC CGACCCGGGT CCGCCTGCTA CACCGCCTCC
AAACATGCTG TGCTGGGGCT AACAAGATGC ATGGCACTTG ATGGACGTGC GTTCAACGTG
GCGTGTGGTC AGATCGATTT TGGCAACGTG GTGAGTGAAA TGAGCTTGCG TACTAACAAG
GTAGGGACCG GGGCGTTGCA GCCCAACGGA ACCACTCTCG TTGAGTCTTC CATGAGTCTC
AAGGATGCCG CCGAGACCGT CTGGAGCATG GTCAATCTAC CTCTGGAAGC CAATGTATTG
CAGATGACGG TCATGGCCAC AACAATGCCG TTTGTCGGGC GTGGATGA
 
Protein sequence
MASLNSASEN KAQAIVIITG GGGFLGQSLA SALLERQTIQ GNGVVLSLGL LVLADVVFPE 
ILQPVLETSK WDKLVKLQGD ISDPTFVDNL FGLIPSDAAH VSIFHLGAVM SGDGERDFDL
CMNVNLYGFL HLIQGARKYV YERLGFPAKF ILASAGATIG SGAPTDYIGK DDIISDATRA
TPHTTYGATK ACAELLLSDY SRRGFVDARG LRLPTIVVRA GKPNAATTGC FSGVVREPLA
GVDTTLPIAR DVLHAVTGKR HAIDAMLTLH NASLEQIESV LGYDRTVFLP AVALSLGDLE
DALWKTVTPD TQHKLGKITY QVDAHLSAVV ASFPTKIDAR RARRLGIPSA PDADTLIRQY
VADFSSAIAS GIELVAPQSG NIAAFPKESK VAVITGAGSG IGQAVAQRLS RGGWIVVLAG
RRKTTLRETA KTLEGRACLC VPTDVTIESE VEALFETVHT NYGTIDLLFN NAGINSTAAS
FADVEFADFE RVLRTNVCGP FLCGKAAMKR MAANGGGRII NNGSLSAQTP RPGSACYTAS
KHAVLGLTRC MALDGRAFNV ACGQIDFGNV VSEMSLRTNK VGTGALQPNG TTLVESSMSL
KDAAETVWSM VNLPLEANVL QMTVMATTMP FVGRG