Gene PHATRDRAFT_37497 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37497 
Symbol 
ID7202488 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp287032 
End bp289008 
Gene Length1977 bp 
Protein Length462 aa 
Translation table 
GC content44% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181693 
Protein GI219122730 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0913561 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACCCA AGGCAAGTTG GCCAAAACTA ATCATGTGTG TTTGGTCGCA GATTGCAACA 
ATGATTGTTT TGGATTGGTG ACTGGACAGT TGAATGTCCA AATACATCAT GTGACTGTTT
TGGGTGTCGA CCGTTACCAT GCTTGGAAGC ATGGTTACTA TGGCAAACTT GCCCACGGTA
GCGTGTACAA TACTGTGGGA TTGATGAAAA TGGTCCACGG TAATGCTTAA GATCGTGGAA
CTGGACCACG GTGGTTTTGG CACCGTGGAA CTTGATGAAA TTGAATAGAT TTGGCTTTAG
TTTAGAGCTT ATAAATCTTA ACAACAATTG TGTGAAAACC ATATTTGTTA GTTCGTATCT
CTTTGTTAGA AGCCAGACGA GTCTTGATCA CCTTGTGCTG TCTTCAGAAG CCAGGAGATA
CTTGACCATT GACAGATATG GTGCATTGAA GACTACCAAG GTTATTGTTG GGACTGTTGT
AGAGGTCACA ACCACCAGGA AGCCTCAAAC CAACCGTACT TCCACCTTTG TTACTGCTGA
CTTTGATTTG GGGGGTGGAG CAGTGAAGCG AAGCACTCTG AACATCCGTA GCGTCAAGGC
CTTTGTACCA GACTTGACCA CGTCACCAAA TGACGCTGAT ATAGCAGCTG TGGCAGCGCC
CAACAATATG CTTGACAACA TTGATGCGCT TCCAACCGTA ACAGCGGATA CTGCAACTGA
GACTGATTTG TTTCATATGG AAGCAGGGTT TGATCAACAT CCAATTCTCG AACAAGACAC
GGAGTCACTG GTGCAGCTAC CAGTACTTCC AATTCTACCC AATGCTGACT TTGGTAACAA
AAACTTTTCC GAGGCAGAAG CTGTCCCTGC TGCAGTAGCA CATGGCACAA AGTGGTATGA
AGATGATGAA GCTACTCTAA ATGGTACAAA TGGCAGTGTG CCAATCAAAG ACTTCGGTAT
TTCCACGCCT GTTGGTGAAG TCTTGGGTCC AAACTCTGAT ATTGGCGGAA AGTACTCGAG
ACTGGAATAC TTCCTTCTGA TGTTTCCACC CAAACAGCTC ACCACTATGT GTCAGCTAAG
AAACAATGCT CTGGTGCAAC AGAACAAGCA CATCATCACT ACTGGAGAGC TGCTTCGCTT
TTTTGGAATA GTCATTCTGA CAACAAAGTT TGAGTACACA AGCCAATTCC AGCTGTGGTC
AACAACTGCA CTGTCAAAAT ATATTCTGCT CCATGCTTTG GACGGACAGG AATGTCAAGA
CAGCGATTCA ATGATATATG GCAATGTCTT TGCTGGAGTG AGCAGCCTCC TGAGCGGCCA
GAAGGTATGA GTTTGCAGAG CTACAGATGG AAACGTGTCA ATGGCTTTGT AGCCAGGTAC
AATGATCACC AAAGTACAGC TTTCAAGCCC TCTCACATGA TTTGTGTTGA CAAGTCCATC
TCTCGCTGGT ATGGCCAAGG GGGGAATTGG ATTAATCATG GGCTGCCTAT GTATGTTGCC
ATAGATTGAA AGCCAGAGAA TGGTTGCGAG ATCCAAAATG CGGCATGTAG ATGTTCCGGA
ATTATGCTTC GGTTGAAACT GGTCAAGTCA AAGACTGCTT GGGAAGAAGG GGATGAGGGT
GGTCTAAGCA ACAATAATCT TTTACTTGGC ACAAGGATTC TCAAAGAGCT AGTTACTCTT
TGGGCATGGA CAAACCAAGT TGTATGTGCT GATTCCTATT TCGCTTCTGT TGGTGCTGCA
TTGGAGTTGA GACAAATAGG TTTGGGATTT ATTGGGGTTG TGAAGAGTGC AACAAAGCAC
TTTCCAATGG CTTATCTTTT GAGACTGGAG TTCAATCATC AAGGAGACCA AAAAGGACTG
TTGATGAAAG ACGGACTCAA TGGAAGTAGC TTGATGGTGT TTGTATGGAT TGATTGCAAT
TGTCAATACT TTATATCAAG TGTGTCCAGT CTTGATGCCG GCAGTCCATT TGTTTGA
 
Protein sequence
MTPKASWPKL IIYGALKTTK VIVGTVVEVT TTRKPQTNRT STFVTADFDL GGGAVKRSTL 
NIRSVKAFVP DLTTSPNDAD IAAVAAPNNM LDNIDALPTV TADTATETDL FHMEAGFDQH
PILEQDTESL VQLPVLPILP NADFGNKNFS EAEAVPAAVA HGTKWYEDDE ATLNGTNGSV
PIKDFGISTP VGEVLGPNSD IGGKYSRLEY FLLMFPPKQL TTMCQLRNNA LVQQNKHIIT
TGELLRFFGI VILTTKFEYT SQFQLWSTTA LSKYILLHAL DGQECQDSDS MIYGNVFAGV
SSLLSGQKPE NGCEIQNAAC RCSGIMLRLK LVKSKTAWEE GDEGGLSNNN LLLGTRILKE
LVTLWAWTNQ VVCADSYFAS VGAALELRQI GLGFIGVVKS ATKHFPMAYL LRLEFNHQGD
QKGLLMKDGL NGSSLMVFVW IDCNCQYFIS SVSSLDAGSP FV