Gene PHATRDRAFT_31975 details


Gene Information

Locus tag: PHATRDRAFT_31975
Symbol:
ID: 7196454
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1383425
End bp: 1384807
Gene Length: 1383 bp
Protein Length: 397 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002176774
Protein GI: 219110044
COG category:
COG ID:
TIGRFAM ID:
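
The gene length in the record follows from the start and end coordinates, assuming they are 1-based and inclusive (an assumption; the record does not state its coordinate convention). A minimal sketch in Python, where `span_length` is a hypothetical helper name and not part of the source database:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the Start bp and End bp fields of this record.
gene_length = span_length(1383425, 1384807)
print(gene_length)  # 1383, matching the Gene Length field
```

Under a 0-based, half-open convention the same span would be computed as `end - start` instead, so the convention should be checked before reusing these coordinates elsewhere.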


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACCAT TACCTTACAT GGATCATGTG CAGGCTACAT TGAACCCATG CAGCGTTGAA 
ACAATTCCTG AACCCATAAC TCTGGCGGAT AGCTCAAAGG TCTGGCCCTT GGTTGTCTCT
TGCTACGAGT TGGACGAAGC TTCCGGTCGT CGGAACGGCA AAGCAGATAT GTTTACAGTT
CCGATGCCTG ATATCTCGGA AGATAAGGAA ACGACTCTGC CACTAAAGTT TGGAAGTCCC
CATACTTTCA CAGACAAAAT ATCAGGGATT CTTGATGGCA AATGGTCCGA ATTTTATAGC
CCGGGCGACA ATTCCAAATC ATGGTGTTTC GCGACAGCGC AATCATCAGG CGAGATTCGT
TCTTTCCGTT TGCAAATCCC ACGATCTTTA GAAGGATATC CCCCGGTGTC AAAGTCAGAT
CCGTTGTACA CAATTGCGGA AGCGGGCGCC AGCGAACCAC CTGAAGATGA TGACGGAGCT
CCGCTATGCT TGTCTTTAAA TTGGGAACCA TCATCTCAAT GGAATAGCAA ATCCGGTATG
AAACGAATAG TGTCCACGTA TTCAAATGGG ACTGTCGCAA TTCATGATGT ATCATTTTCA
TCTGGTTCTA CGCATTTCAT TGCCAGGGAA AGTTGGCGAG GTAAGTGCTC CTTTTACCGG
AAAGAATTTT GTAGGTACAG TGATAGCACG CTTAAAATAT TCGGATCTTA TTGCTCACTG
ATCAGCACAT AGTATATTCA CAAGTCCTGC AGAAGTTTGG TCAGCGTCTT TCGCGTGTGA
CGGGGACCAA AATATGATTC TTTCCTGTGG CGATGAAGGA TCAGTGAAGG TATGGGATAT
TAGGAGTAAT GTTCGACCCA TGCACGAATT GAATTTTTTT GAATCCGGAG CAACGTGCGC
TTCGCATCAT CCTCGGCACG AGCACTTGGT TGCATGTGGT TCTTATGACG AGAGAGTTTG
TATCTATGAT ATTCGATATC TATCTCAAAA GCCATTGTTT CGAAGTGATT CCTTAGGAGG
GGGAATATGG AGACTTAAAT GGCATCCATA CTCCGACCAG AAGTTACTTG TTAGCGCAAT
GCACGGCGGA TGCCTTGTCC TACGCGTAAG CCAAGATGTT GGAGTAGAGA GCGGAATTGT
AGACGCGCCG AGTTTTGAAG TGACAAAAAC GTTCACTGAG CATGAGAGGT ACGTCTTGTG
TCTAATGAAA TGCCGCAGTC CCTGATCGAA ATGCACTAAA ATCAAATCTT GTCATTTTTA
TGCAGTATGG CGTACGGTGC CGATTGGCTT GTGAGTGGCA ATCCAGCGCA GAAGACCTAC
TTTGAAGCTG CAGCTAGTTG TAGTTTTTAC GATCGGAGCA TCTTCCTCTG GGAAACGGTA
TAA
 
Protein sequence
MAPLPYMDHV QATLNPCSVE TIPEPITLAD SSKVWPLVVS CYELDEASGR RNGKADMFTV 
PMPDISEDKE TTLPLKFGSP HTFTDKISGI LDGKWSEFYS PGDNSKSWCF ATAQSSGEIR
SFRLQIPRSL EGYPPVSKSD PLYTIAEAGA SEPPEDDDGA PLCLSLNWEP SSQWNSKSGM
KRIVSTYSNG TVAIHDVSFS SGSTHFIARE SWREVWSASF ACDGDQNMIL SCGDEGSVKV
WDIRSNVRPM HELNFFESGA TCASHHPRHE HLVACGSYDE RVCIYDIRYL SQKPLFRSDS
LGGGIWRLKW HPYSDQKLLV SAMHGGCLVL RVSQDVGVES GIVDAPSFEV TKTFTEHESM
AYGADWLVSG NPAQKTYFEA AASCSFYDRS IFLWETV
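
Both sequences are displayed in blocks of ten characters, so the spaces and line breaks have to be removed before lengths or base composition can be checked against the record. The sketch below shows one way to do that; `clean_sequence` and `gc_percent` are hypothetical helper names, and only single lines of the record are used as input, so the GC value it prints describes that one line and not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence shown in 10-character blocks."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (empty input gives 0.0)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence and last line of the protein sequence,
# copied from the record above.
first_dna_line = "ATGGCACCAT TACCTTACAT GGATCATGTG CAGGCTACAT TGAACCCATG CAGCGTTGAA"
last_protein_line = "AYGADWLVSG NPAQKTYFEA AASCSFYDRS IFLWETV"

dna = clean_sequence(first_dna_line)
print(len(dna))                                # 60 bases on a full line
print(len(clean_sequence(last_protein_line)))  # 37 residues on the final line
print(round(gc_percent(dna), 1))               # GC percentage of this line only
```

Passing the full gene sequence through the same two functions gives values that can be compared with the Gene Length and GC content fields.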