Gene PHATRDRAFT_37299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37299 
Symbol 
ID7201946 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp664125 
End bp666628 
Gene Length2504 bp 
Protein Length713 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181244 
Protein GI219121794 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000102085 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTGGACT TTATCATCCC TGATAACTTT CCTCCTGACA ACCCCACAGT GGAGACTACG 
GAACCAACTG CAACCATTGC ACCAATCCCA AATCCTGATC CGCCTGAAAA TGTCAATGTC
AACACCACCT TGGATATCCC GGACGCATTG AAAGATCTCC TTTCTAACGT ATCAAACACT
GATGCCACAG TTTCTGGCGC CTACTACACT TGATACACTC ACTAGTTTCG TGCCTCTCTC
TCGTCAACGG ATTATTACAC GTACCAAACT CAAGCATTCA AGGGACTCAT CAAGGAATTC
AAGTTTGATG CTGATAACCC AATGGATGTT CTCACCAAGG CCAAGTCAAA CATGGAGGAA
GCCGCTTTTG CTCTTACAGC AAAAGGTATC ATCAATTGAA AGAACAAATT GGAAAACTTC
CTTACCGAAT TTGGACTTCG CGACCCATTT GATACCATTT ACACACAGTG GCAAACCTCC
CCTAAGGGTC CCATTCCTGT CTTCACGTCA AGTAAAAGTC TCTTTCACGA CTTTCATTTC
ATCTCCTTGT CCAACGTTGT CAATACGGTG GAATTCATGA AACAGTACAC AAACTTGACT
CACCCAACCA AAGGAAAAAT CAACAAAGAA CATTCCCGTG ATTATTCCAT GTCCGGAACC
GTTTTATACA ACTCATGTGA ACCGTCTCTC CAGTTGTGGT TAGATACCCA GATTAGTATC
AGCACCGACA CCATTCTCAA ACGCCATGGC AACTCAGGAC CAGTCCGTTT TTATCTCATT
TGGTCTCGCT ACGCCAATGT CGATGGAGCC GTAGCCACGT CTATTCAAAA CGCTCTTACC
AAGCTTCAAG TGCGCGATCT TCCCGGTGAG AATGTGTCCC TTTACTTTGA CACCATTACC
ATTATTGAAG AGTATCTTAG CTCCATGGGC CGTACCATTC CTGACTTTGT TTCACACGTT
ATTGACGTTT TGATCAATGT GTCTGTTCAT GACTACTCCC TGTTTCTCAA GACACAACAG
TTTGTCTCAA ATCCAGCGCT TCGGAATATA CATGCCCTTC GCCAGCTTGT CTGTGACCAA
TACCAGCTGC TTCTCAATTC TGGCAAATGG CACCCTACAG CAAAAACTGG TGCCGCATTC
CACGCTGTCA AGAACTTCTC CATTGAAACT GGTTTCCCCA ACGACACTCC CAACACCAGT
GCCAATATCA ACCAGTCTCC TGGACATTCT AAGCCCCGAC TCTCTCGCGA AGAGTGGGAA
AAGACTATTG ATCGATCTCC CCCGTCTCCG GGCTCCCCAG ACTGCCGAAA GTCGACAAAA
GGGGATTTCA ACGAGTACTG GTGTGTCACC TGCAATCGCT GGGGCAATCA CCCCACCGAC
AAAACTCGTC ATCCCACGGC AAAGCTAGAC CACACTCAAT TTCTCGAAAA ACGAAAGAAG
CGATTCACTA AACGAGAGAC TCAAGACCCA TCTCCGGCTC CCAGTAACCC TCCTACTCCA
CCACATGGCA TCAATTCCTC TGGGGCACTC CAATTCTTGT GTACTTCCCC ACTTACCCAG
TTCCATTCCT TTGGCGTTCC CCCGGCGAAT TTTTAATGGC ACTGATCTTA CAGATAAGCC
CTTTCCTTTC CTTTGACCCC GGTGGATTAT TGTTTCTTGT TGGTCTCGTC TGCCTCCTTC
CTTTCCTTCT CTACCAGACT TGCTGCCTCC TTTTCCTTCT GGGGGTCGGG CGAGCTATGC
TCCCATTTCT CGCTTCCTTC CTGCCCTGCT CCACCTGGAC TCCACACCGC ACTCATCGAC
ACTCCAAATG GCGCCTCACC ACCGCTTTTC CTACTTCGTT TCTTCTCCTT TCCTCCGTTT
CGGTCTCTCG CACTACAGCG ACCTTTCTCG GCACCCTTAA GATCACGGCT GTTACTCCTG
CCGCACATCA GCTCCTCACC TTCCGTACTG TTTACCTACG TCCCTTTCAA CGCTGTCGTC
GTCCTGGTTT ATCACACTCC CGCGCTGGTC ACTTCACCAC TTATGTCTCC AGCCGTCAAC
TCTGTTTGGA ATACCGCACC CTTCTTAACG ACTTCAAACT GTGCCGTTTT TCGGGCCTCC
TTGATCCTTC TCTGGCGTAT TTTGATCATC CACAATATGA TCTTGAACGT ATACTACCCT
ACGGACCTTG GGAAGATGAT CCAACCATTG TTCCCTCCTT TTCTCCTCCT GTCGAACCCC
TGTATGTTGT GGACTCCCGT CTCGCCTCTG CCCTGAGCAC ACGTCATCTC CAACAGCTTT
TTGACTCCGC TCTACTACAA CACAACATGG TCTCCGAAAT CCGCCATGCC CACAGTTCCG
ACAATGTTTG TTCACCCCTC CTACGGCCCG GTTGCGCCAC GGCTTCTATT GCTGGTGGCA
CTCTCCCTTA TGACACCCTC TATAGTCATC GCTCAACCCC TTGGTCCATG GGGTATTCTC
CTCCGTCCTC TCTACACTGT GCCTACGCTC ATGATAAAGT ATAG
 
Protein sequence
MLDFIIPDNF PPDNPTVETT EPTATIAPIP NPDPPENVNV NTTLDIPDAL KDLLSNFRAS 
LSSTDYYTYQ TQAFKGLIKE FKFDADNPMD VLTKAKSNME EAAFALTAKG KINKEHSRDY
SMSGTVLYNS CEPSLQLWLD TQISISTDTI LKRHGNSGPV RFYLIWSRYA NVDGAVATSI
QNALTKLQVR DLPGENVSLY FDTITIIEEY LSSMGRTIPD FVSHVIDVLI NVSVHDYSLF
LKTQQFVSNP ALRNIHALRQ LVCDQYQLLL NSGKWHPTAK TGAAFHAVKN FSIETGFPND
TPNTSANINQ SPGHSKPRLS REEWEKTIDR SPPSPGSPDC RKSTKGDFNE YWCVTCNRWG
NHPTDKTRHP TAKLDHTQFL EKRKKRFTKR ETQDPSPAPI PFLWRSPGEF LMALILQISP
FLSFDPGGLL FLVGLVCLLP FLLYQTCCLL FLLGVGRAML PFLASFLPCS TWTPHRTHRH
SKWRLTTAFP TSFLLLSSVS VSRTTATFLG TLKITAVTPA AHQLLTFRTV YLRPFQRCRR
PGLSHSRAGH FTTYVSSRQL CLEYRTLLND FKLCRFSGLL DPSLAYFDHP QYDLERILPY
GPWEDDPTIV PSFSPPVEPL YVVDSRLASA LSTRHLQQLF DSALLQHNMV SEIRHAHSSD
NVCSPLLRPG CATASIAGGT LPYDTLYSHR STPWSMGYSP PSSLHCAYAH DKV