Gene PHATRDRAFT_43481 details

Gene Information

Locus tag: PHATRDRAFT_43481
Symbol:
ID: 7197183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 576092
End bp: 577670
Gene Length: 1579 bp
Protein Length: 333 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002177643
Protein GI: 219111783
COG category:
COG ID:
TIGRFAM ID:
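
The Gene Length field is consistent with an inclusive coordinate interval, that is End bp - Start bp + 1. A minimal Python sketch of that check, assuming both positions are included in the feature; the function name `inclusive_length` is illustrative and not part of the database:

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end positions are both included."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp and End bp fields of this record.
gene_length = inclusive_length(576092, 577670)
print(gene_length)  # 1579, matching the Gene Length field
```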


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AATTTATAGA CAGGAAAATA TCAAAGCCTA CACAGAAAAA TAATTATACT AATCAAAAGA 
TTTCAGAGGA CTGTGAAAAA GCGCACGCCG GTCACAACAT CCGCAAAACA TCCTCCGAAC
CTCAAAGCAT GTGCTTTGCT ATTTGAGGAT CCACTCGGTG GACATTCCCT ATCCGTATTT
CAAATTTTGT CGGAGGCCAT GATTCGCGGT CTAGAAGCAT AGAATATTTT ACCGCTCAAG
ATAGCAGATT CTCTTTGTTC GATGAGGAGA TGCTCAAGAA TTTATTGACG AATCCGGGTG
TGGCTATCGA CAGAACCATT CCACAAAAAA GAGTGTCAGA ATGATGCGTC CTCATTTCCT
AACGACAAAT GGCCAACTCA TGTTCTCCCT CACAAATAAA ACAGGTTGCC ATTTTTCACG
AAGCCACAGT CGGTGCATTA TCCTCCCAAT TCGATCTCTT CCCCGCAATA GTAAACAAGC
ATGAACTGTT CCGACCAAGA CATCCTTTTG CGACGCGGAG GGGTCGTCTC GCAACGGCCT
GCCAACAAAG AGTACCACGA AATTCTTAAG TCCAACGTCA TGATGTACAA AGGCCTTCCC
AAAGAAGGAC GTGCACAATT TGCCACCAAC ATCTTTCGCT TCATCCATGA CGTGGTTGGT
CGCAGATTCG TACAATTTGA CCACGAAACG GGGTCCTATC AGGAGTTGCA GGAGACGGAG
GCCAGTGTCA AAATTTCCGC TGCAATCCGC GATGATAAGA TCTCTAGGGC TGCGAACAAG
TCCAATCCAC GGTGGATCTT AGGTGTCCAT GATCCTACGG GGACACATAA CGGCCAGCCA
TTCGTCCACC GTATTGAAGC CAACAATACC TGGCGCCGGC TTTCGTCCAA TGAAACCGAT
CAAGTTCTCA TCACAGAAGG ATTTCAGAAT CTTTTGATTC TTCCGGAAGG TGTAAAATTG
CCTCCTTCTT TGGAGACGCG ATCAAGTGAC TATGCTAGCG CGGTCTCGGC CAGCGTTGTC
AACCCCTTCA CGAACGACGA TTGTTTTTTT ATTCCCAAAG ACCGTCCCAG TCGGCGGTCT
CCTATGTGTA CCGTATCACA GCAAGTTGCC TTGGAACTTG GCTTCCAAGA TGCGGAGTCC
TTCCAGCGAG CCTTATCACT GCTAATGGGG AATCAGCAGC AGGCGAAAGC ATTTATGCGG
TCAAGATCCA AAGTCACGCA CCCGGTACCG ACTGTCCCAT CGTATGTTGA ATTCATCTCG
GACGCGGATT CTTCAATGAC TGAAGACTCG GATGCAGCTT CCACTTCGAT AGAGCCTCAT
CCGTGGAAAA GGACTAGAAT CTGCAGTGTT GATGATTTGG ACCTCAACAA AGTGCACGCG
CTGGCGGGAC AAGCGTTGGC ACCGTCTGAT GAATCCAGCT TGCGAGATTT TTTATTAATC
GACGCCATTG AGCTTGACCT GCTCCAAACG TTTGAGGTGT GATGAGGATT TTCTGAAAAT
CGAAGATGTT GAGCTTGATC TTGTCCAAAC GTTGATGAGG GTAGTTTCAT TGGATTTAAT
GAATTTTTAC AATAACATT
 
Protein sequence
MNCSDQDILL RRGGVVSQRP ANKEYHEILK SNVMMYKGLP KEGRAQFATN IFRFIHDVVG 
RRFVQFDHET GSYQELQETE ASVKISAAIR DDKISRAANK SNPRWILGVH DPTGTHNGQP
FVHRIEANNT WRRLSSNETD QVLITEGFQN LLILPEGVKL PPSLETRSSD YASAVSASVV
NPFTNDDCFF IPKDRPSRRS PMCTVSQQVA LELGFQDAES FQRALSLLMG NQQQAKAFMR
SRSKVTHPVP TVPSYVEFIS DADSSMTEDS DAASTSIEPH PWKRTRICSV DDLDLNKVHA
LAGQALAPSD ESSLRDFLLI DAIELDLLQT FEV
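
The sequences above are printed in blocks of 10 characters, up to 60 per line. A small Python sketch for joining such blocks back into one continuous string before further use; `join_blocks` is an illustrative helper name, and the sample input is the last line of the protein sequence:

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split())


# Last line of the protein sequence, copied from this record.
last_line = "LAGQALAPSD ESSLRDFLLI DAIELDLLQT FEV"
joined = join_blocks(last_line)
print(joined)       # LAGQALAPSDESSLRDFLLIDAIELDLLQTFEV
print(len(joined))  # 33
```

Five full lines of 60 residues plus these 33 give 333, in agreement with the Protein Length field.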