Gene PHATRDRAFT_37459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37459 
Symbol 
ID7202371 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp193918 
End bp195261 
Gene Length1344 bp 
Protein Length422 aa 
Translation table 
GC content47% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181676 
Protein GI219122695 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGGGC GTATTCCTGA AGAGGACCCT TTAAAGTGGG AACGCATGTA TCAAGAAGGA 
GGGTAGGTCG AGCAAGGACC AATTTCATTG GTGAGTCTTC TGTACCCTAA CCAAGATTCG
GTTTTCCTGT CCTTCAGAAA CGCTGCTCCT TTCAAGCTCG AAGGAATGAT GAACCTGCAG
CAGTCCAGCG AAGTCAGAGT TGTTTCGTTT GATCTCGACA ATACTTTATG GGTCACATCG
GCGACTATTT CCGCCGCCAA TGAAGCTCTC GCGGCCTTCC TCGACGCACG AGGCGTCGTT
CAACCTCAGC GAATAGAAAC AATAATGGGA ATTTTATTCA AAGAAAACAA AGAACGATAC
TGCCCCATTG AAGTGGAACA GGCAAAAGCT CCAGCTTTAT TAACACTACT CCGAAAAGAT
GCCATTCGAA AAATTCTTTT GGACGACAAC GGATACTCGT CCGAGAGTGC TGAATGCTCT
GCCGAAGAAG CATTTCAGAC TTGGACAAAT GCGCGCCACG ATGCCATTAC CTTTAACATG
GCTGAAGCTG TGAAAGAATG TCTTCAAGAA ATAGCTGCTA TTCAAACGTC GGATGGACAT
TCGGTCGTGA TTGGAGCCAT TACGGATGGC AACTCAGATC CACGCTTGAT TGATGAGCTA
TCCAAATATT TTCATTTCTG CGTCAACGCC GAAAAAGTTG GAATAAGCAA ACCTGACAAA
CGAATCTACC TAAAAGCTGT ACAGGAACTG GCCGGTCACC CTAGCTTAAA ACATCTCCTT
CCCGACGATG ACGCCCAAGA CTATGAATTG GAATCAAGAT TGGGACCGTG GTGGGTTCAT
GTGGGTGATG ATTTCATCAA AGACGTAGTC GCTGCAAAAG ATCTGAATAT GCGTAGCGTC
TGGGCTCGAG AGCTGGTCCT CAACAAACAG GTAGATTATG CATTGTCGGA GGGAAAGCCG
GAGCGAAGCG TTGAAGCTCT GGTGAAAGAT GTTTCTAAGA ATGAAGTAGT TAAGATGCAG
GTAGGGGCTA CAGATTACTT GGTGAATTCT CTTCACCAAG AGTTTGCAGA TGCAATTGTC
GACCGCTTTG GTGAAGTTGC CACCGTTCTA AATGCATGGC ACAGTGAAGG ACTGGTCAAA
ACCTCTACTC CTCTCCAAAT TGTCGAGAAC GATGTGACGG TACAGGAAGA AGTCGTGCTA
CGCCCCGAAG TAGAATCGGG AGACACTGAA AACGACAGAA CGCCAAACAT AAAGAACGGA
GGATCAAAAT TTTGCCTGTT TTGTAGGAAT ACACTTCCTG GAGCCGCGAA GTTCTGCTCG
GAATGTGGGG AGGGACAACA TTAG
 
Protein sequence
MEGRIPEEDP LKWERMYQEG GNAAPFKLEG MMNLQQSSEV RVVSFDLDNT LWVTSATISA 
ANEALAAFLD ARGVVQPQRI ETIMGILFKE NKERYCPIEV EQAKAPALLT LLRKDAIRKI
LLDDNGYSSE SAECSAEEAF QTWTNARHDA ITFNMAEAVK ECLQEIAAIQ TSDGHSVVIG
AITDGNSDPR LIDELSKYFH FCVNAEKVGI SKPDKRIYLK AVQELAGHPS LKHLLPDDDA
QDYELESRLG PWWVHVGDDF IKDVVAAKDL NMRSVWAREL VLNKQVDYAL SEGKPERSVE
ALVKDVSKNE VVKMQVGATD YLVNSLHQEF ADAIVDRFGE VATVLNAWHS EGLVKTSTPL
QIVENDVTVQ EEVVLRPEVE SGDTENDRTP NIKNGGSKFC LFCRNTLPGA AKFCSECGEG
QH