Gene PHATRDRAFT_37521 details

Gene Information

Locus tag: PHATRDRAFT_37521
Symbol:
ID: 7202501
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 335570
End bp: 337395
Gene Length: 1826 bp
Protein Length: 538 aa
Translation table:
GC content: 46%
IMG OID:
Product: predicted protein
Protein accession: XP_002181706
Protein GI: 219122757
COG category:
COG ID:
TIGRFAM ID:
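
The coordinates above are consistent with inclusive, 1-based positions: 337395 - 335570 + 1 = 1826 bp. The following is a minimal sketch of that arithmetic, not part of the original record. The function names are hypothetical, and the comparison with the protein length assumes that the annotated span runs from the start codon through the stop codon with no untranslated regions, which this record does not state.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally with a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


span = gene_length(335570, 337395)  # values from Start bp / End bp
cds = coding_length(538)            # value from Protein Length
print(span)        # 1826
print(cds)         # 1617
print(span - cds)  # 209
```

Under the stated assumption, the 209 bp difference would be non-coding sequence within the span, which would fit the "Is gene spliced: Yes" entry.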


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGGAT GGTCGTCCTG TGCTCCTTCT GCGCAAAAGT TTTCAAATCT CAAGAAAGCA 
ACGTATGGAA GCTTCAGCCT GCACTGTCCT TCATCTCGAT GTGAATGTCT ATGACGCATA
GTCAATATGT GCACAGGAGA CATCCAGTTG GGACTGGGAT TCGAACAGAA GAATGGAGGA
AGCCCGTCGT GGTGCCGGCA GAGCGATGTG GCACGGAGAA AGTCAAAGGT GAATTGCCTG
TAGCTCGCAA GCTGCAAAAA CAACAAAGGC AAATTCCGAC AATCCTGTTG GTCGCCGTTC
TCTTGCTTTT TTCTTTTTGC ATGCTTAAAA GTTTTCGAAA AACGGTGCAT CATCATGCGA
TACGCAGAGA CCATTTGCAC CAGTACTCTG ATGTTAGCGA GAAAACAGTC ACTTCCATTC
GCCAAGCTAG TGGTATTCTT CTCCACAACA ACCAGATAGG GAGGACACCA ATTATTCAGC
CACAAATTTT TCTACCAACT GTAAACGAGG ACGGAACTAC TGAAAAAAAA CGTGAAGCTG
CGTCACACGA GATTCCGTCA AGCAATGAAG GAATACGCGT AACGGATCAG ATAGCCCCGA
ATATCCGCAA GACAGTCCCG AATACAGACC GAATCGCTTT TAGGTACTGG CATGAAGACG
AATTGATAAG CAACCAGAAA TCTTGTCGGC AGCCGCACTG GGCATTCTTT CACTTTCCTA
CCTGCAACGC CTTTCACGAG ATGCCACTCG AACGTGAATA TTTTGAAGCT TCACAAGGCC
GACAGGGTTC CGGAGTAACT GAACTCGACA GCTATTATAT CAACAGCGGC TATTATCGTG
ATGTTTGGGT GGTCGCAGGC TCTGCGTCAC TTGGCAGGCT TATTCTCAAA ACATCCAAAT
TTGAATTTGA CATAAACTAC AAAACCCTGC ATCAGGTCCA TCGTGAAGCA AACGTGATGG
AGCGTTTGTC CAGCAACCCG TCCATAGTTG ATATTTACGG TCATTGCGGA GGCTCGGTGG
CAGCTGAAGC CATATCGTAT GAAGTTGAGC GATACGTTGT CGCCGGATCA GGCTATGTGA
ATCCTGGCCT AGGGGCTGAT CAACCCGCAG ATCTTTCACC ACAAAATGAT TTCACGCCGT
CAGAAAAATT CCGCATGGCT CTCGCCATGG CCGAATCGAT TGCAGCTCTT CATGGCTATC
ACGGTGGTGT TATTGTTCAC GACGATATCC AGTTGCGACA ATGGCTGCAA ACCAAAGACG
GAATATTGAA GTTGGGCGAC TTCAATAGAG CGTATGTCCT AGATTGGAAC GACTCCACAC
AGGCATACTG TTCATACAAC AATGGACAGG CATTTGGAAA TGTAAGTATG ATCTTTGCTG
ACCAGTTTGA GTTGTACAAT TACTTCATCT GAGAGTAATC ATTTTTTGGG CCAGAATCGT
TCACCGGAGG AATATCAAGC CGGAGAATTA GACGAAGCGA TCGACGTCTA TTCCTTTGGG
AATTGCTTGT ACAGTCTGGT AGGTTGAACA TAAAGAGCTG TGACAGCCTC CGCATTGTTT
TCAAGTTAAC TGACACCATT TTCTTGTTCT TGCCATACTA GTTGACTGGG CTTTGGGTCT
TCTACGAAAA TGAAGATGAT GCTATTGTGC AAGAAAAAGT TTTGACGGGG AAGCGACCCA
TGATTGATAT CCGCTACCGA AACCGCAGTC TTGAGGAGAA AATTTTAGTC GAAGTAATAG
ACGGGTGCTG GCAACCAGAT CCGAAAAAGC GGCTTGACAT CTTTCAGGTT GTTCGAAGAC
TTCGAGAAAA CTCACAGGCA CTTTGA
 
Protein sequence
MTGWSSCAPS AQKFSNLKKA TRHPVGTGIR TEEWRKPVVV PAERCGTEKV KGELPVARKL 
QKQQRQIPTI LLVAVLLLFS FCMLKSFRKT VHHHAIRRDH LHQYSDVSEK TVTSIRQASG
ILLHNNQIGR TPIIQPQIFL PTVNEDGTTE KKREAASHEI PSSNEGIRVT DQIAPNIRKT
VPNTDRIAFR YWHEDELISN QKSCRQPHWA FFHFPTCNAF HEMPLEREYF EASQGRQGSG
VTELDSYYIN SGYYRDVWVV AGSASLGRLI LKTSKFEFDI NYKTLHQVHR EANVMERLSS
NPSIVDIYGH CGGSVAAEAI SYEVERYVVA GSGYVNPGLG ADQPADLSPQ NDFTPSEKFR
MALAMAESIA ALHGYHGGVI VHDDIQLRQW LQTKDGILKL GDFNRAYVLD WNDSTQAYCS
YNNGQAFGNS NHFLGQNRSP EEYQAGELDE AIDVYSFGNC LYSLLTGLWV FYENEDDAIV
QEKVLTGKRP MIDIRYRNRS LEEKILVEVI DGCWQPDPKK RLDIFQVVRR LRENSQAL
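
Both sequences above are printed in space-separated blocks of 10 residues, 60 per line, so they need to be joined before lengths or base composition can be checked against the Gene Length, Protein Length and GC content fields. The following is a minimal sketch of how that might be done, not part of the original record. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the GC calculation assumes the sequence contains only unambiguous bases.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence in this record, used as a small example.
first_line = "ATGACAGGAT GGTCGTCCTG TGCTCCTTCT GCGCAAAAGT TTTCAAATCT CAAGAAAGCA"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_percent` would give a value to compare with the reported GC content.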