Gene PHATRDRAFT_42003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42003 
SymbolCbs 
ID7201255 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011677 
Strand
Start bp487016 
End bp488548 
Gene Length1533 bp 
Protein Length469 aa 
Translation table 
GC content49% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180648 
Protein GI219119791 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCGCA TCTGCGACGA TATTCTTGAA GCGATTGGAG GTACACCGCT GGTACGCTTG 
AATCATGTTG GAGCCGATTT ACCATGTGAA TTGCTTGCCA AGTGCGAGTT CTTCAACGCT
GGTGGATCTG TCAAGGACCG AATCGGCCGA CAAATGGTTT TGGACGCAGA AAAGGCCGGT
AAAATTAAAC CTGGCGACAC CTTGATTGAG CCTACGTCCG GTAATACTGG AATTGGGCTA
GCCTTAACAG CTGCCGTTCG GGGATACCGC TGTATTATTA CAATGCCCGA AAAAATGTCG
AAAGAAAAGG TAGATGTTCT GAAGGCCCTG GGGGCCGAAA TTATCCGAAC GCCGACCGAA
GCCGCGTACG ACGCACCGGA TTCACACATA TCCGTCGCAC GTCGTCTACA ATCTGAAATC
CCCAATTCAC ACATTCTGGA TCAGTATTCG AATCCATCGA ATCCGAATGC ACACTACTAC
GGGACTGCGG AAGAAATTCT ACGCCAAACG GGTGGCAAAG TTGACATGTT GGTGGCTGGA
GCCGGTACAG GCGGGACGCT AACCGGTATT GCCAAGCGTT TAAAAGAACA CAATCCGGAT
ATTCAAATTA TTGGTGTCGA CCCAGAAGGT AGCATCTTGG CCATTCCGGA TTCTCTCAAT
GACAAACGTC GTTTGGAATC GTATCACGTC GAAGGCATTG GCTACGATTT CATTCCCAAC
GTTCTTGATC GCAGCGTTGT CGATCATTGG TACAAGTCCA ATGATGCCGA AAGTTTTGTT
GCAATGCGGC GCTTAATTCG AGAAGAAGGT TTACTCTGTG GGGGTAGTTG TGGTGCGGCC
GTCGCGGGCG CGCTCAAGGC TGCGCGAAGC TTGAAAGCTG GACAACGCTG CGTCATCATT
TTGCCCGACT CTGTCCGCAA TTACATGAGC AAAGGTCTCA ATGACGATTG GGTCCGTGAC
AATGGATTCG CGGATGGAAA AATTATCAAG GCCAAGTCAT ATTCTTCTTG GTGGGCCACG
AAAAGAGTTT GTGATTTAAA TCTCAGCATT CCTTTGACAA TCACCAGCGA TGTCAGTTGC
AAGGATGCTA TTTTGCTGTT AAAACGAGAG GGTTTCGATA TGGTGCCAGT CTTAGACGAC
GGGAATGTCG TGGGTGTTGT GACGGAGGGT AACATGACGA GCAAGTTGCT ATTAGGACGA
TGCGATCCCG ATACGTCGGT AGCGGATGCA GGTGTCATCT ACCACACGTT TCATAAGTTC
AGCATGAGCG ATACGTTGGA TGAGTTGGCT CAAGCTCTGG ACCACGATCC GTTTGCTTTG
ATAGTGACGG AGCAACGTTG TTTCTCGGTG GCCTCAAAGA AACGGAAACC GACTGTGAAT
GGAGATGGAA ATGTAGAAGC TTTGTCGGAG GAGAGTTCGA CAAAGGCAAA TCATTCCGAA
GTTGTGACTC GCAGCGTAGT CAGTGGCATC GTTTCCCGGA TAGACTTGCT GGACTTCATT
AGCTCAGACG CCAAGCATGA ACTTGAAAAA TAG
 
Protein sequence
MDRICDDILE AIGGTPLVRL NHVGADLPCE LLAKCEFFNA GGSVKDRIGR QMVLDAEKAG 
KIKPGDTLIE PTSGNTGIGL ALTAAVRGYR CIITMPEKMS KEKVDVLKAL GAEIIRTPTE
AAYDAPDSHI SVARRLQSEI PNSHILDQYS NPSNPNAHYY GTAEEILRQT GGKVDMLVAG
AGTGGTLTGI AKRLKEHNPD IQIIGVDPEG SILAIPDSLN DKRRLESYHV EGIGYDFIPN
VLDRSVVDHW YKSNDAESFV AMRRLIREEG LLCGGSCGAA VAGALKAARS LKAGQRCVII
LPDSVRNYMS KGLNDDWVRD NGFADGKIIK AKSYSSWWAT KRVCDLNLSI PLTITSDVSC
KDAILLLKRE GFDMVPVLDD GNVVGVVTEG NMTSKLLLGR CDPDTSVADA GVIYHTFHKF
SMSDTLDELA QALDHDPFAL IVTEQLSGIV SRIDLLDFIS SDAKHELEK