Gene PHATRDRAFT_43163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_43163 
Symbol 
ID7196916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011669 
Strand
Start bp2209625 
End bp2211274 
Gene Length1650 bp 
Protein Length549 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002176932 
Protein GI219110361 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAA TAACGAAGAA ACGCCGGCGG GAACGTAAGG ACGTGCTCGC GAATGGATTG 
TTCAATATTC CCGGTCTCAC CGGTGAGGCG GACAATTCTT TGGAAACCTG TTTGGCCGAG
TTTGGGACGG CAACGGCAGT ACCACTTTTC AAGACTGAGA AAACTTCTCA CGAGCCATTG
GAGACACATT CACACCAAAC ACGAAACATC AACGTCACAT CAACCGAAAT CATCAAGCCA
AAAGATCAAT CATTCTTTGC TGCCTGTCCC TCGCATACCA AGGATTTGTC GGACAGGGTT
TCAAGATGGA ACCGAGAGTT TCGGACATTT GTCGATCAAG GGACTTATAC GCCAGGATTA
CTTCCATCCT TTGATGTTGA AGTGGCCCGG CATTTCCAGG TCAAAGATTT ATCGGTTTAT
TTGTTGGAGT CATGTCCAGG AATCAAGATT CCTGCTTTCG AACGCTGGTT AATTGATTCC
AAAATTGAAG AGCGCCAGCG GATAGCTTTG AACCAAGAAG AACTCTCGCA GCATAACGAC
TTGATCCTCA GTCATACAAC GCTAGAATAT GGCGCTTCGC AGCGCTTAGT TGCAGAGTTA
AGCGACGCAG GAGTTTCCCA AGATAGGGCA ATCAAGGCGG TTAAAGAGCT CTGTCGACGC
ACCCAAGCTG CGATTCCTGA ACTTGCATCC CAAACCCGAC GCTTCGCGCT GCGCACGCCA
TTACGCAAAG GCGACCGAAT TGATGTGGTA AAGGACAGTC GCGTTTTCTC GCTAGTCTTC
CATCGCAAGA GTTGGAAGAA ACCCTTTCGG GTCAAAATCA ATGTCTCGCA TTACCACAAG
TTGAAAACAG CTTTTCTACG TGTCCACAAT TCGGATCACC AACTGAAACC TATTCTGTTG
TATGATCATG GAAAACCGAC AAAAGCGATT CATTCGTTTC ATTTGATTAT TATGTCGCTA
CTCTTGCGCT ACTCTGCTCT TTCGGGTGGG CAGCTTTTGG TGGACCTCCG GGGAGGGGGC
ATGCAAGGAG CTGTGCACGA CGAAGTCTTC GAAGCGTTGC AGACTTGCTT TCCAAACGAA
TCGTTTCTCG AATGCTTCGC ATCACCGTTA AATTGCTATG CCGCAAATTT CGGCTCAGCC
TTTACCGACA TCGATTTTCA TTTTGGATCG GTTGGCGACT TTTTAGACCA ATCAATCTCA
CACGGCGTCT GTGAAGCGAA TCCACCGTTC TCGCCTGGTC TCATGGATAC CATGGTAGAT
CGAATAGAAT ACAATCTGAC GTTGGCCGAT CAGACGTCTT CCTGTCTGAC GTTTGTTGTT
ATTATTCCGA CAGCCTCTAC CTCGGAAGAT GTCCGTACCG CTAAACGCTT CGCGACCAAG
TCTTTTCAAC GCATGCTTGG AAGTGCTGCT TGTCGACTTC ATATTTCCTT GGCAGCGCGG
GACCACGGCT ATATTGAAGG TGCGCAACAT TTGCGACCAA CGAGGTACAA GGAAAGCAAT
TTTGATACAA GTGTGATCCT ACTACAAAGT TCAGCGGCCA GAAAAGAAAA CATCGATGAA
AATAATCTGG AAAAGCGACT ACGTTCCGCC TTTACAAGTC GTCACAAAGC TGAGGTTGAC
ACACGCAAGG AACAGGAATT ATCGGAATAA
 
Protein sequence
MPKITKKRRR ERKDVLANGL FNIPGLTGEA DNSLETCLAE FGTATAVPLF KTEKTSHEPL 
ETHSHQTRNI NVTSTEIIKP KDQSFFAACP SHTKDLSDRV SRWNREFRTF VDQGTYTPGL
LPSFDVEVAR HFQVKDLSVY LLESCPGIKI PAFERWLIDS KIEERQRIAL NQEELSQHND
LILSHTTLEY GASQRLVAEL SDAGVSQDRA IKAVKELCRR TQAAIPELAS QTRRFALRTP
LRKGDRIDVV KDSRVFSLVF HRKSWKKPFR VKINVSHYHK LKTAFLRVHN SDHQLKPILL
YDHGKPTKAI HSFHLIIMSL LLRYSALSGG QLLVDLRGGG MQGAVHDEVF EALQTCFPNE
SFLECFASPL NCYAANFGSA FTDIDFHFGS VGDFLDQSIS HGVCEANPPF SPGLMDTMVD
RIEYNLTLAD QTSSCLTFVV IIPTASTSED VRTAKRFATK SFQRMLGSAA CRLHISLAAR
DHGYIEGAQH LRPTRYKESN FDTSVILLQS SAARKENIDE NNLEKRLRSA FTSRHKAEVD
TRKEQELSE