Gene PHATRDRAFT_37111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37111 
Symbol 
ID7202247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp189043 
End bp190344 
Gene Length1302 bp 
Protein Length433 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181322 
Protein GI219121956 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0416192 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGGTGT CCCGAATGAG GGCCTCTTTG TTGCTTCTTG CACTACTGAC ATCATGCGAC 
TATGCTGAAT CCTTTCGACA TGCAGCACAA AATAAGATGA GGTTCAAACT GCCTTCCATA
AAGACGTATG AAAGAAACGG ATTATCGGCA AAATTTGTGG ATGGCACACA CAATTCCTCG
ACTGAAAGAG AGCATGCGTG CAACAAGGCT CTCAAAGTAG CACTGCTACA AAACCTGTCC
GTAGATCTTG CAAAACTATC GACGATTCGC CCTGTATCGC CAGCTGCTGA TTTCAGCGCA
CCGGCAGCTA TCATTTCCGC TGGATCCAGC TACACTCGCA TTTGGACACA CAGTACGTGG
GAAAGCCATT CTCGCCCTCC CCACGTGCGA TATACAAACC ATGTTATCCG ATGGGGAGCC
AGCTCTACCG CGCGCAAAAT TCTCCCCACG GTTCTGCTCG CTGCAGCCTG GGCTGCTTTG
GTCGCGAGGC TGGCGCGATC GAATTTTTGG GTCTTAAGGT TCTTGACGGC GACGGAACCG
TCCAAGGCCT TCGGATTTCT AGCAGCTCCG CTCGCATTGT TACTAACGCT TCGTGCGAAC
GCCAGCATGC AAAGACTTTT GGAAGCTAGA TTATTATGGG GTCGCTTAAT CCTCCACACT
CGATCGTTGG CCAGTGTTAT CAGGGTTTAC CTTTACCCTG CTTGCCCACA AGCGTCGACA
TTAGCCATTC GACATATAGC CATGATGGGA TGGATCCTGA AGGCTACATT GCGTGGAGAA
AGTTCCGAGT CGCAACAGGC TGTGTTACGG GTCATGCTCC CTGACGAACG GGATTTCCAA
TGGCTTGCTT CGCATCCCAA AACGAGCGTC GCGGTGACAT ACAGATTACG ACAAATCTGT
TCGCACATGT TAGAATCTTT GATCGATCGA TCTTCCTCTT CGGCAATAAA GTTTGTGATT
GAAGATAAAA TCGGATCGTT GGAGGAGGTC GTTGGCGGGT GCGAACGGCT ATTTGGGAGT
CCGATTCCAC CAACCTACAG TCGACACTTG AGTCGCGTTG TAGTTATGTG GGTTTTGCTC
CTACCGATGT CTTTGCTCTC ATCTCCGGGG CTTTCCACAC TCGGAATTTC CATAGCGACC
GCCGTCGGAA CCTATGTTCT GGTAGGCATT GACGAAGTTG GCATGGAAAT CGAGAATGTC
TTCCAGATGC TACCCCTACA GCAATTGGCG GGTGCGGTAC AAAACGATGT GCGCGACCAA
TTTATTCCCA AGCAGGGGGA AATGCCAAGG GTTATTTTGT AG
 
Protein sequence
MPVSRMRASL LLLALLTSCD YAESFRHAAQ NKMRFKLPSI KTYERNGLSA KFVDGTHNSS 
TEREHACNKA LKVALLQNLS VDLAKLSTIR PVSPAADFSA PAAIISAGSS YTRIWTHSTW
ESHSRPPHVR YTNHVIRWGA SSTARKILPT VLLAAAWAAL VARLARSNFW VLRFLTATEP
SKAFGFLAAP LALLLTLRAN ASMQRLLEAR LLWGRLILHT RSLASVIRVY LYPACPQAST
LAIRHIAMMG WILKATLRGE SSESQQAVLR VMLPDERDFQ WLASHPKTSV AVTYRLRQIC
SHMLESLIDR SSSSAIKFVI EDKIGSLEEV VGGCERLFGS PIPPTYSRHL SRVVVMWVLL
LPMSLLSSPG LSTLGISIAT AVGTYVLVGI DEVGMEIENV FQMLPLQQLA GAVQNDVRDQ
FIPKQGEMPR VIL