Gene PHATRDRAFT_37466 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_37466 
Symbol 
ID7202374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011681 
Strand
Start bp208934 
End bp210127 
Gene Length1194 bp 
Protein Length397 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181680 
Protein GI219122703 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATCCCT TGCACTGCCG CTCCATCGGA ATGTTCAGTC TTATCTTGTG GAGTTTGGCG 
TTCGGTACAA CGGTAGCATT GTCGTCGCTT TCTGGGCCAA CAAATAGTCC AACCGCGAGC
AAGAAACGTG TGCACATCGT GACGGGTGCC AGCGGATACG TAGGCCGAGC CATTGTGCAT
CATATTTGCG AAAACGCTTC AATATCGCTT ATTCAATCCG AGGTACATCA TTGTCAAGAC
GTTTTGTGTT TGGTACGACC AAATCGAGTG GCGACCGAGC AAGCGTACTG GAACATACTT
TTGCAAGATA TCGCATCGCC CGTGTCCGTC CGTGTCCTTC CCTACGATAT GTTGGATGGT
GGAGCAAGTC TTAAGGACGC ACTCGCATCT GTGGTGGTGG AACAAGATCA CGCCGAGACG
TGTGTCTATC ACGTGGCTTC CGTGTTCGGT CCAACCGAAG ATCACCAACA AACGGCACTA
GACAATGTAA AGGGAACGGA AGACTTGGTG CGTACCTTGG TAGATTCTGG CATGACTTGC
CGGCTCATCA TGACTTCGTC TATGGCGGCC GTTCGAGGCT CTGGACAAAG GCCACGAAAC
GGAAAGTATT ATACCGAACA AGACTGGAAC ACAATTAGCC TGTTGGGTGC CAACTGGGGC
GCCAGTTATC AATGGTCCAA AGCGGAATCG GAACGCAAAG CCTGGGAGAT CTGCCGACAC
CACAACATTC CAATGGTGGC ACTTTGTCCT TCTTTCGTCT TTGGACCTCC TCGGGATTCG
ATTAATAGTA ATTCATATTC TATCACTTTG GTTGGTCAAT GGGCGAGAGG GGAATCTCAA
GTGCAAAGCC GTCTTTTTGT TGATGTACGA GACGTCGCTG CAGCACATGT GGCCGCCGCC
ATCGAGCTGG AGGCTGCTGG CCAACGGTAC ATCGTTTCTT TGGAAACGCG AGCTCCTAGT
CAAGACATTG CGACGTGGTT GCGAGAGGTA TGCCAAACTA CCGGACTGTC TGATCCGGAA
AAGGTTCATT TTGACGGAGA ATTTGACGGT GGCGCAATCC CTATCGGAAG CAAAGAAGTG
GACGCAATCG ACCGGCTACG AAGGGAACTG AGAGTTACAT TGCGTCCTAT CAAAGACACG
ATAAGGGACA TGGCTGGAAA CTTACTCAAA GAAACCGCGC AAAATGACTG CTAA
 
Protein sequence
MNPLHCRSIG MFSLILWSLA FGTTVALSSL SGPTNSPTAS KKRVHIVTGA SGYVGRAIVH 
HICENASISL IQSEVHHCQD VLCLVRPNRV ATEQAYWNIL LQDIASPVSV RVLPYDMLDG
GASLKDALAS VVVEQDHAET CVYHVASVFG PTEDHQQTAL DNVKGTEDLV RTLVDSGMTC
RLIMTSSMAA VRGSGQRPRN GKYYTEQDWN TISLLGANWG ASYQWSKAES ERKAWEICRH
HNIPMVALCP SFVFGPPRDS INSNSYSITL VGQWARGESQ VQSRLFVDVR DVAAAHVAAA
IELEAAGQRY IVSLETRAPS QDIATWLREV CQTTGLSDPE KVHFDGEFDG GAIPIGSKEV
DAIDRLRREL RVTLRPIKDT IRDMAGNLLK ETAQNDC