Gene PHATRDRAFT_42116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_42116 
Symbol 
ID7202200 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011680 
Strand
Start bp848733 
End bp850008 
Gene Length1276 bp 
Protein Length417 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002181275 
Protein GI219121859 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGGAA TCTTTGTACT CGGCTACATG GGCATTATTT TCGAAGAAGT CTTTGAATTT 
AATAAAGCCG GCGTCGCGCT CTTGATGAGC ACCGGATTAT GGGTGACCTA CGCGGACTTT
TACAACAGTG CCGGTACGGC GTCCACGGCT GTACTGGAGC AACTGGCGGA ACAACTCTCG
GAAGTATCCG ATATTTGCTT TTTCCTCCTG GCCGCTTCGA CAATTGTGGA AGTGGTGGAC
GCCCATCAAG GGTTCAAAGT TGTCACCAAC CAAATAAAGA CCACTTCCAA AAAGTCTCTG
TTTTGGACCA TTGGATTCCT GACCTTCTTT TTGTCGGCCA TTCTTAATAA CTTGACCATC
ACAATTGTCA TGGTCAGTCT ACTGCGCAAG CTCGTGCCCA ACGTAGATGA TCGTCGTTTA
TTCGGAGCCA TGGTTGTCGT GGCGGCCAAC GCTGGTGGTG TTTGGACGCC AATCGGGGAC
GTGACCACGA CCATGCTATG GATTAACAAT CAACTATCAA CGATTCCGAC CGTTCTCGAT
CTCTTTCTAC CGTCGCTAGC ATGCTTGGTA GCTTCCTTGG CCTTTTTGGT CAACAAGGTG
GAAGAAGACG ACTCTTTAAA GGCATCGACA CTACCGGAAC CGACCCCGTT GTCGCAACGT
GGGCAGTTGG TCTTCTACAG TGGAATTGCC GCTCTGTTAT CGGTGCCTGT CTTTAGCGAA
CTGACAGGAC TGCCACCGTA TCTGGCCATG TTAACGGGTC TTGGGGCCAT GTGGACCCTG
ACCGACATCA TTCACATGGG AGACAAAGAG GAAGGGCTCA AAGTGCCGGC GGCCTTGTCC
AAATTAGATA CATCCGGCAT TCTATTTTTC CTCGGAATTC TCATGAGTAT TGGCGCATTG
GACAAGAGCG GCTTGCTCAA AAGTCTAGCC GTCTTTCTGT CGGACAACTT GCCCAGTCTC
GATATTATTG CTACCGTTAT TGGTATCGCA TCGGCCTTGA TCGATAACGT TCCGTTGGTC
GCGGCAACCA TGGGTATGTA TGATCTATCC GAATATGGTA CGGACGATAA ACTCTGGCAG
TTGATCGCGT TGTGTGCTGG TACAGGGGGT TCCATTCTAG TAATTGGCTC CGCCAGTGGC
GTGGCCCTCA TGGGACTGGA GAAGGTGGAC TTTTTGTGGT ACGCCAAGAA TGTTTCGATC
GGAGCCGCGG TAGGGTACTT CGCCGGAATT GCAACATATT TGGCCCAGTA CGCAATCTTT
CACGGTGATC TGCTGA
 
Protein sequence
MIGIFVLGYM GIIFEEVFEF NKAGVALLMS TGLWVTYADF YNSAGTASTA VLEQLAEQLS 
EVSDICFFLL AASTIVEVVD AHQGFKVVTN QIKTTSKKSL FWTIGFLTFF LSAILNNLTI
TIVMVSLLRK LVPNVDDRRL FGAMVVVAAN AGGVWTPIGD VTTTMLWINN QLSTIPTVLD
LFLPSLACLV ASLAFLVNKV EEDDSLKAST LPEPTPLSQR GQLVFYSGIA ALLSVPVFSE
LTGLPPYLAM LTGLGAMWTL TDIIHMGDKE EGLKVPAALS KLDTSGILFF LGILMSIGAL
DKSGLLKSLA VFLSDNLPSL DIIATVIGIA SALIDNVPLV AATMGMYDLS EYGTDDKLWQ
LIALCAGTGG SILVIGSASG VALMGLEKVD FLWYAKNGTS PELQHIWPST QSFTVIC