Gene PHATRDRAFT_22453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_22453 
Symbol 
ID7203628 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011685 
Strand
Start bp525396 
End bp528716 
Gene Length3321 bp 
Protein Length622 aa 
Translation table 
GC content50% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002182977 
Protein GI219125414 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGCAG CCCTGGCGTC GGATCGTCTC GGAGCGGATT CAATTATACT TGCGGCTCTA 
ACGGCGCTCA TGGCTGCTCA GGTTATCACG GTGAAAGAGG GCTTTCAAGG TTTCGCCAAC
GAAGGAGTAC TAACTGTCCT TGCTCTGTTC GTCGTTGCTG AGGGGATCAC CAAGACCGGA
GCGCTCGATT GGTACATTGG AAAGGTTTTA GGAAGCCCCA CCAATGCAAG TTCAGCACAG
CTCCGTATCC TCGCACCAGT AACGTTTGCA TCTGCCTTCC TGAACAATAC TCCTATTGTT
GTCGTGATGA TCCCAATCAT CCAAAAATGG GCCAGGAACA TACATATTCC GGTTCAACAA
CTCATGATTC CGCTCTCGTT TGCGTCGATT TTGGGGGGAA CTTGTACACT GATTGGAACG
AGTACCAACT TAGTCGTGGT GGGTCTTTTG GAAGAGCGCT ATCCCAACGA TCCAGATGTC
AGCATCGGCC TCTTCGACAT TGGACTTTAC GGCGTCCCAG TCGCTCTCGT GGGGATGGCA
TATATTCTTT TTGCTTCCCC TTATTTGCTG CCTGGTGGAG GGGGGCAGTC TCAAGATGCC
CTAAATTCTT TGGAGAAAAA TGAAGAGATT TTGCTTGGAG CCCGGCTGAC GTCGTGGTCC
CCTGCAGTTT CGAGGACTGT CAAGCGTAGT GGTCTGCGTG ACACTGGAGG CATCTACCTG
GTTTCTGTAG TACGGGCAGC AACGGGCCAC GTTCACAGAG CCGTTTCGCA TGATTTCGTA
CTGAATGTCG GGGACGTCCT TTACTTTACT GGACTTGTTG AGAGTTTCGG AGAGTTCTGT
GAAGAACACG AACTTGAGCT ATTAACAGTC GACCATGATT TTCACGAAAC GAGTATCGAT
GGCACAGTTC TTACACTTCC AGAAAAAAAA CAAGTGAACT GGGAGTCATC TGACGATCAA
GATTGTGATC TCACAAAGAG ATCTGTGGTC ACTAGATTTT CTGAAGAGGG TGAAAGCCAC
CGTCTGATTA ATCGAATGAC AGACATAATC CGAGGTGCAG AGTCTGGGGA AGAAGGGACT
TATGCTATCC ACGAATCTCG ATTTATTGAT GACCCTGCGA AAATTGTGGT CACAATCGAC
AAGGCCCTGG TGGTGGTTGG AATTGACGTC CAAGATCGAC CCGGGCTCAT GGTGGACATT
TCAAAGGGAC TTTTGCGCCT AAATCTGGAG TTGCACCATA CCGAGGCCGC CGTTATTAAA
GGGCGTTCCC TTTCCATTTG GCGCTGTGAA GTGATCGGAC CACAAATACA CGACCAAGAA
CAAATTTGGT CGGGTATCGA GTCGATTCTT GCCACTTGGT CCGGTATCAG TGCCCTCAAA
CACAGAGGTC TCCGTGTGAT TCGAACGCGA GTTGTTAAAG GATCTAGATT GGTCGGTCAC
ACAGGAGCAG AAGTCGACTT TCGGCAAACG TACGAGGCTG GTATTGTCGC TCTGCAAAAA
AAGGGGAAAA ACGCAGAAAT TCCTCTGTCA AGAGTCCGGT TTGAAGTTGG CGATATTCTT
GTTTTGCAAG CAAATGATGC CTCGCCACTG TTGAAGATTC CTCCCGCGGA ATTCTACATG
TATCTTTCGG AAGGCTCAGG AACGGATGAG AGCATACCCA GAACTTCTTC TGTAAGAAGC
ATGGTAAATA TGGTGACGCG GCGAAAGGCT AGCACTGATG TTACCCCGGA ACTCGCGAGC
GCTAGGAAAG TAATCGACAA TTCCCACCTC GCACATCATG GTGACGAGGA AGATCCTGCC
GTTGTTGATC TGCCGGAATT AGTGCAACAA ATTGAAGAAC AGGAGGCGGT TTGGAAAGAT
CTGCAGCTCC TCTTGCCTGA CGAAGGGATA CATAGCGGTG ATGGAGCAGC TCGCGAGTTT
CTTACCGCGA TGCAAGTTGC CCCAAAATCC AAGTTGTCGG GGAAAACCGT TGCAAAAAGT
GGCATCGACA GGCTTCCAGC ACTTTTCTTG GTAAGTATCG AACGCCCCAT CCCTGCAGGG
ACCTCTTTGC CAACGAAGAA CAAAAGACTA TCAGTGATGT CTGGCGCATC CGATGCGCAT
TCTCTGGGAG AGGACAGCAA TCAGCGCCTT GGCTCGATTC AAACAGACAA TCAGGCATAC
CAATCCATTG CTCCAGAGGA GCCGCTTCAG CACGGAGATG TTCTATGGTT CTCCGGCTCT
GCATCGTCCG TTGGCGATCT GCGCAAGATT CCAGGGTTGA TCTCGTATCA AAACGATGAG
GTGGAGAAAA TCAACGAGAA GGTCCATGAT AGACGTCTGG TTCAGGCTGT CATTGCCAGA
AAAGGACCAT TGGTCGGGAA GACTGTGAAG GAGGTCCAGT TCCGGAAGCG GTATGGAGCC
GCGGTGATTG CTGTACATCG CGAAGGCAAG CGTGTGCACG AGCATCCGGG GAACGTGAAG
TTGCAAGCAG GTGATGTGCT GTTACTGGAG GCGGGTCCTT CGTTCATCGC CAAGAGTGGT
GAGAACGACA GATCGTTTGC TCTGCTAGCT GAAGTGGAGG ACTCGGCCCC TCCTCGTTTG
AGTCTTTTGA TTCCTGCGTT GTTGATCACG GCAGGGATGC TGGTTGTATT TATGGCTGAC
TGGACGTCGC TATTGGTTTC TGCACTAGTG GCTTCAATGT TGATGGTAGC TCTTGGTATT
TTGTCAGAAC AGGAGGCTCG GGATGCGGTG AATTGGGAAG TATTTATAAC TGTTGCTGCA
GCCTTCGGCA TTGGTACAGC TCTTGTCAAC TCAGGGGTGG CAGGAGGGAT TGCTAACTTT
TTGGTGGACG TAGGTACTGC ATTAGGTATC GGGGAGGCAG GGTTGCTTGG AGCCGTGTAC
TTCGCAACCT TTCTTATTTC AAATGTGGTC ACGAACAATG CAGCGGCGGC TCTGTTGTTC
CCTGTCGCGT TGAATGCAGC GGAGCAGACA GGCACTGATC GTGTTTTGAT GAGTTATGCG
TTGATGTTGG GCGCGTCAGC CAGCTTTATG TCACCTTTCG GCTACACAAC GAATTTGCTG
ATCTACGGTC CTGGAGGATA CAAGTACAAA GACTTCCTTT TGTTTGGAAC CCCAATGCAG
ATCGTGTTGT GGGTAGCGTC AATTGCCTTC TTGGCGATCA TTGAGCCTTG GTACATTAGT
TGGATCGCTG CAGCGGCCAT TCTTGGGATT ATCATTGCCC TCCGGTTATT CTGTCTTTCG
CGCACAGCCC TCAGAGCTGG GGAGGAAAAA TAAGCTCACT ACTCATGCTA CATAATTAAG
ATAGGTTTTG TTTTCTTTGC T
 
Protein sequence
MFAALASDRL GADSIILAAL TALMAAQVIT VKEGFQGFAN EGVLTVLALF VVAEGITKTG 
ALDWYIGKVL GSPTNASSAQ LRILAPVTFA SAFLNNTPIV VVMIPIIQKW ARNIHIPVQQ
LMIPLSFASI LGGTCTLIGT STNLVVVGLL EERYPNDPDV SIGLFDIGLY GVPVALVGMA
YILFASPYLL PGGGGQSQDA LNSLEKNEEI LLGARLTSWS PAVSRTVKRS GLRDTGGIYL
VSVVRAATGH AYQSIAPEEP LQHGDVLWFS GSASSVGDLR KIPGLISYQN DEVEKINEKV
HDRRLVQAVI ARKGPLVGKT VKEVQFRKRY GAAVIAVHRE GKRVHEHPGN VKLQAGDVLL
LEAGPSFIAK SGENDRSFAL LAEVEDSAPP RLSLLIPALL ITAGMLVVFM ADWTSLLVSA
LVASMLMVAL GILSEQEARD AVNWEVFITV AAAFGIGTAL VNSGVAGGIA NFLVDVGTAL
GIGEAGLLGA VYFATFLISN VVTNNAAAAL LFPVALNAAE QTGTDRVLMS YALMLGASAS
FMSPFGYTTN LLIYGPGGYK YKDFLLFGTP MQIVLWVASI AFLAIIEPWY ISWIAAAAIL
GIIIALRLFC LSRTALRAGE EK