Gene PHATRDRAFT_42434 details

Gene Information

Locus tag: PHATRDRAFT_42434
Symbol:
ID: 7196621
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 41016
End bp: 42852
Gene Length: 1837 bp
Protein Length: 484 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002176508
Protein GI: 219109507
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCGTCGT CCATGAGCAT GAACGAAACG GGAGCGTCTC CTTCTTCGGG AGGTGCTGTA 
CCTACCACCG GTGACTTTGT GGATCTCCAC GAACTCTCGA AAGGATGGAA AGCAGCAATT
ACCAGAGCCA AGGACCCTTC TACAAGGTAC GCCGTTCCCT TCCGCCTTGT GCAGATCATT
CCTCGAACCG ACAGTCGCGC ACTTGTGTAT ACACACTCAC ACTCGTATAC ACATACTCGT
TGATACACAT TCTCACACAC ACACACTCAC ACCCACATAC ATACATACAT ACGTAGGTAC
GAAGCGCGCG TCAAGAATGA TCGCGGAAAC TTGCCGTTGC ATTCGGCAGC GTCGTTCCGT
GCTCCACTGG AAGTAGCCGA AGCACTCCTC GCGGCCTACC CGGAAGCGGC CTCCATCACC
AACAACTACG GAAACCTGGC ACTGCACTTT ACGGCCTGGA AGAAAGGTCC TCTGGATGTG
GAACAGCTAC TCCTCAAAGT GTTCCCCCAA GGGGCGGCGC AAAAGAACAA CCACGGCAAN
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNT CCATGGAACA
GCGAGATGGC ATGACCGAAG AACGCGAAAT GGTGCTCGCC ATGTTGATGG ATATGAAGGA
AAATCATCCG CACGCCTTAT ACTCGGCGGG CATTGATCCC AAGGCCGTCA CCGACTTGGA
CTCCATGCTG GAACAGGTGC GCAAAGCTTC GGTTGAAGAA ACAATTCCTC CCGGGAGCAG
TGACGAGGAA ATCGATGCAC AGCTTATTGA AGAATCACTA TGTCCTCCGG ATGATCCGGT
CGAAGTGGCG TTGGCCGGTG TGATTGGATG TAGCGCGGTC AAAAACCAAA TCCGCGGACT
GCGACGTACC ATTGAAATAG CCGCAGCTAC TGGTGAAGTC TCCAAAATCC CGCGTCACTT
GGCCTTTATC GGAAACCCAG GAACCGGCAA GACCATGGTG GCCCGCAAAA TGGTCAACAT
CCTGCGTAAC GTGGGAGCCA TACAGAGCCT CAACTTTGTC GAAGTGGGCC GTGAAGATCT
CATTGACAAG AAGAGTGAAG CCCGGACCGT CTTCAAGACA CGAAAAGTAT TGGAACGTGC
CGCCGGAGGG GTCTTGTTTG TCGACGAAGC GTACACTTTG TTGCCCTCAA CGGCCCGTCC
GCGTGGACGC GATCACGGAG CGGCAGCTTT GCGGGAGATT GCCCGAGCAC TCCCCGGCGG
AAACCCACTG GTCATTCTGA CCGGTGCACC CTTGGATCTG CAGCGTGTAC TCTCTAGCGA
CATTGGTTTC AAGGGACACT TTTTGACCAG AATCGAGTTT CCGGATGCGA CCCCTCTACA
GATTGCACAC ATGTTCATGG CAAAACTGTC CGAGAAGGGA CTCATGCCGG CCCAAGGTGT
CACCCCACAG TACCTGGGTG AGCTGATCAA GTCGAATACG GAAGCCGAAT GGAGGCAGGA
GCGAAACGGT CGTATTGCCG ACCTGTTACT GCTGGGCGTC CGGGCCGAAG TCAAGAAACG
TGCCGTCTGG GACGATACCG CGTCCAAGGG ATCGTTGAGC CCCATGAAAA TTCTTAGCCC
GGGATCTTCT CGCATGCCTG CTTTTGCCCC CGAAGAAGTA TTTGTGAATG TGGAAGATAT
TCAGAATGCC ATTGTGAACG GCATGTAAAC GTTCAAATGC TAACCCGATC CTTGTGTTGT
GATTGAGCCG TCGGGCGTCG AGCTTAGACG TGAACGGTAC GATCCGGGTT GCAGACGTCG
TCTAGTTTAA CTACTACATA GCGAGCCGTT GCCGAAT
 
Protein sequence
MASSMSMNET GASPSSGGAV PTTGDFVDLH ELSKGWKAAI TRAKDPSTRY EARVKNDRGN 
LPLHSAASFR APLEVAEALL AAYPEAASIT NNYGNLALHF TAWKKGPLDV EQLLLKRDGM
TEEREMVLAM LMDMKENHPH ALYSAGIDPK AVTDLDSMLE QVRKASVEET IPPGSSDEEI
DAQLIEESLC PPDDPVEVAL AGVIGCSAVK NQIRGLRRTI EIAAATGEVS KIPRHLAFIG
NPGTGKTMVA RKMVNILRNV GAIQSLNFVE VGREDLIDKK SEARTVFKTR KVLERAAGGV
LFVDEAYTLL PSTARPRGRD HGAAALREIA RALPGGNPLV ILTGAPLDLQ RVLSSDIGFK
GHFLTRIEFP DATPLQIAHM FMAKLSEKGL MPAQGVTPQY LGELIKSNTE AEWRQERNGR
IADLLLLGVR AEVKKRAVWD DTASKGSLSP MKILSPGSSR MPAFAPEEVF VNVEDIQNAI
VNGM
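
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any length or composition check. The following is a minimal Python sketch of one way to do that. The helper names `clean_sequence`, `gc_fraction`, and `span_length` are hypothetical and not part of this record or any database API. It also assumes that Start bp and End bp are 1-based, inclusive coordinates (which is consistent with the stated Gene Length) and that `N` marks an unresolved base that should be left out of the GC calculation.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    # str.split() with no arguments splits on any run of whitespace.
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """GC fraction over unambiguous bases only (assumption: N is ignored)."""
    counts = {base: seq.count(base) for base in "ACGT"}
    called = sum(counts.values())
    if called == 0:
        return 0.0
    return (counts["G"] + counts["C"]) / called


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # First few blocks of the gene sequence, pasted as displayed.
    example = "ATGGCGTCGT CCATGAGCAT\nGAACGAAACG"
    seq = clean_sequence(example)
    print(len(seq), gc_fraction(seq))
    print(span_length(41016, 42852))
```

With the coordinates from this record, `span_length(41016, 42852)` gives 1837, matching the listed Gene Length. Because the gene is marked as spliced, the gene sequence includes intronic bases, so translating the cleaned gene sequence directly is not expected to reproduce the protein sequence.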