Gene PHATRDRAFT_42820 details

Gene Information

Locus tag: PHATRDRAFT_42820
Symbol:
ID: 7196481
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 1224325
End bp: 1225859
Gene Length: 1535 bp
Protein Length: 419 aa
Translation table:
GC content: 47%
IMG OID:
Product: predicted protein
Protein accession: XP_002177244
Protein GI: 219110985
COG category:
COG ID:
TIGRFAM ID:
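
The Start bp, End bp and Gene Length fields above are consistent with inclusive coordinates, where the length is the end minus the start plus one. A minimal Python sketch of that check follows; `span_length` is a hypothetical helper name used only for illustration, not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
length = span_length(1224325, 1225859)
print(length)  # 1535, matching the Gene Length field
```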


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.749474
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GCTAAGGTGT GTGACGATTA GCACAATATG GAGCACGATA CGGGAGGTTC TTTTCTTTCC 
CCAGCGGAGC GAACGATGCT TCTAAAAGAA ATCGAAGTTT TCGAGAAGCT GGAAGCATCA
CTCAAATCAA CGAAACGCGG GAATCTTCGC AAGCCACGTT CTGTACACGC AGGGTCAAAT
AAGCCCAGGA CGAGAGATTC GGGTTTTCGA GAAGCTCCGC GCAGCACTGG TTGTCTCGTT
TGTGGTATCG ATAGGGATCA CACGAATATT CTTTTGTGTG AAGGCTGCAA TGGTGAATAT
CATACATACT GTCTTTTGCC GCCCTTGAAA TCAATTCCGC AAGATGACTG GTTTTGTGGT
GAGCTCGACG TACTAAATAA TGTTCATTCA GCTCATCGTT TTGTCCATCA AGAAAGCTCA
CTCGAATCTC ATTGATTTCT ACAGACAACT GTCTTCCAGA TGACGGAGAT GGGCTCGAGC
AGCTCGTAAG CTCATTGCCG CCGAATTTTA CTACCAGATT CGCTGAGATA TGCTGGGCTC
AAGGAGGCAA TGGCTACGGC TGGTGGCCTT GCTGCATCTA CGACCCAAGA TTAACTGTTG
GAGATGCTCG AGTCCTCGCT CGCAAAAACT TGGGCAAGAA GCACTTGGTA TATTTCTTTC
AATGCGAGGA GGCCCCTTTT GCGGTGCTTC CTACGACAAA AATTCAAGGC TGGACGGAAG
GCTTGGTTGA TAGCTTTTAT ATGGGAAAGG CGGCGAAGGC TGCAGGGAAA TGCCGTTATA
TCCAATTTCG AAAGGCTTTC CAAGCCGCAA TAATTGAAGA GAGCAAACCT CTGACACAAA
GATTAGAATG GAACCAGCTT GGATTTCCGC CCCAAGCTTC GTTAGCAGCA AGACCGGCAT
CGCCGCAGAA GACTCCGGTG AACGCGAAAA AGCGTCCAAC GGACTGTCTC ATTGAAGGTA
GCAGAACTAA GCGTGCAAAA AGTAGTGTCA GTCTCGACTT GGCACGAAAT GATATAATGG
AGAAGCGGGA GCCTGCACGT TACGTCGAAA CTTCGGAAAG CGGTACTGAA ATGTTTTGTA
AAGTTAAACG AAAGTTATTG GGGGCCGTAG ATAAAGTGGA GATTGGATTT GTGTTGTTAC
CGTGTCGATT CACTTCGACC TTCGCGGACG TGCGGAGAGC CATTTCTATT GATCTTGACG
AAGAATTGCC TTCAAATTGG GATTGGAGAT TTTACGTGCC ACCCCTTGGA CCGCTGAGCA
TAAAACAAGA ATCCAGATTT GGTGCAATGC TATCTTTCTT GCGGAAAGCA GCACCCCATT
CGGATATCGG AGAAGGGAGC CTACAAAAAC CCGCACAAGT CGTGCTGGTC GATGCGCCTC
AAAACTAATT TGCAACAAAC ATCGATTGTC GGCTTGAAGT AAATTGCTGA ACCGAAAGCT
CCCTTACTCA CGCTAATTTT CGAGTGGAGA AACACTAAAG GCGACCGTCA TGGAGTGGAC
TGTTCGTAAG GTTAAAAAGT GCCATTTGAC TTTTA
 
Protein sequence
MEHDTGGSFL SPAERTMLLK EIEVFEKLEA SLKSTKRGNL RKPRSVHAGS NKPRTRDSGF 
REAPRSTGCL VCGIDRDHTN ILLCEGCNGE YHTYCLLPPL KSIPQDDWFC DDGDGLEQLV
SSLPPNFTTR FAEICWAQGG NGYGWWPCCI YDPRLTVGDA RVLARKNLGK KHLVYFFQCE
EAPFAVLPTT KIQGWTEGLV DSFYMGKAAK AAGKCRYIQF RKAFQAAIIE ESKPLTQRLE
WNQLGFPPQA SLAARPASPQ KTPVNAKKRP TDCLIEGSRT KRAKSSVSLD LARNDIMEKR
EPARYVETSE SGTEMFCKVK RKLLGAVDKV EIGFVLLPCR FTSTFADVRR AISIDLDEEL
PSNWDWRFYV PPLGPLSIKQ ESRFGAMLSF LRKAAPHSDI GEGSLQKPAQ VVLVDAPQN
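
The sequences above are printed in blocks of ten residues, so the whitespace has to be removed before measuring length or base composition. The sketch below shows one way to do that, assuming the sequence text has been pasted into a Python string. `clean_sequence` and `gc_fraction` are hypothetical helper names, and the excerpt is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above, used here as a short excerpt.
excerpt = "GCTAAGGTGT GTGACGATTA GCACAATATG GAGCACGATA CGGGAGGTTC TTTTCTTTCC"
dna = clean_sequence(excerpt)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Running the same two functions on the complete gene sequence would give values that can be compared with the Gene Length and GC content fields of the record.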