Gene PHATRDRAFT_42569 details


Gene Information

Locus tag: PHATRDRAFT_42569
Symbol:
ID: 7195952
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011669
Strand:
Start bp: 459027
End bp: 461095
Gene Length: 2069 bp
Protein Length: 427 aa
Translation table:
GC content: 48%
IMG OID:
Product: predicted protein
Protein accession: XP_002177091
Protein GI: 219110679
COG category:
COG ID:
TIGRFAM ID:
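
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive (the record does not state this; the variable names are illustrative only). It also estimates how many bases the 427 aa protein accounts for, assuming one stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 459027
end_bp = 461095
gene_length_bp = 2069
protein_length_aa = 427

# Assumption: coordinates are 1-based and inclusive, so the span
# is end - start + 1.
span_bp = end_bp - start_bp + 1

# Coding bases implied by the protein length, plus one stop codon
# (assumption: a single in-frame stop codon ends the CDS).
coding_bp = (protein_length_aa + 1) * 3

# Bases in the reported gene span that the protein does not account for.
non_coding_bp = span_bp - coding_bp

print("span matches Gene Length:", span_bp == gene_length_bp)
print("coding bases:", coding_bp)
print("remaining bases:", non_coding_bp)
```

Under these assumptions the span agrees with the reported Gene Length, and the bases left over are consistent with the record marking the gene as spliced, since the reported span may include introns or flanking untranslated sequence.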


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.191925
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
CATCAAGCCA GGCCCGACAC GACGAATGTG ACGGAAAATA ACAAAAACAT AAGGCCAAAG 
GGTTTCAAAG CGTTGCTTCA CCCTCGACTC CTGCCGACCG TAATCTAGCA TATCTTTGCA
GTAAAGTGCA TGCAATCTAC AAGCCTCCTT AGATGTCAGA CCTTGTCCAA GAAAATATCA
ACTCGTCCTT CACTCTCTGA ACGACGTGTC CTGCTGGCGT ATTTCCTTTT TCTGCTTGTG
CTGCTGGCGC TTTTCGTGCG ATCTGATGTC GAACGGAAAA CCTAAAACGC CAACCGCGGC
ATTGAATGAT CAAGCTGCTG AAGAAAGTGA GGGACCTTCT ACAAATTCAG AAAACGAGAC
ATTGACTGCG TTTGATGTGC TGGGAAACGG GGCGACTAAC AGTGAAAGTG ACGGGCGGAA
CGACAGATCT GCCGAAAGTA AAGCTTTGAA AACGGAACAA AAGTGGGAAG AACATTTTAA
AAAACTGGTG GCTTTCAAGA AACGGTTCGG ACACTGCCTT GTCCCCAATC GTTACAACGA
CGACTTTCAT CTTGGTAGTT GGGGTATGTG TGTATGGTGA TTGTGATTCA GACTTTTCGT
CTTGGAACTT ATTTATCTCG CGTTTTTTCC AATGATATTC AGTTTCCACA CAGCGCCGTT
ATTATAAGAT CTTATTATCT GGAACCAGTG CATCTACGCC AATGACTGCG GAAAGAGCTA
AAAGGCTGAC GCAGTTGGGA TTTTCTTGGG CAACAAAGGA TCCACGCCAT GTACCGTGGG
ACGATCGGTA CCAGGAACTC GTGGTTTTTG TCGTAAGTAA TTATGAGTAT TTCTTACGCG
GTGACGTTGT GCATGGCATA CATTAAAAAC TTCCCCGTCT ACTCCTTATG TAGAGAGAAT
ACGGCCATAC ACAGGTTCCC ATTGGTTGGC AAAAGAACAC AAAGCTGGCA AACTGGGTTT
CCACACAGGT AAGCAGGTTG TGACATTGCC TTGCCTAGTC ACACGTCGAT ACGCATTCTT
CAAACACAAT TGAGATCTCG CGTTTCACAG AGACAAGAGT TCAAGCTGTT GCACAAAGGA
CGTTCCTCAA GATTGACCCA AGATCGCATT GACAAACTGA ATGCAATAGA CTTTGTCTGG
GAAGCTCAGC GAGGGGGTCC GCGCCGAGGT CCAAAGGCTT GTACGGTCGG GAAGGTATCA
GAAAAAGCAA ATCCGGTTCC GGGTGTGGGG CCACGTTCAA ATGCGTTAAT TAGTCTCTCT
ACAAAATTAG AAGCAGAATC GAGAGGGTGC ACTACACAGA CAATGAAGTT GTGTGATGTT
GGGGTAGGTA TTCAACAGGT ACCATATCTG GGTCGCGGTG AATCCAGGCC GGCTCCAGCG
CAGCCGGTAG TGCTCGGAAC GTTGTCTGTT GCCCAGTTGT TGGAGCTGCA ACAAGCGGTA
GAAGTAGCAC AAAGACCGTG GCAGGTTCTA GCACATGGAG GATTTCACCA AGGGCAGCTC
CCACAACCGC TGCTGTCTTC GCAAACTGTC AGTGCGCAGT TTGGATTCCA GCCAATTCCG
AAACCCTACG AGAGTCAACT CATGCCTAAT GAACCACGGA ATTTGCAGCT ACCTCGTACA
TCCAGCAGCA ACTTTCCGAT TTCTGCTTTG ACGCAACTGC ATCAGTCATC CAGCGGAATG
AGTTTACCAC CTAACATTGT CAACACGAGC CAGGTGGCGG ACCAAGTTTT GATTCAGAGA
CTTTTGCTCG ATCAGCAGCA GTCAGAGTAC TCCTACCATT CCGATCAATG AAACCGTGCA
AACGCTGCTT CGGGACTTGC AATCGCTGAC GCGTTCGTCG GCGACGAATG GTGCGCCAGG
AAACTAGTGC CAATCCAATT CTTTGTGTAG TTCGCCATTT TCTGGTCTTG CTCGAAAGAG
GGAGTCTGTG TCACCAAAAA TCCTATTCCA GCATCAACTT TCAATTACAG TTAACTGTAA
GACCTTCCCG AAAAAGGATA CGGTCCGTGA AGGGGCCAGG TCCTCGTAGA GGTGACATAA
TACAACCTGA CGAAGCATTC TGTAGCCTC
 
Protein sequence
MSNGKPKTPT AALNDQAAEE SEGPSTNSEN ETLTAFDVLG NGATNSESDG RNDRSAESKA 
LKTEQKWEEH FKKLVAFKKR FGHCLVPNRY NDDFHLGSWV STQRRYYKIL LSGTSASTPM
TAERAKRLTQ LGFSWATKDP RHVPWDDRYQ ELVVFVREYG HTQVPIGWQK NTKLANWVST
QRQEFKLLHK GRSSRLTQDR IDKLNAIDFV WEAQRGGPRR GPKACTVGKV SEKANPVPGV
GPRSNALISL STKLEAESRG CTTQTMKLCD VGVGIQQVPY LGRGESRPAP AQPVVLGTLS
VAQLLELQQA VEVAQRPWQV LAHGGFHQGQ LPQPLLSSQT VSAQFGFQPI PKPYESQLMP
NEPRNLQLPR TSSSNFPISA LTQLHQSSSG MSLPPNIVNT SQVADQVLIQ RLLLDQQQSE
YSYHSDQ
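
The sequences above are printed in blocks of ten residues separated by spaces, so they need cleaning before any length or composition check. The following is a minimal sketch using only the first line of the gene sequence; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool. Applying the same functions to the full block would allow a comparison against the reported Gene Length and GC content.

```python
# First line of the "Gene sequence" block above, copied verbatim
# (including the block spacing and trailing space).
raw_line = "CATCAAGCCA GGCCCGACAC GACGAATGTG ACGGAAAATA ACAAAAACAT AAGGCCAAAG "


def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the residues."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


seq = clean_sequence(raw_line)
print("bases in first line:", len(seq))
print("GC fraction of first line:", round(gc_fraction(seq), 3))
```

The GC fraction of a single line is not expected to match the gene-wide figure; only the full cleaned sequence is comparable with the GC content field.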