Gene PHATRDRAFT_43489 details

Gene Information

Locus tag: PHATRDRAFT_43489
Symbol:
ID: 7197185
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011670
Strand:
Start bp: 595318
End bp: 599067
Gene Length: 3750 bp
Protein Length: 425 aa
Translation table:
GC content: 53%
IMG OID:
Product: predicted protein
Protein accession: XP_002177648
Protein GI: 219111793
COG category:
COG ID:
TIGRFAM ID:
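
The Gene Length value agrees with the coordinates above if Start bp and End bp are read as 1-based, inclusive positions on the replicon. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check (the function name span_length is hypothetical):

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
print(span_length(595318, 599067))  # 3750
```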


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0573784
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTCTCAAAAA AGTTCTCTAT CAAGGAGAAA ATTGGGGTAA TGAGTTCTGG GCACGCAGTG 
TGACTCTTCT CTATCACACA CGCATCGCCT TTTATAGTCA AGCCGAGATT TCTTCTCTTC
CCAGCCAGCG CTCCTGGCAA TGTTGCTCCG TCGAAAGCAA CCATGTTGGG TCCGGTCTCA
GATCTTTGTA TAGTAGAAAA TGATCCCTAC CCTTGGTAGC TCCCGGATGT AGATTCAATG
AGGAGGTAAA GTCCAAAAAG ACATCCCGAT CGGGTGCAAT TTGGTAGCCC GCTTTGTGGT
ATAATTGCAA GGCACTGTGA TTCATCGTGT CCACGTGCAG ATAGAGCGTT TCGACGTGAC
GATCCCTCGC CAAACGGTCC ATGGCGTCCA ATAAACGCAA TCCAATGCCT TGCCGTCGCG
CGGACGGGTG GACCGCCACT TCCGTCAAGT ACAGTAGTCC GTACCGTGGC CGGCGGGAGC
CTAACTGTGT ATTGTAGAAT TCGTGAAAGC TACATTCGGC ACTACCCAGA ATAACGCGCG
ACGTGGTGCT TGGTTCGGTA TGCGCGGTCG CTTTGGTCGC AACGATGCAG GTGGCGCCCC
GCATCCGCCG TCGGACCATG GCTTGACAGG ACCGGGAGCA GAACTGGGCC TGTAAGTCGG
CCGAAAAGTC GGAAAAGACG GATAGACGCA AGTTGGCAAT ATCCACGTCG TCCAACGGAG
TAGCCAACTG CACCGAGACG GCTCCATCCT GCGCAAGGAT GATGGTGCGG GGAGATTCGA
TTGGTTCGCC TAAAAGTCCC ATCTCGGAGT GAGTGAGCGA AGAGAACGCG GTAGCGAGTC
GATCGTGCTG CGTACTGAAT GAGGTGGTAT CCAATTTCCT GTGCGGGGAT CACTGTTTCT
ACGACTTCCC AATAAGTCAT CAATAACAGA GTGAATAATA GATTTTGTGG TGGAGGCTGC
CGTGTTGGCT AACGCTGATG ACCGGAGCGA TGCTTTGGTG GGCGGGTCCG TATGATCCTT
TCGGGGAGCA GCTTTGCGCA AAATGTCCAA AACCTTGTGG GTGGCTCGTT CTGCTTGCCG
GGCATGGGTG GCACTGCCAA TCCGACGAAC TCCCACACTG GTGGGTTTTC CTGTCGCATT
AGCGTTCCTT CGTTGTGGCC TCGTACCAGC GGCACGGTTG GCCAACAGTG CGGCTCGTAT
CCTGGCAGCC TTGCGTTGAT CGTCGGGAGC CGCTTCGGTC CGTGTGAGCG AGTCGTCGCC
GCGTCCCGCG GGTTGGGCGG ATCCGGATTC TGGTGGTCGG TCCTCCTCCT CCTCGCGAGA
CGCAGAATTC TTGGGAACGT CGTCGACATC ACTTTTCGAG GCGGACGGGG TGCATGATCG
TGCGTCGTCT GGTCGTATCG GATGATCTCG TGATTCCGAC CAAGATCGTT TGGTGGTGAC
GGTGGGTACT GGTGTACCAG TGGAGACGCA AGACGGTGTG GCAGGTACCA TGGTAAGGCG
CGCAAAGGGA GAAGAGAACG CCCGGTCCCG AGATGGACCA TCAACGAAAA CAGACGATTG
GTCAGCGCCG GAGACGGGTT GGGTAGAGGG ATTGGTTCGA AAGGGCATTC CTGGAGAGGA
ATGGAACGCC CCACAGACCT TCCCGGAATG TATCGACACG AGACAGCAGA GTAGAAGAAC
AGCAATGACG TGCCATCGGC GCACCATTGC GGACGGCAGA CCGTACCGAG AGTTTGTCCG
GCTAAGGGCG TTGGTAATGC TCGTTGCGGT ACACACCCGT GAACTCGCAA AGTTCGGAAG
CGCAGCGCTC GACGGAATTC GGCTGAGAAT GAGCGGCGGT AGGAGACGAT TCTTGGACTG
CGATCTTCAA CTGCCTAGCT AATGTAGTAA TGAGCAGGAG TCGTCGGGAT GGCAAAGAAC
AGGAACGGAA TTGTGGAAAC TTCCTGCAAC TGTAAGCAAT CTCTTTTTCG ATACTCCTGT
CGCAGCACCG GGTCGACCCA GACGACCCTG ACGACCCCTG TAGTTTCCAG GTAGGGCACC
AGCCAGCTCA CACTCACAGT CAACGGAATA TCGGTACCCT CCTATTGTCT GTTTGTCTGT
TTGTCTGTCT GTACGTCTGA CCAGCAGTAC CGAACGGGAC TTTTTCCAAC TGGCTCTTCA
CCAACGTTGC AATCACCAAC CACAAAAGGT GTTCCAGGTG CCAGTATCTT TTGGAACGGC
CAGTACGACG TACAAACACA GGGAGCAGAC GTCGGCAGCG CTGGATTCGA ATTTGGTTCC
TCACGCTCCC CTTTCAAGTG CTCTTTCATC GGACACTAAC ACTAACAAAT TCTCTACCCT
CGGCAACTAC CTACCTCCTT TCCTAAGTTT GTTATTGGTG TGTGGTACTA TTGTTGCTCT
TGCTGTCATT ACCATTTGTT GTGGATGCCC ATTGTGGAAG CTGCGAGTCA CTTTTTAGAA
AATTGATTCC CCATGACCAG CCCTTCCGTT GCGATCACCC GAGCCGCTAC ATCGACCCCT
TTGGTGTCCC GAGCCTTGGT CGTTCTGCCG TGTTGGTTGT CCCTATCCGT TCCTTTTCAA
AGGCGGCCAC AAGCGGGACT TACCCAAAGG GTCGGTCGAA CCATTTCCAC AACGACTCGT
TACACGGTAA AAACGCGGCA ACATCACCCA TCGAGTCTCC CTTGGTACTA CCATTCCACC
CGATTCTTCT CGGCCAATCT TTCTTCCTCC CACGAAACCC ACAGTGATGA CGTTCGGAAT
GGATCGAATG AAGACAACCA AGATGACGAA GAAGAAGAAG AAGAAGAAGA AATGGAAGAC
AACGCCGATG AGCAAACAAA GGCAGCCCTG CAAATCCGCA ACGAGATTAT TTGGCAGAAA
AGATTCATGG CCCTGGAAGC GTTCGTGGCG ACCCAGACCC GCGACGAACA AGGGACTCTA
CCCTACCCAG AGGACCGCTC CATGCGTACC TGGTTAGACA AGCAGCGGCA TTTGTTTCAC
CTCAAGATGC AGGGCGAAAG CTCGTCGCTC ACGGATGCTC GTTCAGCCCA ACTCGAGTCA
CTGGGCTATC CACTCAGTCC CCGGGACGAT TGGTGGGAAA AACGTTACGA AGATGTACGA
GCGTTTGTCC AAAAACACAA TCGCTTTCCT TACGATATGG ATGACAGTTA CATGACTGAA
GAAGAAAAAC GATTGCTTTG GTGGTGTCGT CTGCAAAAGA AGCAGTACAA GGCATGGAAA
GAGCAAGACG ACGATTCGTT GACTGGAATG AACGAAGCAC GAGAAGCCAA ACTGAACGAG
ATTGGCTTTT GTTGGGATGC TCATCAAGCG TCCTGGTTGG CTCGATACGA AGAACTAAAA
GCTTATCACG CCCATCACGG AGATTGTCTC GTGCCGAAGG ACTATCCGAC GAATCCCCCT
CTGAGCAAAT GGGTGAGCGA TCAAAGGAAC AACATGGCTC GTTCCCGCAA AGGAATAATA
AAAGTTAATC CGGAGCGACT TCAACTCCTA AAAGAATTGG ATTTCGAATG GAATGCACTA
GAAGAATTTT GGAATCGGAA GTACAAAGAG TATGCTGAGT ACGTGCGATT ACATGGGCCA
GGTAGCATGC CTCGCCAAAA ACACAATCCT CATTTACGGA ACTGGCTTAC TTATCAACGA
AGGCAGTATC AGTTGTTGTT GAATGGGCAG AAGAGCTGCA TGACACAGAA ACGCAAGGAT
CTTTTGGACG CATTGGGTTT TATCGTTTGA
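
A GC content figure like the one in the Gene Information section can be recomputed from a sequence written in this block layout. The sketch below is one plausible way to do it in Python. It assumes only A, C, G and T are counted and that spaces and line breaks are layout only. How the database itself rounds or treats ambiguous bases is not stated in the record, and the function name gc_percent is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence."""
    # Drop the spaces and line breaks used to group the sequence into blocks of ten.
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no A/C/G/T bases found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two blocks of the gene sequence above, used only as a short demonstration.
print(gc_percent("CTCTCAAAAA AGTTCTCTAT"))  # 30.0
```

Passing the full gene sequence as a single string gives the value to compare with the reported figure.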
 
Protein sequence
MTSPSVAITR AATSTPLVSR ALVVLPCWLS LSVPFQRRPQ AGLTQRVGRT ISTTTRYTVK 
TRQHHPSSLP WYYHSTRFFS ANLSSSHETH SDDVRNGSNE DNQDDEEEEE EEEMEDNADE
QTKAALQIRN EIIWQKRFMA LEAFVATQTR DEQGTLPYPE DRSMRTWLDK QRHLFHLKMQ
GESSSLTDAR SAQLESLGYP LSPRDDWWEK RYEDVRAFVQ KHNRFPYDMD DSYMTEEEKR
LLWWCRLQKK QYKAWKEQDD DSLTGMNEAR EAKLNEIGFC WDAHQASWLA RYEELKAYHA
HHGDCLVPKD YPTNPPLSKW VSDQRNNMAR SRKGIIKVNP ERLQLLKELD FEWNALEEFW
NRKYKEYAEY VRLHGPGSMP RQKHNPHLRN WLTYQRRQYQ LLLNGQKSCM TQKRKDLLDA
LGFIV
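
The protein sequence can be cross-checked against the gene sequence by translating the stretch that begins at ATGACCAG and runs to the final TGA. The Python sketch below assumes the standard genetic code (NCBI translation table 1), because the record leaves the Translation table field empty. The names CODON_TABLE and translate are hypothetical, and the two inputs are short excerpts of the gene sequence regrouped into blocks of ten.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with bases T, C, A, G.
# Using table 1 is an assumption: the record leaves "Translation table" empty.
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0, stopping before the first stop codon."""
    # Keep only bases; spaces and line breaks are layout.
    seq = "".join(b for b in dna.upper() if b in "ACGT")
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the coding stretch, starting at its ATG.
print(translate("ATGACCAGCC CTTCCGTTGC GATCACCCGA"))  # MTSPSVAITR
# Last 30 bases of the gene sequence, ending in the TGA stop codon.
print(translate("CTTTTGGACG CATTGGGTTT TATCGTTTGA"))  # LLDALGFIV
```

Both outputs match the start and end of the protein sequence listed above.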