Gene PHATRDRAFT_21430 details


Gene Information

Locus tag: PHATRDRAFT_21430
Symbol:
ID: 7202186
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011680
Strand:
Start bp: 780220
End bp: 782120
Gene Length: 1901 bp
Protein Length: 541 aa
Translation table:
GC content: 54%
IMG OID:
Product: predicted protein
Protein accession: XP_002181442
Protein GI: 219122207
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.368988
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TAGATTCTTC TTCCACTTTT GTCTAGATTT CGTACAGTTT CGATCCTTTG TCTCCATTTT 
CTGCTTCTCT CTCTCTCTCT CTTCAAGAAA CACAACCACA CGGCCTCATC ATGTCGATGG
TCTACGGTAC GTTGCGTGTG TGCGATGGGA TGCGATGCAA TGCAGTATTC AACAACGACA
AGCACACGAA TCGAGACAGT GTCACACACA CAAACAAACA CAAACCCTAC ACTCGCGCAA
ATACTCAACT CACCACCGCT GTGTTCTTCC ACTTCTTCTG TTTCCGACTA GATGAATACG
GCCGGCCGTT CATTATTCTA AAAGAGCAAC AGGCCAAGGC CCGTGTGAAA GGCACGGAAG
CGACCAAGTC GAACATTCTC GCTGCCCGTA GCATCTCCAA CATGCTTCGC ACCTCGCTCG
GTCCCAAAGG TCTGGACAAG ATGCTCGTCA GTCCCGACGG CGACGTTACC ATTACCAACG
ATGGAGCCAC CATTCTAGAG CAACTGCACG TTGATCACCA GGTTGCCAAG CTCATGGTGG
AGCTCTCGCA GTCCCAAGAC GACGAAATTG GTGACGGCAC GACGGGAGTC GTCGTTCTGG
CCGGAGCGCT CCTGGAACAA GCCGAGGTGC TTTTGAAAAA AGGCATTCAT CCCATTCGCG
TGGCGGAAGG ACTGGAAAAG GCCGCCGATG TGGCCATGCA GACGCTCGCC GAAATCGCCG
AGCCCATGGA CATTGCCGTC AACAACCACG CCGCCCTCGT TGCGACCGCG ATGACCACAC
TCAGCAGCAA GATCCTGCAC CAACACAAGC ACAAAATGGC CGACATTGCC GTCCGCGCCG
TCCTGCAAGT TGCCGATCTC GAACGGCGGG ACGTCAACTT TGAGCATATA CGTGTAGAAG
GGAAGACGGG AGGGAGTCTG GAAGACGCCG AACTCGTCAA CGGTATCGTT ATTGATAAGG
AAATCGCGCA TCCGCAAATG CCCAAGATTA TCGAAGACGC CAAACTCTGC ATCTTGACTT
GTCCGTTCGA ACCACCCAAA CCAAAGACGA AACACAAGCT CGAGATTGAC AGCAAGGAAG
CCTACGAACA GCTCTACCAA CAGGAACAAG AATACTTTCG GGACATGGTC AAAAAAGTGA
AAGACAGCGG CGCCAACCTC GTCATTTGTC AATGGGGCTT TGACGACGAA GCAAATCATC
TACTGCTACA GAACGACCTG GCGGCCGTGC GTTGGGTCGG CGGGGTCGAA ATTGAGCATA
TTGCCATGGC TACGGGTGGA CGTATCGTGC CGCGCTTTGA AGAAATATCG GCGGAAAAAC
TCGGACACGC CGGTCGCGTG AAGGAAATTA CCTTTGGTAC CTCCGACGAA CGCATGCTGG
TCATTGAAAA TCCCGTCAAT ACCACGGCGG TCACCGTTTT AGTCCGCGGC GGCAGCAAAA
TGATCGTCGA AGAGGCCAAA CGCTCGTTGC ACGATGCCAT GTGTGTGGTG CGCAATCTCA
TTCGGGACAA TCGGGTCGTC TACGGCGGCG GCTCAGCCGA AATCGCCTGT TCCTTGGCCG
TCAGCCGATT CGCCGACACT GTAACCGGTG TGGACCAGTA CGCGATTCGG GCCTTTGCCG
ACGCTTTGGA CGACATCCCG CTGGCCTTGG CGGAGAACGC TGGCCTCTCG CCGATTGAAG
AAGTGGCGGC CGCCAAGTCG AGGCAAGTCA AGGAAAAGAA TCCCGTAATT GGGCTCGGTA
TGGACGTGAT GAACGAAGCG GATGGCTACC ATTCGGCCGA TATGCGGGAA CTGGGTGTCT
TTGAAACGTT GATTGGCAAG CAACAACAAA TTCAGTTGGC AACTCAAGTG GTCAAGATGA
TTCTCAAGAT TGACGACGTG ATTTCCATGG GACCACAGTA G
 
Protein sequence
MSMVYDEYGR PFIILKEQQA KARVKGTEAT KSNILAARSI SNMLRTSLGP KGLDKMLVSP 
DGDVTITNDG ATILEQLHVD HQVAKLMVEL SQSQDDEIGD GTTGVVVLAG ALLEQAEVLL
KKGIHPIRVA EGLEKAADVA MQTLAEIAEP MDIAVNNHAA LVATAMTTLS SKILHQHKHK
MADIAVRAVL QVADLERRDV NFEHIRVEGK TGGSLEDAEL VNGIVIDKEI AHPQMPKIIE
DAKLCILTCP FEPPKPKTKH KLEIDSKEAY EQLYQQEQEY FRDMVKKVKD SGANLVICQW
GFDDEANHLL LQNDLAAVRW VGGVEIEHIA MATGGRIVPR FEEISAEKLG HAGRVKEITF
GTSDERMLVI ENPVNTTAVT VLVRGGSKMI VEEAKRSLHD AMCVVRNLIR DNRVVYGGGS
AEIACSLAVS RFADTVTGVD QYAIRAFADA LDDIPLALAE NAGLSPIEEV AAAKSRQVKE
KNPVIGLGMD VMNEADGYHS ADMRELGVFE TLIGKQQQIQ LATQVVKMIL KIDDVISMGP
Q
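
Checking the record

The sequence blocks above are grouped in tens with spaces and line breaks, so they need to be joined before lengths or base composition can be checked against the fields in Gene Information. The following is a minimal Python sketch of that check, not part of the original record. The helper names (join_groups, gc_fraction, span_length) are hypothetical, and the sketch assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the listed gene length of 1901 bp.

```python
def join_groups(block: str) -> str:
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


# Start bp and End bp as given in the Gene Information table.
print(span_length(780220, 782120))  # 1901

# First line of the gene sequence block, used here only as sample input.
first_line = "TAGATTCTTC TTCCACTTTT GTCTAGATTT CGTACAGTTT CGATCCTTTG TCTCCATTTT"
print(len(join_groups(first_line)))  # 60
```

Passing the full gene sequence block through join_groups and gc_fraction gives a value that can be compared with the GC content field. Passing the protein block through join_groups and taking its length gives a value that can be compared with the Protein Length field.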