Gene PHATRDRAFT_20773 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_20773 
Symbol 
ID7201651 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011678 
Strand
Start bp218153 
End bp220164 
Gene Length2012 bp 
Protein Length484 aa 
Translation table 
GC content53% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002180965 
Protein GI219120454 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AACACCCCAA GTACGGCAAG GCGAACCGAA GACAGGTAGG TAGGTAGGTA GGTATATATA 
TATACACACC CTCCCACCAC TAACAGCCCG TAACGAACGT TTCTACCTCT CTGCTTCCAA
AGAAACCGGA TACATCTAGG ATTGTCGGAG ATCCGTTGAC GCCAAGAGTT GCACAAAGCA
AAACTTGTTT CCAGAATCAT GCCGTCGGCT ACAAAATGTG AGTACTTGCG AGCGACACTC
CATCCAGTCA GGGCGACTCG CGTGTCTGTT TCGTGACACT GTGTCTTTTG CTTGCGGTTG
TGTCGAACGC TTTGGCGGAG CACTTTATTC CCACATACTC ACACACACAC ACATACACAT
TGCTTGGTCG ATCTCGTGAC TGCGCAGCCG CCAAAAAGCG CAAGGCCGAC GACGGCTCGG
CCGTGGCGGA AACGTCGGCG GGTCCGATGA TTACGCCTTC GTCGCACACG CCCAAGCTCG
ATACGAGTAA GTGGCCGTTA CTGCTGAAGA ACTATCACGA ACTCCACGTG CGCACCGCGC
ACTACACCCC GCTACCGACC GGCAGTAGTC CGCTCGCCCG CACCCTCGAA CAGCACTTGC
AGTACGGAGT CATCAATCTC GACAAACCCG CCAATCCGTC CTCGCACGAA GTTGTGTCCT
GGATTAAACG CATTCTGCGA GTCGAGAAAA CGGGACATTC CGGCACGCTG GACCCCAAAG
TTACCGGTTG TCTCATTGTG TGTATAGATC GCGCCACCCG ACTCGTCAAG GCACAGCAGA
GTGCCGGTAA AGAGTATATC GGCGTTGTGC GTCTGCACGC TGCCTTGGAC GATCCCAAGC
AGCTCACGCG AGCGATTGAA ACCACACTCA CCGGCGCGCT ATTTCAACGA CCGCCCCTTA
TTTCGGCCGT CAAACGACAA TTGCGCATTC GAACCATTTA CGATTCCAAG CTCCTCGAGT
TCGACGGCGA ACGAAACCTA GGTGTCTTTT GGGTCAAGTG TGAGGCCGGA ACGTACATTC
GTACCCTCTG CGTGCACGCG GGACTCTTGG TCGGTACCGG TGGTCACATG CAGGAACTGC
GTCGCGTTAA ATCCGGTGTG CTCGGGGAAG AAGACAATCT CGTCACGCTG CACGACGTTA
TGGACGCACA GCACGTGTAC GACACCACTA AGGACGAAAC CTATTTGCGC CGGGTGGTCA
TGCCGCTCGA GACTCTCTTG ACCAACTACA AGCGTATCGT TGTGAAAGAT AGTGCCGTCA
ACGCAATCTG CTACGGTGCC AAGTTGATGA TACCCGGTTT GCTGCGCTTT GCCGACGATA
TCGAACTCAA TCAGGAGGTG GTACTCATGA CGACCAAGGG CGAGGCTATT GCCGTGGCTT
TGGCGCAAAT GACGACGGCC GTCATGGCCA CAGTGGACCA CGGAGTCGTG GCCAAGATTA
AGCGGGTCAT TATGGAACGC GATGTGTACC CGCGTCGATG GGGTCTCGGG CCCATGGCGC
AACGGAAGAA GAGTATGATC AAGGAAGGCA AACTGGACAA GCACGGCAAA CCCAACGAAA
AGACGCCGTC GAACTTTTTG GATTCCTACA AGGATTATTC GAAATCCAAA CCGCCCGTAT
TGGATGGTAA CGGTGAACCG ACCACGCCGG GATCCGCATC CGTCGCCTCC AGCAACGGAG
ATAAAATGGA AGTCGACGAA TCGGAGAAGA AACGACCATC CTCGCCAAAA TCCGAGGATG
ATGACGATGC TCCCCAGAAA AAGAAGGATA AGAAAGACAA GAAGAAGAAA AAGGACAAGA
AAAAGAAGAA GGAAAAGGAG TAGAGGACGC CTTGTTTCCT GCTACTAGTA TATCTTCCAG
AGAAATCTAA ACTAATCTTT TGAGAAATTT GTATGCATTC GCGTACATCG AAAAGTTGAC
AAGGGGTCAT GCAATGGGTC CGTTGCTGGC AGCAGTTGCG AACGAGTAGG CTAGAACAGA
ATTTTTAAAT GGTATGCTCT TAAGATTAGG TA
 
Protein sequence
MPSATKSAKK RKADDGSAVA ETSAGPMITP SSHTPKLDTS KWPLLLKNYH ELHVRTAHYT 
PLPTGSSPLA RTLEQHLQYG VINLDKPANP SSHEVVSWIK RILRVEKTGH SGTLDPKVTG
CLIVCIDRAT RLVKAQQSAG KEYIGVVRLH AALDDPKQLT RAIETTLTGA LFQRPPLISA
VKRQLRIRTI YDSKLLEFDG ERNLGVFWVK CEAGTYIRTL CVHAGLLVGT GGHMQELRRV
KSGVLGEEDN LVTLHDVMDA QHVYDTTKDE TYLRRVVMPL ETLLTNYKRI VVKDSAVNAI
CYGAKLMIPG LLRFADDIEL NQEVVLMTTK GEAIAVALAQ MTTAVMATVD HGVVAKIKRV
IMERDVYPRR WGLGPMAQRK KSMIKEGKLD KHGKPNEKTP SNFLDSYKDY SKSKPPVLDG
NGEPTTPGSA SVASSNGDKM EVDESEKKRP SSPKSEDDDD APQKKKDKKD KKKKKDKKKK
KEKE