Gene Information |
Locus tag | PHATRDRAFT_20574 |
Symbol | |
ID | 7201274 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 571859 |
End bp | 574141 |
Gene Length | 2283 bp |
Protein Length | 720 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180464 |
Protein GI | 219119406 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000077682 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GACTATTCCA ATCTCCCACT GAAGCCTGAC CATATGTCGC GGCCCTGCTG GACTTGTCCA GACGGCAATA TTTATCTCGA GGCCTTTCAC GATTTATACG TCAGTGCGTA CGATTTTCTC GTCGCCATTG CCGAACCAGT GGCGCGCCCC GAATTTTTGC ACCAATACAA GTTAACACCG TACAGCTTGT ACGCCGCTGT CGCTACCAAC ATTGAAACCA ACGCAATTGT GTCCGTTCTT GAACGCCTGT CGAAAAACAA GCTCCCATCC CAGGTCATCA AGTTTATTCG GGAGTGTACA CAGAAGTACG GCAAGGCCAA ATTGGTCCTT AAGCACAATC GTTTTTATGT CGAATCGGAG TTTCCGGCCG TTTTACGCGA ACTGCTACGC GATCCTACGA TCGCGCAAGC ACGCATTGTG GAAGATGTGG TGGATGCATC TGCCTTGGAT GCGGACGGAT TTGTCACGCA ATCAAAGGCA CAGGAAATGA AAGAGAATCT CCATATGTTG AGGGAACCAG ACGACGATAG TGAAGAAGAC GAAGAAGGAC TGTCAACCGG GGCAGGCGGC CAACCTACCA AACCCAGCAA TGTCGTCGTC AGTTTTCAAA TTAAACCCGA ATCTGTTGAA GTTGTGAAAC GCCAAGCCAT TGAACTAGAT TATCCATTAA TGGAAGAGTA CGATTTCCGC AACGACACTA TGAATCCGAA CGTACCCCGC ATGGATTTAA AGCCTCACAC GCGTATTCGT CGCTACCAGG AAAGGTCCCT GGCTAAAATG TTCGGCAATG GGCGAGCTCG ATCCGGAATC ATCGTTTTGC CGTGTGGTGC CGGGAAGACC TTGACGGGCG TAACAGCGGC GCAAACGATT AAAAAGTCAG TCGTGTGTCT TTGTACCAAC GCGGTGTCAG TGCTGCAGTG GAAATACCAA TTTCAATTGT GGACTTCCAT TCCGGACGAA CATATTGCTG TCTTTACTTC TGATCACAAA GACAAACTTG TTGACCCATG CGTACTCGTA ACGACATATA CGATGATCAG CTACTCCGGT AAGCGTTCAG CGCAATCGCA AGAAATTATG GACCAAATCA CGTCGCGTGA ATGGGGTTTG TTGCTAATGG ACGAAGTCCA CGTCGTACCG GCCAAAATGT TTCGTCGCGT TGTGGGATCT GTCAAAGCGC ATTGTCGATT GGGACTCACA GCCACATTGG TCCGCGAAGA CGACTTGATT TCCGATCTCA ACTTTCTGAT TGGTCCGAAA CTATATGAGG CAAATTGGAT GGATCTGACT GCTCAAGGAT ATCTGGCGAA CGTTCAGTGT GTGGAAGTGT GGTGTCCCAT GACAGGCCCG TTTATGAAAG AGTACCTGAT GGCTTCAAAC TCACGTTTGA AGCAATTGCT ATACGTGATG AATCCGAGCA AGTTACGGGC AGTCGACTTT TTAGTCCGAT TTCACGAAGC TCGAGGCGAC AAGATTATCG TCTTTTCCGA TTTGGTGTAC AGTCTGAAGT TGTACGCAGA AATGCTAAAA AAGCCGTTGA TATATGGCGA AACGCCGGAG CGTGAGCGAC AGGCTATTCT AGGGACCTTC CGAGCGTCCG ATGCAGTCCG GACTATTTGT ATATCCAAAG TTGGTGATAC TAGTATTGAT CTACCTGAGG CCAATGTGAT TATCCAAGTG TCGTCTCACT TTGGTTCTAG GCGACAAGAA GCTCAGCGTC TCGGACGCAT TTTGCGTCCC AAGTCGTATA CACAACAGGA CGGCAGCAAT AGGTCCACCT TCAACGCATT CTTCTACACC 
CTGGTTAGCA GCGATACCCA GGAAATGTTT TATTCAGCCA AACGACAACA GTATTTGATT GATCAGGGAT ATACATTCAA GATTGTCACA AACCTATGTG AGAAGGCTGA CGCCGAGGCC ATTGCAAACG GTTGCACTTT TGCGACGCCA GAAGACGATC GTAAATTGTT ACGAACGGTT TTGACGAGCG AGACGGATTT AGAGAAAGAA CAGCGCGCTG AAGACACCGC AATTCGGAAG AATAACACTG ACGGTGCTGC CTTGGCTGAT GCGGCTTCTA AAAAGACGAC GAGTATGGCC CAGCTTAGTG GTGGGACTGG ACTGCGTTAC AAAGAGTTTT CTTCGTCAGG GCTACCGAAG CGGCATCCGT TGTTCCGGAA GAGACAACGT CTCTAGTAGA TTGTCCATGT CTTGAATCGT TTTTGCTTCT TTGGGACAGC TATAGCTTAA GCTTTTGTAA TCACTAATGT CACTTCTTCT GCT
|
Protein sequence | MSRPCWTCPD GNIYLEAFHD LYVSAYDFLV AIAEPVARPE FLHQYKLTPY SLYAAVATNI ETNAIVSVLE RLSKNKLPSQ VIKFIRECTQ KYGKAKLVLK HNRFYVESEF PAVLRELLRD PTIAQARIVE DVVDASALDA DGFVTQSKAQ EMKENLHMLR EPDDDSEEDE EGLSTGAGGQ PTKPSNVVVS FQIKPESVEV VKRQAIELDY PLMEEYDFRN DTMNPNVPRM DLKPHTRIRR YQERSLAKMF GNGRARSGII VLPCGAGKTL TGVTAAQTIK KSVVCLCTNA VSVLQWKYQF QLWTSIPDEH IAVFTSDHKD KLVDPCVLVT TYTMISYSGK RSAQSQEIMD QITSREWGLL LMDEVHVVPA KMFRRVVGSV KAHCRLGLTA TLVREDDLIS DLNFLIGPKL YEANWMDLTA QGYLANVQCV EVWCPMTGPF MKEYLMASNS RLKQLLYVMN PSKLRAVDFL VRFHEARGDK IIVFSDLVYS LKLYAEMLKK PLIYGETPER ERQAILGTFR ASDAVRTICI SKVGDTSIDL PEANVIIQVS SHFGSRRQEA QRLGRILRPK SYTQQDGSNR STFNAFFYTL VSSDTQEMFY SAKRQQYLID QGYTFKIVTN LCEKADAEAI ANGCTFATPE DDRKLLRTVL TSETDLEKEQ RAEDTAIRKN NTDGAALADA ASKKTTSMAQ LSGGTGLRYK EFSSSGLPKR HPLFRKRQRL
|
| |
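The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, that the standard genetic code applies (the Translation table field is empty in this record), and that the protein's coding region ends with one stop codon. The names `inclusive_length` and `translate` are hypothetical helpers introduced only for illustration. Only the first 70 bases of the gene sequence and the first 12 residues of the protein are copied in.

```python
# Values copied from the record; nothing here is fetched from a database.
START_BP = 571859
END_BP = 574141
GENE_LENGTH_BP = 2283
PROTEIN_LENGTH_AA = 720

# First 70 bases of "Gene sequence" and first 12 residues of "Protein sequence",
# with the spaces between the 10-character blocks removed.
GENE_PREFIX = (
    "GACTATTCCAATCTCCCACTGAAGCCTGAC"
    "CATATGTCGCGGCCCTGCTGGACTTGTCCA"
    "GACGGCAATA"
)
PROTEIN_PREFIX = "MSRPCWTCPDGN"

# ASSUMPTION: standard genetic code. The record leaves "Translation table" blank.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def inclusive_length(start: int, end: int) -> int:
    """Length of a span whose start and end positions are both included."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate complete codons from the start of dna; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# 1. The span implied by the coordinates should equal the stated gene length.
span_bp = inclusive_length(START_BP, END_BP)

# 2. The first ATG in the copied prefix, as a 0-based offset.
orf_offset = GENE_PREFIX.find("ATG")

# 3. Translating from that ATG should reproduce the start of the protein.
translated_prefix = translate(GENE_PREFIX[orf_offset:])

# 4. Bases in the gene span not accounted for by the protein plus one stop codon.
#    ASSUMPTION: exactly one stop codon terminates the coding region.
non_coding_bp = GENE_LENGTH_BP - (PROTEIN_LENGTH_AA + 1) * 3

if __name__ == "__main__":
    print("span from coordinates:", span_bp)
    print("offset of first ATG:", orf_offset)
    print("translated prefix:", translated_prefix)
    print("bases outside the coding region:", non_coding_bp)
```

Under these assumptions the coordinate span matches the stated gene length, and translation from the first ATG reproduces the opening residues of the listed protein. That suggests the gene sequence shown includes some flanking sequence upstream of the start codon rather than the coding region alone.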