Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38433 |
Symbol | |
ID | 7203419 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011684 |
Strand | + |
Start bp | 83770 |
End bp | 85368 |
Gene Length | 1599 bp |
Protein Length | 532 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182471 |
Protein GI | 219124356 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTATA AAAAGACAGC TTCGGCAGAT CGGGATGCCT TGTTTGGTGG AGTGCCAGCC GCAACGGAAG GAGGGCGCGA CAAGAAGAAA ACCACGAATC GTACCGCGCG TCTTTCCTCG TCGACTACCG AGACTGCACG GCCAAAACCT ACCGCTGTTC CAACCAATCA AGGTTATAGA CCCCAGCGCG GCACCGCCGG AGGGACTGGG AAAAAGTCGC AGCCCTCGCT GTCGGCGGAA ACGCAGGCAC AAAAACGAGC GGAAGCGGAG GACTACAAAG CAAAGGCCAA CAAGTGCATG CAGCGTTCGT TTTTCGGCAA ACCCGATCCG GTCGCCGCCA GCACCTTCTT CAAACGCGCG GCGGATTGTT ACCAAATCTT ACAGGAAACG CGGTTGGAGC AATTGTACCG CGTCGAATCG GCTGAGTGCA ATCGTATCGT GCAGGCCTGG GCTTCGTGTG CTTCGGATTA CACGCGGGCT GCCGAGCTCA TGCTCGAACT CATCGACACC ACCGACAACG GTACGGATGC TTCCCAAAAA CGCCGAGATG CGTCCAAATT TCACAAGCAA GCTGCTGGGG CCTGGACAGA AATGGGCGAG AAATCCAAAG CTGCCGCTTC GCAAGTCCAA GCCGCCATTG CTTTGAATTT CGGAGAAGAG TCCACGGTTT TGTCCAAACA AGCTCTCCAG GGTATGGAGG AAGCAATTGA AGCGCACGTA CCCGACGTTT TGAACCCGTA CGGACGGTAC CGCCAAACCG GCGTTTCTGC ATTTCTGGAT CCGGAGAATG CGGACGAAAC CGTTGAACAG GCCAGTGCAG AAACCTTACA ATTGGCCTCG TCACACATGG TGACCCGCTC GTACGCCCAC GAACCGTTGA ACCAGCTCGT AGCCGTACTC GTCAATGCCG GTGAGTACGC TTCGGCCCTG TACGCGGCTG GTGCCGTGAC AGCGATTTTG GAAAAGGATG GCATTAGCAC GTTGAGTTTG AGTCGTGCGT ACGCCGTCGA AACAGTCTTG ACACTAGCCT TGGGCGATCC CGTCATGGCG GAACAATCCT TCTTGTCCCG TCACGTTCAG TCCACGCCGT ACTTGGCCTC ACGTGAATGC AAGCTGGCCG AAGACTTGTT CCGGGCCGTC AAAACACGCG ATCTGGATGC CTTGGAGGAG GCCCGCGCGG TTACCGGTAG CAATCGGGCC GCCCTGGCCA ATCTGGATCC GGCCGTTAGG GAATTGGTGC CCCTTCTGCG CTTGACCGGT GTCGCGCGAA AGAATGTGGC TAGCAATGCC ATCCCCGTGG CATCGACCTC TGCGAACAGC CGGCGTGGCG GGAAGAATGA ACCGGATCGG TTGCAGAAAA ATGAGATACC GGAGGCGACG ACGGAACCGG CAACCTTACA AGAACTGAGT AAAATGAAGA CTGGATACGA AAAGGAGGTC GCCGAAGGAG CACATTTGGA TGGGAATGCG TTGGCTAACG AATTGGATGA TTTGGATTTT GGTGCTTTGG ATAGTGATCA CGAGGATGAC GGTGATGGCT TGGGAGGGGT GGGCGATGAT TCCGACTTGG AGGATGACGA TGACGTTGAC TTGCGATAG
|
Protein sequence | MSYKKTASAD RDALFGGVPA ATEGGRDKKK TTNRTARLSS STTETARPKP TAVPTNQGYR PQRGTAGGTG KKSQPSLSAE TQAQKRAEAE DYKAKANKCM QRSFFGKPDP VAASTFFKRA ADCYQILQET RLEQLYRVES AECNRIVQAW ASCASDYTRA AELMLELIDT TDNGTDASQK RRDASKFHKQ AAGAWTEMGE KSKAAASQVQ AAIALNFGEE STVLSKQALQ GMEEAIEAHV PDVLNPYGRY RQTGVSAFLD PENADETVEQ ASAETLQLAS SHMVTRSYAH EPLNQLVAVL VNAGEYASAL YAAGAVTAIL EKDGISTLSL SRAYAVETVL TLALGDPVMA EQSFLSRHVQ STPYLASREC KLAEDLFRAV KTRDLDALEE ARAVTGSNRA ALANLDPAVR ELVPLLRLTG VARKNVASNA IPVASTSANS RRGGKNEPDR LQKNEIPEAT TEPATLQELS KMKTGYEKEV AEGAHLDGNA LANELDDLDF GALDSDHEDD GDGLGGVGDD SDLEDDDDVD LR
|
| |