Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45041 |
Symbol | |
ID | 7200063 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | + |
Start bp | 34421 |
End bp | 36207 |
Gene Length | 1787 bp |
Protein Length | 333 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179125 |
Protein GI | 219116661 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGAGAA AAGAAATTTA CGGCCACCCT GATGCTGCGG TCGGAACGAT AAAAATGGAT CCGCGAGTGC TGCCCGTGCG ATATCGTCCA TCATCAGTGT CGGCTCGACC GCATCGACGA TCTTCCTATC GATCCGCTTC TTCTTCAGGC TCGTCGGTAT CCACTTCTTC TAAAAGCATT GACTCTCAAA ATATTGCGCC CGCTATTGCG CTCTTTGGTG CGGAAGGCAA AACGGGTCAC CACTTTCTCC GCCTGGCACT AGATGCAGGG TACAATGTCA ACGCCTACCT CAGTCCCAAG GTTTCGTTGC GCTCGTTGGA AGAATTTGCA CATCAACCCA GTCTGTCCGT GACAACAGGC AAGTTAGAAG ACATCGAACA GCTAGAGCAA GCAATCTCGG GTGCACAGTA TGTTGTATGT ATGCTAAACG ATACGTTGCC TGGCAAAAAA GAATACCCCG CGGGATGCCT TGTCTCGTTT GTAAATCGCT TGTACCCCAT TCTTCAGCGG GAATCCTCTG TTCAACTGCT GATCTTTCAA TCTACGTCGT TGGCGACTAG CGTGTCTGGT CCTACTCCTT TACTATCCAA AGTGGTAAAA AGAGCAGCAC GAAGACGGTG TGCCTTTACA AAAGATCAGG ACGCCGTCGT CCGATACATT GCCGCCCAAC ATGGGCTGCG CCAGCCGAAA CAATCAAATA CTATCATCGG ACCCGAGAAA AGAAGCTCGG TTGGCCAAAG GTCGATCAAA GCTGACGATC AAAGTTCCGT CACTCGGACC GATTGCTGCG ACGATGCGTT ACCACCACAC TTTTCGTTTA TTGTGACACG TCCGACTATA CTTCTCAAAG ACGGCCCTCC GTCCAAGACG TTATCCGCTT CAAAATCGGT ACGTAAGGGC TTGCTCGCCC GTCTTTTATC GAATGCTTGT CAATGCCTTC TCACTGACAC GTATCACAGC AACCTGGTCT CTTTCCAGTG GCACACGTAG ATCTCGCCGA GTTCACATTG ACCGCCCTAC AAAACGAGAA GCTCTACAAC ACGAGTCCTT ACGTCGTGGC AGACATCTTT TAGAGTAAAG GGTCAATCAT AAACAATTAT TCGTTCATAT GCGTTCATTC ACTTCTACAT TACAACAAAG ACAGAGCAGC AAAACAAAGC TATACAGGTA ATGTAAAATC AGCTCTGAAA TGCCAAGCAC CGTTCTTGAC CCCCATGAAG CTCATTTTGG ACATTCAAAA TCGATTGTCC TGTCGAGCTC ACATACCTCG AGTAAGACAA AAGTTAACAC AAAGACTCAA GATACCGCTT GTCTCAAAGA GGTCCGCCCA CTACGTTCTA CCTATAGTTA GTGACCAGCG TTCTACTCAA CATTGCACAA GTAAGCGGCT CGACATACAA CCGGAGGTTG AATTACAGTG AAAACATTTA GCTGAGCAAC ATCAAGGAGC ATTCTTGGGA ATTGGAAGAC TCGATCTGGA CAAGGCCTAC AATCAAAACC CAATACACAA AAAGGTTGGA ACTGCTACTT TTCCTATTTG CTGACATCGA GGATGCTGCT AAAGCATCTC CAATTGTTCC GGTGGCAAAG CAGCAGCTCT CGCCTTTTCT TATTGCTGAT TGTATTCACA TCACCTTGTC TAGCTGGCGT CGAATCTTTA CAAGCCACAA GATATTCGAT GCCACCCGTG CGAATTCAGT TTCAGATCGC CAATGATAAT GTTGATCGGA TCGAATTGGA GCGACAGCTG CGATCTATCT TCGCTATTCG CTTGAGTTCA GTTTTTT
|
Protein sequence | MKRKEIYGHP DAAVGTIKMD PRVLPVRYRP SSVSARPHRR SSYRSASSSG SSVSTSSKSI DSQNIAPAIA LFGAEGKTGH HFLRLALDAG YNVNAYLSPK VSLRSLEEFA HQPSLSVTTG KLEDIEQLEQ AISGAQYVVC MLNDTLPGKK EYPAGCLVSF VNRLYPILQR ESSVQLLIFQ STSLATSVSG PTPLLSKVVK RAARRRCAFT KDQDAVVRYI AAQHGLRQPK QSNTIIGPEK RSSVGQRSIK ADDQSSVTRT DCCDDALPPH FSFIVTRPTI LLKDGPPSKT LSASKSQPGL FPVAHVDLAE FTLTALQNEK LYNTSPYVVA DIF
|
| |