Gene Information |
Locus tag | PHATRDRAFT_49717 |
Symbol | |
ID | 7198400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | - |
Start bp | 44304 |
End bp | 46579 |
Gene Length | 2276 bp |
Protein Length | 421 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184559 |
Protein GI | 219128730 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACTGTGAT ACACAGGCAG CCATAGAGTT TAACCTCTTG GTGTCTGTCC AGTATTTGGT TGCTTTTACC TGCGTACAAT GGGAGCGGCA TCGTCGTTTC TGTGCACCCA GGCCGACGCC ATTCCGGGTC AGCTCATTGC CGGTACATTG CCTGCCGTCC GCCACTGCAT CGTGTACGGC AAGGAGGACG GCACATTGCG CGAAGAAACG CAACAGCTAC CCACAGTTAC CGACCAATCG GATCACGTCT GGGTGCGTGT CCACGCCGTT GGACTTAATC CGGTTGACGC TAAGAACGTC GTCGGCGACA AGTTTCCTCA CCACTGGCGC ATTGTGCGTT CCTGGGTCCG TTCCGCCTTG GTCGCCGGCA CCATTCCTGG ATTTGATTAC GCTGGTACCG TTGCGGCCCT ACCGAAGCAC GGACGGGCCC TCAAGTGTAA CGCTGACGAT CTGCCGGCTT TGAAGGTTGG GGACGCTGTC TTCGGGACCA TGCCAGCTTT GCAAGGTACC TTGGCAACCT ATATTGCCGC TCCCCGGCAC CAAATGTGGC ATAAACCAGA GAGCCTTTCC TTTGTCCAAG CAGCCGCCTT GCCCTTGGTC GGACTGACGG CCTACCAATG CTTGCAGCCA CACATGGCTT TTCGCAATGA GAAGGATGGT GCTTCCAATG CGTCCGCATC TTCGGTGCTT GTTGTTGGTG GAAGTGGCGG TACCGGTCAC GTGGCGATTC AAGTAGCCCG CAATCTCGGG GCCCGTTGCG TGACAGCGGT TTGTTCCACG CGGAATGTGG CATTTTGTCA CCAACAGGGT GCCACCTATG TGGTCGACTA CACCGCCAAT GGCGGAGACA AAGATACGTT GCGGCGGCAT TTGCTCGAGA CCGGTCCCGC TTGTCCGTTC GATATCGTGC TGGATTGCGT CTCGTCCGCT GATCCCCGTG ATCAACAAAC CTTTTCGTAC CGCCAACTGC TGCAACAAGA TGTCCATATT CAGAAGCTTT TGAACGAAGC CGCTGTGTAT CGTCGTTTGG GTGGACCGTC CCTCGATTGG CTCCGGGCCG GGATGGAACA GAAGCTTGGT TGGACGGGTG TCTGGAAGTA CACAACCACC AAACGGAATT CCAACGACGA CCGCTTGTTT TGGATTCGGT TTCCCCATAC GTCGTTACAG TTGCAAGCAT TGTCTATCAT GGCCAATCAG GGCCAGCTAC TGCCCAAAGT GGAAAAAGTG TACAGCTTCT CCGCCACCGA CGTTGAAAGG GCCTTTGACG ATTTACTGTC CCGTCGAGTT CGGGGCAAGA TTGTGGTGGA ATTGGTCAAG GAGACCGACC AGCAGAAAGA CTAGTTTGTT ACTCTAGTTG CCACAGCATA GCGAGTATTT CATCCGTGGA TGGCTTATGG AATTGAAACA GGCAAAACGA GAACGAGGTT TTACGGATTG TACTATTAGA GCATCCATCG GACGGAACAG GGTGCGGAAT GCTTCTAACC ACGTTTGCAG ACGAAAGGCA TGAAAAACAT AGGGATCGAT GAACCGATAT CGATCGCAAA AGAAGCGTCC AAGTCCAGCA TCCCTTTTTG ACGATGACTA CAAAGATGGA AAAAGGGGTT TTGTTTCGAG GAAATATGTG ACAGACGAAT TCTCCGCTCA ATGAAGATAG GAACAGGGTG TTTTGTCAAC GCGGAGTGTT CTCGACTGAT CGGTTGACAC AACACATACA GTAAGCGAAT GACTGCCCTT AGTGACACGC GGGTACATTT GTGGTCGTGG ATGCTCGTCC TCTTCGGAAC GATACCCTTC 
ACAACTATAA GGGGGTACGC AAATGAAACA GGGTACGGCA ACACAGGGTC ATGTCCTGAA ATGTCGGGGT CTGGAGTTTG CCGGCGTAAA ACGTCACCAT AGGTGACAAT GTATACTCCG AGGATACCCA AACTCGGGAT CCGGTGTCTA CCCAACAAGG AATCACTCGA TTTGCTAGCT GAACAACGGG TCAAAGTCTG ATCACGGTTC TTCAATATTT TGTTTTCAAT TTGATTTTTA CAACGGTGAC AGAGCGAGCA TCGCTTTTCA TCATAAGTAA TGTTTTCCTC GATCGCGTTT TTTTAGGATT ATTGTGTCAA GCAGGAACAA TTGATTACAT CGGCCGATAT TCTGGTACAT GATTCTACAC CCGTCTACGA AGGCTGCGAA CTGGCGCTCC CCACATGCTC ATGAGAATGT CCCCAGCCCC CTTGGGTCTC ACACAACAAT CAAAAACTAG ACGGATTTGC ACCCAC
|
Protein sequence | MGAASSFLCT QADAIPGQLI AGTLPAVRHC IVYGKEDGTL REETQQLPTV TDQSDHVWVR VHAVGLNPVD AKNVVGDKFP HHWRIVRSWV RSALVAGTIP GFDYAGTVAA LPKHGRALKC NADDLPALKV GDAVFGTMPA LQGTLATYIA APRHQMWHKP ESLSFVQAAA LPLVGLTAYQ CLQPHMAFRN EKDGASNASA SSVLVVGGSG GTGHVAIQVA RNLGARCVTA VCSTRNVAFC HQQGATYVVD YTANGGDKDT LRRHLLETGP ACPFDIVLDC VSSADPRDQQ TFSYRQLLQQ DVHIQKLLNE AAVYRRLGGP SLDWLRAGME QKLGWTGVWK YTTTKRNSND DRLFWIRFPH TSLQLQALSI MANQGQLLPK VEKVYSFSAT DVERAFDDLL SRRVRGKIVV ELVKETDQQK D
|
| |
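The length fields in this record can be checked against the coordinates and sequence fields. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates (which matches the listed 2276 bp) and that the spaces and line breaks in the sequence fields are display grouping only. The helper names `clean_sequence`, `span_length`, and `gc_percent` are hypothetical and do not come from the source database.

```python
def clean_sequence(raw: str) -> str:
    """Strip the 10-character grouping spaces and any line breaks."""
    return "".join(raw.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a (possibly grouped) DNA string."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Coordinates taken from the Start bp / End bp fields of this record.
print(span_length(44304, 46579))  # 2276, matching the Gene Length field

# First two groups of the Protein sequence field, as a small cleaning demo.
print(len(clean_sequence("MGAASSFLCT QADAIPGQLI")))  # 20
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared with the GC content field. The record does not state how that field is rounded or which span it covers, so small differences would not necessarily indicate an error.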