Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44208 |
Symbol | |
ID | 7203927 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | + |
Start bp | 1312237 |
End bp | 1315224 |
Gene Length | 2988 bp |
Protein Length | 937 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186227 |
Protein GI | 219113287 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATAAAA GTGTCTGCAT AATATATTGT CACGTGCTTC TCCTCCTGTG CTTGCTGCAC GACGATGGTT GGCGGGCCTA CGGACAAGAA ACCGAAGACA TTCTCGATAC GACCACAACT CGTCCAGTTG TAGTCCACTT GCCAGATCTT GGACGCGTCC AAGGCAAACG ACAGTCGGGA ATCGACTTTT TCGGGGGGTT GCCCTATGCC GCTCCCCCCG TCGGTCACTT GCGATGGGCT CCACCGGAAC CACCAGCGCC CTGGGCACCC GCCAAACTAG ACGCTACCCA CTTCGGTCCC GACTGTTGGC AGCTCGTCGA TCCCTTGCTC AATCCAGGAG CCGAAGTCGC ACGCATGTCG GAAGATTGTT TGTATCTTAA CGTATTTACT CCGGCGGGAC ACGCTTCGCG GCACGAGCGA CTGCCCGTGC TTGTCTGGTT GCACGGCGGT GCCTTTCAAC AAGGGGGCGC TCGACGCTCC GAATACGACG GACGTCGTCT CGCCGAGCGC GGCACCATTG TGGTGACAAT CAACTACCGG CTTGGTGCGC TGGGATTCTT GGTCAGTAGC GTCGACGGCT TGTTTGGAAA CTTTGGACTC ATGGATCAGC GCGCCGCCTT GCACTGGGTA CAGGAGAATA TTGCTAAATT TGGAGGAGAT CCGGATAGCG TCACGTTGTT TGGAGAGTCG GCGGGGGCAG TCATGACAGG ATTGCACCTC ATGATGGAAG GGGCGGGATC GCTTTTTCAT CGAGCTATTA TACAGAGCAA TCCGCTGGGC TGGCAAGTAC GAGCCATTGT AGTAGCTGAC TTTATCGGTG AAGCAATGAA ACGTTCCGTA GATTGTCGAG ATGTGGCCTG TCTCCGGGCG GAGCGTGTGG AAGAGATTAT GCGTGCGCAG TCCAGCCTCA TGGGAGTCCC AAGGAGTGTG GGCGATTTTT TTACCTGGGG TCCAACCTTG ACGGAAGAGC TCAAGCTCAC CGTCGGAGGG CGCACACCGT TTGGCTCCAC TTCCCCCCTT AGTCGTGAGC ACGTCATGTT CCGAGACTTG GACTCTTGGA AATGGCAAAA CAACCGCGAT ACGTCCTGGG CTGCCGTCAA CGTCACACAG CCGCTGAAAA ATTTGAATCT CATACCCGAC GATATACCCG TCATTATTGG TGCCAACAAG CATGAAGGCG AAATGTTTGT ACACGGTGCT TTCCCCATCA CCATGTCGAA AACTGTCTAT TGGATGTTCG TTGGGGCCCT ATTTCGAGAT AGTGCTTCGA GGGTATTAAA ACATTACCGC GCGTACGTGG ATCAAATAGA GCGGGAAGCC GAAGAACTTG CCCGATGCCA AATCGAAGAA GAGGAGAACC GGCAATACTA TTTAGAGCAC AAAGAGCAAC TCGATCACGA GTATCAGTTA CTACTGGAAA TGAATTCGAC TAAGGAAGGA GTCGAAGCTA TCTCGGACAT TGAAACGTTG GTACAGACCT GGAGCCGCGG TGGCGCATTC TTTCACCGAG ACCAACACGA TGACACAACC AATCATACAC CGTGGCATCG TCGTGTCTGG CCTTTTGCGC GGAACAATAC AGAAGAAGCA ATTTTGGAGC GTGCCAGACT ACGCGAGGAG CGCCGAAAAC TTCGAATCAA AGAACGTGCT TTGAAAGCAG CGGCCAGGGT AGTGGTGGAC TATCGTCCCG TCATGAGTCG GATCATTGAC GACTACTTGT TTCGATGTCC GGCGTGGCAC TACGCTCATT CTTTAAGCCG CAACCGCATT TATCGTGGCA AGCGAAACAA TGTGTATGTG TATCAGTTCA GCCATTCGAC GCACATCCCA GGCTACGAGG AGTGCTGGGG CAAATCTTGT CATACGTCCG AGATTCCCTA TGTCTTTCAG GCCATGGATA TCATTCGGAG CAACTATTCT ACACTCGGTC CGCACGCTCA AAGGGAAGCC CCGTCCACTC CGGAGTACCC GTACACCGAT ATGTTGGTAG CGTACCGTGA GGCCATGGAT GCAGCCTATC GGCAATACGA TGATGAAGAG GACGCTGACG TGGAGACTCC CTCAAACGCT ACCAACCATG GTAGCACAAG CAGCAATCTC TTTCAGCACT CGATGCGATT TCAACGATTG GTGAATCACT TTTTTGGCGA TTACTTCAAA GAAGACGCGG ACGAAGAAAT CGCCAGTGAC ATGGCTGACC GATGGGTTTC CTTTGCCAAA ACAGGCGACC CAAATTACGA AGGCAGTAAA GCATACTGGC GACCTTGGCG ATATATACTG GACGAACGGT TGGGCCGAGA CAAGGAAAGA CCTTGGGAAC CTCAGGACTT TGACAAAATA TTTGATCCCG AGATCGAGGA CGACTGGGAC GAAAACGATA CCACCCTAAT TGAGCGGTAT GTTTGGTCAG ACGATCCAGG AGAACGTACC TACCGCCGCC GGGCGTTGCA CGCGCTCGCA ATGGAGGTTG TCGATGAAGA CGTCTTCCAA ACCATGCTAC GTCGGACACC AAGAGGTCAC GAAGACGATA ATCCTTTTAA CAGCTTTTTG TTCGGCAGCG CATCAAAACC AAAAGACGGT CACCAGGAAC GGCTTATGTC GCGACAAGCC ATGCGCCAGC TACAGGAGAT TGCTCAAAAT ATGGGTGTAC TGGGTACGGG GCTACAGGGG GAAGCGCGCC GGGGACACGT CGGCGATACC TGGGATGAAG ACTTCTTTCC TGAAATTTTG GAGCTCAAAT GGCCACCGGA AGGACGCCTC GTCGAACGTG ATTGTACTTG CGACATGTGG GACCGGATCC GATGTAAGCA ACCACCGTCC TTTGACTTGT TGAATTCTAT GCGACACTGT TGCTCACACC CCTTCTGAAA TCTGTGCCAT GCTTATGCTC GCCCCCTCTT TTCGACGATG GATGTTTTTC ATTAACATCA GACCGCTACT AGCTAAGAAA CTGTAAAATA TTATGTAGGC AGTTCATAAC ACACCGCT
|
Protein sequence | MYKSVCIIYC HVLLLLCLLH DDGWRAYGQE TEDILDTTTT RPVVVHLPDL GRVQGKRQSG IDFFGGLPYA APPVGHLRWA PPEPPAPWAP AKLDATHFGP DCWQLVDPLL NPGAEVARMS EDCLYLNVFT PAGHASRHER LPVLVWLHGG AFQQGGARRS EYDGRRLAER GTIVVTINYR LGALGFLVSS VDGLFGNFGL MDQRAALHWV QENIAKFGGD PDSVTLFGES AGAVMTGLHL MMEGAGSLFH RAIIQSNPLG WQVRAIVVAD FIGEAMKRSV DCRDVACLRA ERVEEIMRAQ SSLMGVPRSV GDFFTWGPTL TEELKLTVGG RTPFGSTSPL SREHVMFRDL DSWKWQNNRD TSWAAVNVTQ PLKNLNLIPD DIPVIIGANK HEGEMFVHGA FPITMSKTVY WMFVGALFRD SASRVLKHYR AYVDQIEREA EELARCQIEE EENRQYYLEH KEQLDHEYQL LLEMNSTKEG VEAISDIETL VQTWSRGGAF FHRDQHDDTT NHTPWHRRVW PFARNNTEEA ILERARLREE RRKLRIKERA LKAAARVVVD YRPVMSRIID DYLFRCPAWH YAHSLSRNRI YRGKRNNVYV YQFSHSTHIP GYEECWGKSC HTSEIPYVFQ AMDIIRSNYS TLGPHAQREA PSTPEYPYTD MLVAYREAMD AAYRQYDDEE DADVETPSNA TNHGSTSSNL FQHSMRFQRL VNHFFGDYFK EDADEEIASD MADRWVSFAK TGDPNYEGSK AYWRPWRYIL DERLGRDKER PWEPQDFDKI FDPEIEDDWD ENDTTLIERY VWSDDPGERT YRRRALHALA MEVVDEDVFQ TMLRRTPRGH EDDNPFNSFL FGSASKPKDG HQERLMSRQA MRQLQEIAQN MGVLGTGLQG EARRGHVGDT WDEDFFPEIL ELKWPPEGRL VERDCTCDMW DRIRYRY
|
| |