Gene Information |
Locus tag | PHATRDRAFT_54342 |
Symbol | |
ID | 7200297 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 185127 |
End bp | 186882 |
Gene Length | 1756 bp |
Protein Length | 585 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179379 |
Protein GI | 219117169 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGAATA GCGACCATGG TGGCAAACAG CGGATCCTTG CACCAGCGGT CCAAGCGCAT TCCTTATCAA GGCCAGCTGT CGTTCTCTTT GGAACTCATG ACATCCGAAT TCACGATAAC GAGGCCTTGC TGTTGGCCTG CCATCACAAC CACGTATTGC CGGTTTTTCT ATGGCAAGTA CCGGTCCACC ATTGGGGAGC TCGTGGCGCC CTGCAAGTCG TGTTGAAAGA AGCATTGCAC CAGCTTTCAC TACAGCTTTC ACAAGAAAGT ATCAATCTAC CTCTGGTGTG CGGCAATACG GCGGACAGTG TTTCGGAGCT CTGTAAAATT GCTTCTGAGA TTGGCGCGAG TGCAGTCTAT TGGAATCGCG AGATGACACC TGAAAGCAGG GAAATGGAGA GGCACAGAGC CACAAGTTTG AAGCAACTCG ATATCGCAGC TGTAGCATGT CAATCGGCCC TACTCTACGA TGTCGAGAAA CTTGAACTTG ACGAGGGGTT TCATGGTGGT CACTGGGGCA CGCTGATGCC GTTTAAGAGG GCTTGCGAAA AGCAACTTGG AAAACCGGAT CGACCGATTT TGATGAAAGA GTCGCTGGCT TGCCTATACT CCGTCGCTCC GCCTCCGCAA GACTGTAAAT CCGTACCCAT CGAGGAACTC GGCTTCGAGA CCGTGCCGCC TTCCACAAAG TGGGATGAGC CCATTCGAGA GCGATTCCCA ATGATACATT ACTTGGCTCA GCGGAGACTG GACCACTTTC TTATAAAAGG GCTACCTCTG TATGAAAGTG ACCGAAGTCG GGCAGACATG GAATACGCGA CTTCGCAGCT TTCAGTGTAC CTTCGTATTG GTATCATATC ACCACGAGAG CTATACTGGA GGATTGAGGA CAGCTCACTG AGCCCTGAGG CGAAGAAAAC GTTTGCCCGT CGACTAATCT GGCGTGAGCT GGCGTACTAT CAACTATTCT GTTTTCCAAA GATGAGGGAC AGATCGATAC GCAAGCACTA TGAAGCATCG GAATGGGTCA CGGGTGACGA AGAGAAAGGC AGATTCAATG CATGGAAGAG AGGGTTGACT GGCTATCCGT TGGTGGATGC TGGGATGCGT GAACTCTATA CAACTGGTTA CTTGACCCAA TCCGTGCGGA TGGTTGTCGC GTCATTTCTT GTCGAGTATC TTCGAGTCGA CTGGACCAAA GGAGCAGAAT GGTTCCACTA CACTTTGGCC GACGCCGATA GCGCGATCAA TTCGATGATG TGGCAGAACG CTGGGCGGAG CGGCATCGAC CAGTGGAATT TTGTTTTGAG TCCTGAGAAT GCATCCCAAG ACCCATACGG AGAATATACT CGCAAATGGG TCCCCGAGCT TTCTCCGTTG CCATTGCAAT ACTTACAGCG ACCTTGGCAG ACGTTTGAAG GTGATCTTCG TATGGCCGGT ATCGTCCTTG GTGAAACATA CCCACATAGA ATTGTTCAGG ACCTCAAGGG TGAACGACAA AAAAGTGTCG AGAGCGTTCT TGCAATGAGA AGGCGATCGC AAGAAAAAAA TGATGAAAAT GGATACGACT TGATCGACCT TCCTTCGGGC ATCGAAACGG TCGTTTTTAC GAAGAAAGAG TACCGTATTG ATCGGTTGGG CAAAGTGCTC CAGGGAAAAC CAAAGACCGC TACTTCAACA GTCAAGCGCC GAAAAACAAA ACGTACAACG AAAACAGATG GGAGAAGAAA GAACCGGCTG CCATCAAGTC TCGCAT
|
Protein sequence | MSNSDHGGKQ RILAPAVQAH SLSRPAVVLF GTHDIRIHDN EALLLACHHN HVLPVFLWQV PVHHWGARGA LQVVLKEALH QLSLQLSQES INLPLVCGNT ADSVSELCKI ASEIGASAVY WNREMTPESR EMERHRATSL KQLDIAAVAC QSALLYDVEK LELDEGFHGG HWGTLMPFKR ACEKQLGKPD RPILMKESLA CLYSVAPPPQ DCKSVPIEEL GFETVPPSTK WDEPIRERFP MIHYLAQRRL DHFLIKGLPL YESDRSRADM EYATSQLSVY LRIGIISPRE LYWRIEDSSL SPEAKKTFAR RLIWRELAYY QLFCFPKMRD RSIRKHYEAS EWVTGDEEKG RFNAWKRGLT GYPLVDAGMR ELYTTGYLTQ SVRMVVASFL VEYLRVDWTK GAEWFHYTLA DADSAINSMM WQNAGRSGID QWNFVLSPEN ASQDPYGEYT RKWVPELSPL PLQYLQRPWQ TFEGDLRMAG IVLGETYPHR IVQDLKGERQ KSVESVLAMR RRSQEKNDEN GYDLIDLPSG IETVVFTKKE YRIDRLGKVL QGKPKTATST VKRRKTKRTT KTDGRRKNRL PSSLA
|
| |
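The coordinate and sequence fields in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only: the helper names (`clean_sequence`, `span_length`, `gc_fraction`) are hypothetical and not part of any database API, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the spaces in the sequence fields are display formatting (10-character blocks), not data.

```python
def clean_sequence(blocked: str) -> str:
    """Strip the whitespace used to display a sequence in 10-character blocks."""
    return "".join(blocked.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (whitespace is ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


if __name__ == "__main__":
    # Coordinates taken from the Start bp / End bp fields of this record.
    print(span_length(185127, 186882))
    # First two 10-base blocks of the Gene sequence field.
    print(clean_sequence("ATGTCGAATA GCGACCATGG"))
    print(gc_fraction("ATGTCGAATA GCGACCATGG"))
```

Under these assumptions the coordinate span works out to 1756 bp, matching the Gene Length field. Note that 585 amino acids correspond to 1755 coding bases, one fewer than the reported span; the record itself does not explain the difference.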