Gene Information |
Locus tag | PHATRDRAFT_48678 |
Symbol | |
ID | 7194911 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 583732 |
End bp | 586152 |
Gene Length | 2421 bp |
Protein Length | 806 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183122 |
Protein GI | 219125720 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
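The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. Neither convention is stated in the record, although the values are consistent with both. The function names are hypothetical.

```python
# Values copied from the Gene Information rows above.
START_BP = 583732
END_BP = 586152
GENE_LENGTH_BP = 2421
PROTEIN_LENGTH_AA = 806


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Codon count minus one stop codon, for an unspliced CDS (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 586152 - 583732 + 1 gives 2421 bp, and 2421 / 3 - 1 gives 806 aa, matching the Gene Length and Protein Length rows.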
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.26048 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGGTGT TGGAAACAGC AAATCATTGG GCCTCCCATC CTTCCGACTC GGAGAAGGAT CCGAGCAAAC CAGAAGAGAA GATGAGAGCC GCCTGCGCCG AAGGAAACGG GATTGCTGAG AAGGAAACAA ATGCGGTAAA TGCCATTCGA TTGGCTACAT TTGCAGTCCT TGTGCTGGTT GCCCTCACAG TCTCTCTTTT GGTATACTTT CATACTAAAG ATACTGAAGA AGGTGAATTT GCGGCTCAAT TTATTTCGCA TGGGGGGAAA GTCGTCGATT CATTCCGCGC GAATGCTGAG AGTCGGCTCG CTGCATTGAG TAGTTTTTCT GCGTCTGTCA CGTCCTATGC GTTGCACTCG AACCAAAGCT TTCCATTCGT CACCCTACCA GATTTCGAGC GCAGAGCAGC CTTTACGTTG CAGCTGGCGC AGGTGCTTTC AATAGCCGTA AACGCAATTG TTAGCCAGGA CAATCGTGCC GAATGGGAAG CGTACTCAGT CCTAAACCAA GGATGGCTCG CTGAAGGTCT TTCACTCCAA CAAGCGGTCG TGGACGAGGA TGAAAACGAA TCAGTTCAGC AATTTCAAAA TCAAACATCT TCGGGTGTTC TTGGCGAAGA TGTTACAAGC AGCCTAGACA TTGTCCCGTT CATTTTCAGC CTAGAAGAAG GCGGAACTAC ATCAGCGTAC GAGACAGGTC CTGGTCCGTT CGTTCCAATT TGGCAGATAG CCCCAGCGAT TCCTCTTCCG TCGCTTATCA ATTTCAATGG GCTCACTCAT CCAACTACAC AAGCAGAGTT GCGCACGGTT CTTGCGACAG GACAGAGACT TGTCGGTCCT GCCGCAGACT ATTCTGACGA CTTGGACCCA AGCATAGCGG GAAGGAAAGC GCTACTCAAT CTTTTCTTAA ATCGCTGGAA GAGTGGGGGG AATGACTATG AGGAGGGACC GGTCAGCGAA ATTTATGTTC CAATCCAGGA TAGATTTGGC CCAAATAGCA CGATGGCTGG TGTTTTCTCC TCAACCATAT ATTGGCAGGT CTACTTTACG GATATTTTAC CAGAAACAGC GCAAGGTGTT ATCTGTGTGC TGGAAAACAC CTGCTCCCAG AGCTTCACTT ACGTGATCAA TGGAGCTCAA GCGAGTTACC TTGGTCAGGG CGATCTACAT GACCCGTCTT ACGATGAATA CATGATTGAA ACCGGATTTG GCGCATTTAT TGGACGCGAC AACGTAGCAG CAAGTCGAGA TGGAAATTGT TACTACAATG TTCGTGCGTA CCCATCGAAA GAAATGGAGG AATTATATAT CACGAGGGAG CCTTTATACT TCACTCTTAC TCTGGTTGCC GTTTTTGTGT TTACTTCTCT GGTATTTGTC GCGTACGATT GTCTGGTACA ACGCCGTCAC ACTGTTGTCA ACAAATCCGC TCTACAATCA AATGCGGTTG TCTCCTCTCT CTTTCCTGAA GAAGTTCGCA GCCGGTTACC CAGTCTGTAC GCCTCGAAAA CTGAACGGGA CGCCGCAACC AAATCTATGC AGCATGAGAA GGATGATAAC GATGACAGTT TTGACGACTA CTACGACGAT TCTCTTCCAA TTGCTGATCT TTATCCGAAC TGCACTGTGC TCTTTGCAGA TATTGCAGGG TTTACTGCAT GGAGCTCCAA CCGATCCCCG ACCGAGGTGT TCAAGCTACT AGAGACAATG TACGGCCTTT TCGACAAGAT TGCGCACAAG TATTCCGTCT TCAAGATCGA GACTATTGGA GACTGTTATG TTGCAGTGAC GGGCCTTCCC 
AAGCCTCAAG AAATGCATGC AATCATCATG TGTCGTTTCG CCAACGCCTG TATCGTACGT ATGAGCCAGA TGATGCACGT TTTAGTGGAA AAATTGGGCC CGGATACTGC AAATCTCTCT ATGCGTGTTG GATTGCACAG TGGCCCAGTG ACGGCTGGAG TGCTGCGCGG TGAAAAGGCC CGCTTTCAGC TTTTTGGGGA CACCGTCAAC ACAGCAGCTC GTATGGAAAG TACGGGGCAA AAGGGACGAA TCCACGTTTC CGAATCCACA GCTACATTGC TGATCAACGC GGGGAAACAG GCGTGGATTA ACGCACGCGA CGAGCTTGTA CAGGCCAAGG GGAAGGGTGA GATGCAAACG TACTGGGTCA AGCCTCCGGA TGTTGGTACT AAATCTACCA CAACCACTTC TTCGGGCCCC AGCGGCCGGG ACCTGTCTCT CTCGCAGGCT CTGCTTGAGG CGCATAGTTT GAAGATGGAC CAGAAGCTTT CCGAAAGCAA AGCGAGCGCG CAAAAGTACG AAGACTTACT TGATAGCTTT CGTGAGATTG AGGTGTCCGA AAAGAAGAAT GAGGTGGAGT CTTCTCCTGA GCCAAGAAAA AAAGAACACC GTTTCGTCTA A
|
Protein sequence | MPVLETANHW ASHPSDSEKD PSKPEEKMRA ACAEGNGIAE KETNAVNAIR LATFAVLVLV ALTVSLLVYF HTKDTEEGEF AAQFISHGGK VVDSFRANAE SRLAALSSFS ASVTSYALHS NQSFPFVTLP DFERRAAFTL QLAQVLSIAV NAIVSQDNRA EWEAYSVLNQ GWLAEGLSLQ QAVVDEDENE SVQQFQNQTS SGVLGEDVTS SLDIVPFIFS LEEGGTTSAY ETGPGPFVPI WQIAPAIPLP SLINFNGLTH PTTQAELRTV LATGQRLVGP AADYSDDLDP SIAGRKALLN LFLNRWKSGG NDYEEGPVSE IYVPIQDRFG PNSTMAGVFS STIYWQVYFT DILPETAQGV ICVLENTCSQ SFTYVINGAQ ASYLGQGDLH DPSYDEYMIE TGFGAFIGRD NVAASRDGNC YYNVRAYPSK EMEELYITRE PLYFTLTLVA VFVFTSLVFV AYDCLVQRRH TVVNKSALQS NAVVSSLFPE EVRSRLPSLY ASKTERDAAT KSMQHEKDDN DDSFDDYYDD SLPIADLYPN CTVLFADIAG FTAWSSNRSP TEVFKLLETM YGLFDKIAHK YSVFKIETIG DCYVAVTGLP KPQEMHAIIM CRFANACIVR MSQMMHVLVE KLGPDTANLS MRVGLHSGPV TAGVLRGEKA RFQLFGDTVN TAARMESTGQ KGRIHVSEST ATLLINAGKQ AWINARDELV QAKGKGEMQT YWVKPPDVGT KSTTTTSSGP SGRDLSLSQA LLEAHSLKMD QKLSESKASA QKYEDLLDSF REIEVSEKKN EVESSPEPRK KEHRFV
|
| |
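The sequences above are printed in space-separated blocks of ten letters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a basic sanity check for a coding sequence. The helper names are hypothetical, and the stop codon set assumes the standard genetic code, since the Translation table row in this record is empty. The example input is a toy string, not the sequence from this record.

```python
# Assumption: standard genetic code stop codons (the record leaves
# the translation table unspecified).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize_sequence(text):
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def looks_like_complete_cds(seq):
    """True if seq has whole codons, starts with ATG, and ends with a stop codon."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    toy = "ATGGCCAAAT AA"  # toy input in the same ten-letter block layout
    cleaned = normalize_sequence(toy)
    print(len(cleaned), looks_like_complete_cds(cleaned))
```

To apply this to the record, paste the full Gene sequence value into normalize_sequence and compare the resulting length with the Gene Length row.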