Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_38112 |
Symbol | |
ID | 7203050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011683 |
Strand | + |
Start bp | 101823 |
End bp | 103862 |
Gene Length | 2040 bp |
Protein Length | 679 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182157 |
Protein GI | 219123699 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.189423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCTCT CTCTCGCCAT TGCAACACTC GCCCTTTCTG CTCAGACAAG CATTGCTGTT CTCCGGAACG GCGCATTGTC AACGGATGTC TTCATCCCTA CTAAGATTTC GTACCTGGCT CTCGTGGATG ATTCCTCCTT CACGGAGTCT ATCAAAAAGT CTATGATGGA CGCCCTTCAA GAATCTGGTA TGGTGTCGAT CACTGACATT CCTGGAAAAC AAGTCAAGGA TAAAGCCCTC TCTTGGGACC TTCAAGCCTG TCTCCAAGAT TCTGAGGCCG CCAAGGAGCA TACCTTCCAG GATGGAACAG TTCGCCGTAC ACTTGCTACA CATACCGTAC CAGGTGGAGC CCAGAACATC GATCATCGCT CGGAATCGCC TTCTTGCGAG GCCTTCTCAA AAGCTGTCGA CGACTTTCGT TCGAGTTCTG CCCAAGCCAC TCAAGCTTTT GCCAAGCGAC TTGGAAGCCT ATTGGAAGAA TCTGAAGATG TCCATCAAGA TGGAGCCCTT CTTTCAACAG CTGAAGGATA TGAATTTTCG GCGTTCTCTG ACGTAGTGGA GAATGGGGAG CATTTGGAAC ATTTTCATTC CTACCAGAAA AACTCCAACC AGGCTTCTGA AGAAACCGTC GCCTATCACA CAGATCAAGG CCTCTTTATT GCCTTTACAC CTGGGCGTAT GGTGCGGGAC CAGCCCGGTC ATGTCGAGAT CTCTACCGGG TTCTTCATTG AACTTCCAAG TGGAGCACGT GTTCACGTCA AATTTGATGA AAACGATGAT CTAGTTTTTA TGCTCGGAGA TGGCGTGAAC CAGTACGTGA ACCCCATGCT CCTCGACAAG ACCCATGCTG CCGTCACCTT GTTACGAGCA ACCCCCCATG CCTTGACTAT GCCCGAACAC TCGAAAAATG TAGCTCGTGT CTGGTACGGA CGCATGGTTC TTCCGCCCGC GTCAGCTCTT CACCCCAAAC ATGGAAAATC TTTCGGGCTT CTCCGAGAAG AAATGATCGA CGCATCTATC AACAATAATA ATGAGCATGC TCTTAGTTTT GGGTGCTCTT CCTCTTCCAT GACTGCCCGT CAACTTGAAG AGACATCCTG CGAAGAGGGT ACCTTGCTAT GCTGGCATCG CTGCATGGAC ATTGCAGAGG CTGGCGTGAG CCAAGAAATT TGCGCAGCCC AAAGCCTTGA CCTGCAGTGC ATCAACCCTC GTGGACAACT CTGGGACAAC ACCCACGGTG ACTTCTTCCC TGGTTGCGCA GATGCCAATA GCACGGAATT GAATACTCCC TATCCATCCC TACCTGGAAT TCCTCGAGAC AATGGAACCT GCTCTACCGA GGAATGGGAC GCGTTTGTCT CATCCGCAAA TTTCAACAAC TCATTTGAAG TGGGTAAGAA TGGAATCTTT ATGTGGGACG TTCTCGAAAG TGGCCAAATC AAGGGAAGAC TCGCCTATAA TGGCTTGCTT GGATATCTCG CGTTTGGCTT TGCGAACGTT GGGGGAAGTA AGAACGGAAT GCACGGCGCT ACAATTCTAT TGGCAATCCC TGGCGATAAC TATACTGCCA CTGATGGGTT CGAGTTGGAC GCTGGGCCCT CGGTTCAGGA ATTCGTCATC AGCCCTGATC CTGCCAAGAG CTCGTTTCGC TTTTGGATGG ATCCAGTCAA CGATTTTGTC GCAACGGCTC GCCAGAGTTC GGAAATTGAT TCTCGTAGTG TTGAGTCTAC CGAGTGTTTC ACTGCTCTCA CCTTCCAAGC CACCTCTATC AATGGTATTC CCTTCGATCT CGAGGGAACG GACGAGCTAA TTTGGGCCGC CAATGGTCAA GACTTCTTTG CAGGGTATCA TGGGCCTGAC GATCGTTCGC GTTTCTCAAT CGACTGGTCT ACCGGGGAGG CCTTGGCGAA TACTGGAGAG ACGGCCCCGC CTCCACCGGA AGAGGGAGAT CCTGCCGAAA CGTCTTCGGG ATCTTTCTAT GTATCCAAAA CTGGTATCGC TTTAGTTGCA GCACTCATCT CCATTGCTGA CTTCATTTAA
|
Protein sequence | MNLSLAIATL ALSAQTSIAV LRNGALSTDV FIPTKISYLA LVDDSSFTES IKKSMMDALQ ESGMVSITDI PGKQVKDKAL SWDLQACLQD SEAAKEHTFQ DGTVRRTLAT HTVPGGAQNI DHRSESPSCE AFSKAVDDFR SSSAQATQAF AKRLGSLLEE SEDVHQDGAL LSTAEGYEFS AFSDVVENGE HLEHFHSYQK NSNQASEETV AYHTDQGLFI AFTPGRMVRD QPGHVEISTG FFIELPSGAR VHVKFDENDD LVFMLGDGVN QYVNPMLLDK THAAVTLLRA TPHALTMPEH SKNVARVWYG RMVLPPASAL HPKHGKSFGL LREEMIDASI NNNNEHALSF GCSSSSMTAR QLEETSCEEG TLLCWHRCMD IAEAGVSQEI CAAQSLDLQC INPRGQLWDN THGDFFPGCA DANSTELNTP YPSLPGIPRD NGTCSTEEWD AFVSSANFNN SFEVGKNGIF MWDVLESGQI KGRLAYNGLL GYLAFGFANV GGSKNGMHGA TILLAIPGDN YTATDGFELD AGPSVQEFVI SPDPAKSSFR FWMDPVNDFV ATARQSSEID SRSVESTECF TALTFQATSI NGIPFDLEGT DELIWAANGQ DFFAGYHGPD DRSRFSIDWS TGEALANTGE TAPPPPEEGD PAETSSGSFY VSKTGIALVA ALISIADFI
|
| |