Gene Information |
Locus tag | PHATRDRAFT_50125 |
Symbol | |
ID | 7198927 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 127248 |
End bp | 129251 |
Gene Length | 2004 bp |
Protein Length | 425 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184971 |
Protein GI | 219129597 |
COG category | |
COG ID | |
TIGRFAM ID | |
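The coordinate, length, and protein fields in the table above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, which is consistent with the reported Gene Length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 127248
end_bp = 129251
protein_length_aa = 425

# Assumption: coordinates are 1-based and inclusive,
# so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Minimum coding length: one codon per residue plus one stop codon.
min_cds_bp = (protein_length_aa + 1) * 3

# Bases within the gene span that the coding sequence does not account for.
noncoding_bp = gene_length_bp - min_cds_bp

print(gene_length_bp, min_cds_bp, noncoding_bp)
```

Under these assumptions the computed span matches the listed Gene Length of 2004 bp. The coding sequence needs fewer bases than the span provides, which is compatible with the record marking the gene as spliced, although the remainder may also include untranslated sequence.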
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.806954 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCGACACTC ATGCGTGTCA TTGCAGGAGT CGGATCGATC ATTCCAGCAC ACTCACCCCC CTGTTTGTTG CAGGAGTAGG GAGTAGTACT AGCTACCAAC CTTGGTTGCT AGGATTGCTC TTGGTCATTC CGTTACGACG CGAAGGAACA AACGGGGTCG TTGCATTGAC TTAGTAGATA GACAGCAGCT GCCGCCAACA TGGTGGTCGT TTACACGCCG CTATCTATCT GGGTCCAGTG TGCCAGTTTG GCCTTTACCC TGGGCGGTAT GCACGGTGAT TTATTGGCCA TTCGGTTTTT CCTCTTTCTC GCGTACGTCT TTTTGTTCCT CAACGCCTGC TTGGGATCTC CCCTGTGGGG TGCACCCACC AATTCGGGAG GAGTAGCCGT GGACAGCCTA CTCTGGGCAG TTTTGAATAT GTACGTGCAC GGCTCATCGC TCGTCCGACT CGTTCTGGAC GAGCGACCCG TGCACTTGAC CGAAGAGGAA GACGCACTCT GGCGTATGTT CTATCGCACC GGTGGACTCT CCAAACGACT CTTTCACGCA ATCCTCGTCC CGCATTTGGA AGTCATTGAG GCACAAGCCG GCGACGAACT CCTCACGGAA GATTTCTTTT ACATTCAGTA CCACGGAAGG GCGCATTTGC AAGTACTGGA CGGCGAGCGC CTGGTTGCGG ATCGCTATAC AAGGTCGGGA GAAATGTTTG ATTTCAAGTG TCTAGGCAGT AAGTACTACT AGTAAGGAGG CAAGCTACCG TAGCTAGCAT ATCCTTGGTT ACAATGACAT CTTTCTCACG CCTAACGACT TTTGCCACAC TCGTGTCCTT CCCGCACAGT GTTCCCCGCC AACACCATGA TAGCCAAACA CGTGGTCAAA TGCCGGTGCG AGAACCGCAC GAAACTCTTT CGATTTAGCC GCGCCAATAT GGAACGGATT GCACACCACA ATTTTGTCAA GGGCATTTGG CAATCACTCC TCATCAACAA TATATCGTTC GTGCTCGAAC GCCAGCGCGT GGACGACCGG GATTGCATCC TCCCCGAAGG AGCCTGCGAC GCCATCTTTG CACCCCTCCA GCCCTGGGAA GAACCGAATT CCATTCTGGC TGGATCCAAT AAATCCTTAT CCCGCCCCAC GGCGCATTTG TTGTTTTCCA TTCACAAGTC GTTCAGTCCC CCGTGGCCCT TTGGGGGACA CCCAACGGGG ATGCGGCAAG TGCAGCTGCC ACCACCCGTG CCGCCCCCGT CGGGCGATCC GCAAGCCTAT CCACCGTTAC ACAATATCTC GTCGCGTGGC TGGCATGACG AGGATGAGGA CGACATCGGC ACGACCGGCC ACCGCATGCG CACCTGGTTA TGGTCAATTC CTCGGCGAGT CATGTCCCGA CGCAATTCGG CGGAATCTTC GTCGTCTACC CCTGCCCTGG GCGAACTAGA GCCCGACACC GTTGAAGACG ATTTCGAAGG AAAAGTTGCC ACCGAACTGT TAGTGGAGGC GACCGAAGAT TGGCAGGTAG CAATTCACGA TCGGCCGGAA GCAGCTGCCA GAGCATCTCG TGAGAACGAA AAGACTGCCA ATGTTTGAAG GTTTTTGCGC AAACCGTGAT GGACAATATC GCGGTCCGCA ATTGTGTTGA TGGGTCGGTT GAAAGTGTGA GCTATGAACT TGCCGGCCGG GATCGCGTTC GATCCGCCAT GCCAAGGGTG AGAATTTTGT GTACGGAATA ATTGTATATC GAGAATGTAT TGATTCCCGG TAACCAGTTT TCGTGATTCC TCTCGGGCAC CATATTTCTG 
GTGGCGGTCC GGAGTAGAAT ACCTTTTAAT GTCCGGTCAA TGTCGGTCAA TCAAAGGACC ATAATTATCT TTGTTCTTGG TGTCTCGTCT GGTGCGGTAT CGCCGTTCGA CTGAACTTCC AGGATTGTTG CCGGTGGGAA AACGAAATTA TCACGCAGAC ATGCGTCATA GATTGTAAAT ACAGTTGATT CGGATCTGTC GTTC
|
Protein sequence | MVVVYTPLSI WVQCASLAFT LGGMHGDLLA IRFFLFLAYV FLFLNACLGS PLWGAPTNSG GVAVDSLLWA VLNMYVHGSS LVRLVLDERP VHLTEEEDAL WRMFYRTGGL SKRLFHAILV PHLEVIEAQA GDELLTEDFF YIQYHGRAHL QVLDGERLVA DRYTRSGEMF DFKCLGMFPA NTMIAKHVVK CRCENRTKLF RFSRANMERI AHHNFVKGIW QSLLINNISF VLERQRVDDR DCILPEGACD AIFAPLQPWE EPNSILAGSN KSLSRPTAHL LFSIHKSFSP PWPFGGHPTG MRQVQLPPPV PPPSGDPQAY PPLHNISSRG WHDEDEDDIG TTGHRMRTWL WSIPRRVMSR RNSAESSSST PALGELEPDT VEDDFEGKVA TELLVEATED WQVAIHDRPE AAARASRENE KTANV
|
| |