Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_42768 |
Symbol | |
ID | 7196394 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | - |
Start bp | 1053398 |
End bp | 1055457 |
Gene Length | 2060 bp |
Protein Length | 599 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002177211 |
Protein GI | 219110919 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGAGCGTTTG ACAAAAAGTG TGAATGCACC TTGGCATTCC TGATCGATTT ACATCATTGG AAGGCTACTA GAATTGAGAA AGGGATCGCG TCATTATGAC AGGAGAACAT AAGCGAAAAG GATTGCTTCC TTTCGTTATT TATTCTCAAG TCCTTCTTGT AGCCACAGCT TTTGTCCGAC CTAATGTGCT GGCTGGACGC TTCCCCCCCT CGACTTCATT GATATCTCTG CCATCTCAAG CAAATGTTGA TCGCGGCTCA AACATTTCAG CCACCAATCA AAACGAGGAG ATTCTAGTAA AAAACAACAA GTACATCGAG GGTCTACTTG AAAACTTGAC TAGCCTGTTG GATCGCTGGA TCATTACGGG ATCTCCGAAG CTTCAAACCA GAATCGGCAA CGTGCTGAAC ATAATTGAAG CCGAAGCAAA AGAACCAGAG CTGATAAAAA AAGCCCAGCG AAATATTCAG CGAGCGGGAT TGTCTTTGGA ACGTAGACAG CCCGAGAAAT CTACGGTGGA CCTCGGAAAG ACAGACGAAG AGAAGCGGCG CACGGAGGCT GAACAGCGGC GACAATGGGA AGCAGATAGA AGATCTAAAG CGGTTGTCGA CCAACACTCT AGCCGCTCGG TATTGAGCAG TAGAACCACC CCGAGCCCAA CCGGCGAAAA TTTTATGAAA CAAGTGGATT CCAGTTTGTA TCCTCAAAAC TTTGCAAGGG AGAAAATCGA ACTCCAGAAG TCCCTTGAAG GAGACAACGT AGACAACCAG ACGGAGACTC TAGCAAATGA CCCTGGAAAA TCTGCATCGG CAAAGGTTTC AGAGTTGGTA GCTCTAGCTG GGGCTGGATC ATCTTTTGAA GGGCAAAATC TAGGGATTGG AGGGCTGGAT GATGTCCTTG CAGAAGTCAA GCGACGGGTA TGGACTCCTT TGGCCGCACC GCCACTTCTC TTACAGGAGC TTGGCATTCA GCCTGTTCGT GGACTACTGT TGTACGGTAA GCCGGGGTGT GGAAAAACTT TGCTTGCGCG GAAGCTGGGT CAAATGCTTT CGCCATTGCG TCCGATTACG GTGGTTTCCG GTCCAGAAGG TAAGTTCATG ATTTTGGCAC GCCATTTCCT GACAAAGATC AGAACTTATC GTTTGGCTCT CTTGCTCACT TACATTAGTC ATGGACAAGT TTGTTGGCAG CAGCGAGAAG AATCTTCGGG AGGTTTTCGA TAATCCTCCG GACATATATG ACTACTTTCG TATTAGGGAA AGCGACGGAG GAGAATCAGT TGAGCGAGCC GCTCTTCACG TCATCGTCAT GGACGAATTC GATGCCATCG CTCGATCCAG GGGAGGCCGT GGTGGCTCTG GAGATCAAGG TGATGCTGGG GTCGCCCGCG ATAGTGTGGT CAACCAATTA CTGTAAGTTT TGATCGCCTT ACTTCCTGCT GGGCAGCTGG CTTTTTGCCT TGTAAACAAT CACACTAATT CTTAATTGTC TGTGGCAGTG CTAAAATGGA CGGAGTAGAC ACCCTGTGTG TTCCCACACT TGTTATTGGG CTTACCAACA AGCGAAGTCT TATTGATCCT GCTCTTTTAC GACCTGGTCG TTTTGAAGTG CAAATCGAGG TGCCACCGCC ACGGACGGTC GATCAGCGTG TGAGTATTTT GCAAGTGCAC ACGCGGAGTA TGCACGAAGC AGGCCGGCTC TTGGTACGCG ATGCCCCCAT CGGTTCCGCA GCGGCTAGGC AAGCCACTAT AGACCTACCG TCCTATGACG AACTCCTCCA AATGTTAGCA GCTGAGTGCG ACGGCTTTTC TGGAGCGTCG TTGGCCGGGG TCTCCCGAGC GGCAGCGAGC CATTCGTTGG AGCGTGCGAT AGAGGTGTTT GCATCACATG CCCGCAGTGG CTCTTTGCTC GAAGCGTGTG TTGTAACTCG AGAAGACTTT TCGAGTGCAA TCAATGATGT ACTAAATAGT GTGGGGACTG ATGACTTTAA GGAAGAGGCG TCTGCTGATA CAGAAGACGA CACAAATGAC AACGACCCAG ATACAGACGA GGCAGATTAA
|
Protein sequence | MTGEHKRKGL LPFVIYSQVL LVATAFVRPN VLAGRFPPST SLISLPSQAN VDRGSNISAT NQNEEILVKN NKYIEGLLEN LTSLLDRWII TGSPKLQTRI GNVLNIIEAE AKEPELIKKA QRNIQRAGLS LERRQPEKST VDLGKTDEEK RRTEAEQRRQ WEADRRSKAV VDQHSSRSVL SSRTTPSPTG ENFMKQVDSS LYPQNFAREK IELQKSLEGD NVDNQTETLA NDPGKSASAK VSELVALAGA GSSFEGQNLG IGGLDDVLAE VKRRVWTPLA APPLLLQELG IQPVRGLLLY GKPGCGKTLL ARKLGQMLSP LRPITVVSGP EVMDKFVGSS EKNLREVFDN PPDIYDYFRI RESDGGESVE RAALHVIVMD EFDAIARSRG GRGGSGDQGD AGVARDSVVN QLLAKMDGVD TLCVPTLVIG LTNKRSLIDP ALLRPGRFEV QIEVPPPRTV DQRVSILQVH TRSMHEAGRL LVRDAPIGSA AARQATIDLP SYDELLQMLA AECDGFSGAS LAGVSRAAAS HSLERAIEVF ASHARSGSLL EACVVTREDF SSAINDVLNS VGTDDFKEEA SADTEDDTND NDPDTDEAD
|
| |