Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_45242 |
Symbol | |
ID | 7200258 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011674 |
Strand | - |
Start bp | 583445 |
End bp | 585724 |
Gene Length | 2280 bp |
Protein Length | 752 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179463 |
Protein GI | 219117337 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00924108 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACGGA TCGTGCCGCT GAATTGGCCC GAATCGAAAC TAACGCGCAG CTCTGCTTCA CAAAATGACA CGATCACCTC TGTGCTGCAT TTGACTGTCA GGAACGAAAG CCTCATGAAG GAACCCTTTC CTATGCCGCG ACTCCAAACC AACCAAAGAC CAGAGCAAAG GCAAGACCAG CAAAATGTGG ACGATTATAG CGTCGAAAGT CCTAGTTTTC CATGGTTGTC CAGGCTTTTC CCGAGTAGAA AACCACCAAT GAAGCCTACA ATTCTAGTGC TATCGGCTTT GTCCTTCCTG TTTGCCACCC TCCAGCTATC CCACTGTAGC TTTATGCATA CTGGCTTGCC AGGCTCCGAA GTCGCCATGG GATTGTTTTC GACCACTGCC TACGACAAGA ATGGCGCTTC GATCGGATGC CTCGCCTATA CCGATCAGAT CCAGATGGAT GGCATGTTTC GCGCGGGACG AGCTTTCGGT ACTATAACCG CACTGTGGAC AGGTACTGCA TTTCTATTGC TTCTTGTCGC TCTCACATTT TGCCCACCGA GGACCGAGAA ACTCTGGAAC ATTGCACGTG GCTTGTTAAC AACTGCGACG ATTACGCAAA TGTTCACCTT TTTCGTCATG GGTAGTGAGC AATGCCCAAC CAACGACTGC TTTCTGTCAG GAGTGGGAGT ACTGGCCGTC TTTAATACCT TTGTTTTGCT TGGAGTCAGT TCAATGACCT ACCTTGAGGC GATTCCCGTC GAGCCGTGGT TGGTACTGAG TAACGACGAC TGCGACGTTG CGCACAACGA CACCTCGGTG ACACCTATCG CGCAAGAAGA ATGTGTGCCA CACGCGAGTT GCAGTACCAC TGGTATATCA AGCAACCCGG CCAATCTGCT TGACACAGTC TTACCCAATC CTGAGGTCTT GGAAATGAAA GATGTTGATC TCAACGACAA GCACGATGAG CCATCCGATG CTGATCCTTC CAGCTCGGAC GCTTCCGTAC AGAGTCACGA CCCAGCCGAA AACGTAGCGC ATGAACGGGA TACATTTATC AGTACAATTG GCACGCACCA TCGGAATAAA CTTCGGCTCT TGAGCTTCGG GCTCGTTCTC GCGGCATGGT CTATCAGTTT AGTTGGGATA CAACGTTGTA CCTTTATATT AGTCGGACTC CGAGAAACGG GGAAGTCAAA TTATTCTGGT TTGGGTCTCT TCAGTCGAGC GGTGTACTAC AACGGCGAAA TACTGGGATG CGTTGCGTAT CCCGATGAAG TCCGGGGTGA CTTTGATTCG GTCTTTCAAG CCAGCCGAGC TTTCGGTGTC TTCACGGTGC TTCTTTTGAC AACCGTCACC ATTCTCTTCT GTTTGCAGCT CTTTACTAAC AAAGCCAAAT CACCAATCTG GCTTGCTGTC CGCGTTTTGC TTACTTGTGC TGTCATTACC CAGCTACTGG TCTTTCTTGT TTTTAAGTCG GACACTTGTT CGATCAATAA CATGGTAGAA TGCGTTCCAG GTGGTGCAGG CATTATGGTT GTCCTTAATC TCTTTCTAAT ACTAACTTTG GCAGTCTACA CGAACAAAAT GGAACCTCCG CGGAATCCTG TATTCCTTTC GTGGCGAAAC AACAATCACG AAGCGCTTCT TCCTGTGGGC CAGAAGCTAC CGATACGGAG CGAAGAATTG CGCCAGCTGC CGGACGGGGG AACCGAAACA GAAGAATCAG GCCACATTGG TGATTGTGCC AACGAAGCTC GCTTTCCCGA CGTTTCCGAA AAACAAAATT ACAGTGCCAA CGCAGGCCCC TCAAAGCACG AATATAAAGA AGATACCTTA GACGATATTG ATATGGTCAA GGTGCAAGTG AAATTCACAG CAACAGAGAA GAAAATCGTC AAAGAAGTGA CGCATGCGGA CGGTTCCAAG ACTATAACTA CAACTGTCGA AGAGCTAGAT ATTGCATGCA ATGATGACAA TGGCAGTAAA CACAATCAAC GCGCCACCAT AGCGCTGGCT TCGTTTCCTT TGTCCAGCGA AAAGAGCGCT GTGCAGGAAA GTGCACAACT CTTACCAGAA AATAAATGCA TTCACCATGA AGGTTGTCCA ACGCATTTTT CAACCAAGAG CAATACGGGC TGGACGTGTG AAAATCACAG TAACCATTCC TTTCAGCTCA CCCTGGGAAA GAAGACAAGC AAGCATCTTG CGGAGCAAAA GAATGCAGAT ATGGAAGAGA TTGACAAGTA CATCAACCAA AACTCGTGAG TAGAATTCAT GTTTACTTTT
|
Protein sequence | MQRIVPLNWP ESKLTRSSAS QNDTITSVLH LTVRNESLMK EPFPMPRLQT NQRPEQRQDQ QNVDDYSVES PSFPWLSRLF PSRKPPMKPT ILVLSALSFL FATLQLSHCS FMHTGLPGSE VAMGLFSTTA YDKNGASIGC LAYTDQIQMD GMFRAGRAFG TITALWTGTA FLLLLVALTF CPPRTEKLWN IARGLLTTAT ITQMFTFFVM GSEQCPTNDC FLSGVGVLAV FNTFVLLGVS SMTYLEAIPV EPWLVLSNDD CDVAHNDTSV TPIAQEECVP HASCSTTGIS SNPANLLDTV LPNPEVLEMK DVDLNDKHDE PSDADPSSSD ASVQSHDPAE NVAHERDTFI STIGTHHRNK LRLLSFGLVL AAWSISLVGI QRCTFILVGL RETGKSNYSG LGLFSRAVYY NGEILGCVAY PDEVRGDFDS VFQASRAFGV FTVLLLTTVT ILFCLQLFTN KAKSPIWLAV RVLLTCAVIT QLLVFLVFKS DTCSINNMVE CVPGGAGIMV VLNLFLILTL AVYTNKMEPP RNPVFLSWRN NNHEALLPVG QKLPIRSEEL RQLPDGGTET EESGHIGDCA NEARFPDVSE KQNYSANAGP SKHEYKEDTL DDIDMVKVQV KFTATEKKIV KEVTHADGSK TITTTVEELD IACNDDNGSK HNQRATIALA SFPLSSEKSA VQESAQLLPE NKCIHHEGCP THFSTKSNTG WTCENHSNHS FQLTLGKKTS KHLAEQKNAD MEEIDKYINQ NS
|
| |