Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39943 |
Symbol | |
ID | 7195558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 421546 |
End bp | 423969 |
Gene Length | 2424 bp |
Protein Length | 807 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183993 |
Protein GI | 219127544 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCTACA CAAAGCTATT AGGCCGCACG ATTGGTACTT TGATTTTTCT GGTATCGATG GGGCAGATAT ACAACCACGA GCGGTTCAAC CGAACGTCAG ACCTTTCCAT CCATGCAGCT CGTTTGCCCT TTCTCAATGA TTCCTACGAT AAAGGCCAAA GAAACATCAG CCAAGTCATC CCATATACGG AAGAGACGGC TGATTTCGAA AAGAGCTCCA ATAATTCTGG GCATCGACAG AAGTCAGCGC CACCGTTGCC GTCGTGGGTC ACCGAATATT TTCACTGGCA TGCAGATGAA CGACAAAAGA TGACTCCCGA AAGCTGGGAA GAGAGACGCT ATCTCGTCTT GCGCTGCCTC GAATCTGACA AAAGGTGCGG CGGCACGGCC GACCGCTTAC ATAACTTACC CGTCTTGCTG CGATTGGCGC AAAAGTCCCA ACGAATTCTG TTTATTCATT GGGAAAAACC AGCACCGTTG GAAGACTTTT TACTTCCTCA ACCTCCGGAA ACTTCGGAAT TGCGTCTCGA CTGGAGACTT CCATCGTGGT TAAAAGAGCC CATGCAGCTG GGTCAGATAC CGATTTCGCG GCTTATGAAT GGTACGGACG GAACAGGCAT CATCGACTCC GATGAGCGAG TAGTGGCAAT GCGGTCCCTA CACGGATCGC TTTTTTACGA TGAATTGAAA GGGCCCGACG AACCATCGTA CAACAATATT TTAAAGTTGC TCTGGTATGC CGTCTTCGTA CCAGCTCCAA TTGTTCGCGT TCGTGTTCAA CTCCAGCTCG CCAGGCTAGG ATTGACACCT GGAGAGTATG CGTCGGCTCA CATTCGTTCC TTGTACATTG GAGACGAGAC CCATGTTGAT ACCTTGTATG TACACGCTGT CACATGCGCA GCGCAAGCTA CCAACAATGC TTCTTTCCCC ATTTTTGTGA CGTCGGATTC TCCGTTGGTG GAGGAACAGG CCGTTGTCTT CGGATCAGCC GTCCACCATC TACGTATAGT AACGCATAAC AGATCGGAGA CCTTACACTT GGATAGAGGC CGCGATTTTT TAACGCAAGG CCGATCTGAT AGCTGGAAGA GCATTGATAT GGACGGCTTC TACGATGTTT TTGTGGATCT TTATTTGGTT GCAAATAGTC GGTGCGTATC ATTTGGAGTG GGCGGATTCG GAAGGCTTGG AGCACTACTG AGTGCAGATC CATCCTGCGC TTATAAGTAT GCCCCGGCTA CTCGTAGTGA CCAATGCACG GTTCCAAAAC CAATACGAAA CATCTCTGCA GCAAAAGCTG TATTCACGTC GAGCTCGCTG TTTTCCTCTC CTTCTATTCT ACCAATTTAC AATTCCACAA CAATTCACTC GTGGAACAAT ACAAAGATGA TTCCGAAGTG GATGAAACAG TACTTTTGTT GGCACCAGGA TGCCCGTCGA TTGCTTCAGA ATGGAGAAAA GTCAGCTTCA GACTACAAGT ACTTGGTACT GCGATGCTTA ACGAAGAATA AGAAATGTTC AGGCGCCGCA GATCGGCTAA AGTCAATTCC AACTGCGATA CGAATGGCAT ACGACTCACA GCGATTGCTT TTTCTCAAGT GGGAAAGACC GTGTGCGCTA GAACACTTTC TAGTCCCACC TCGCGGTGGA CTGGATTGGC GAATTCCTTC CACTTTAGAG CTGGATTTTG AAGAAAAGTT CAGCTGGCGT GACAAGGCTA TTGTTCTGAC GGAAAGCAAT AATGTCGAAA AGGCTCTCAA ATCCGAAGAA GTGATCGTCA GTTTGAAGTC AGTACGCGAC AGAAAGTATT TTGAGGAACA AAGAGAACCA GGAGACTACA GTTTCGAAGA AGTATATCGA GAGGTTTGGT CTTCTGTTTT TGAACCATCC CCACCTGTGG CTCGCTTAGT CAGCACGGTC ATGGAGGAGC TAGGTTTGCG GCCTGGTGAA TATGTTGCCG CCCATGTCCG AGCCCTCTAT GTGCAAAATA CTGTCAAGAA TCGAGAAGAG ATCAACGCAC TGAACTGTGC TTCGCAATTG GGACCACGAG CGACGATCTT TTTCGCCTCA GATTCCGCTG AAACAACCCG ACTTGCACTT CAGTATGGCA GGGGAAAAGA AGCCACCATC GTAGCGCGTA TCGGCGAGAG TGAGCCACTT CACCTCGACC GTGGGCACGT CTTTTTGGAG CAGCATGGGG TAGTTGCCGG TGAACATGAG CCTCAAGACT TTTACGATAC ATTTGTGGAT CTTTATATTC TAGCCGAGAG TCGCTGTATA ACTTACGGGG CAGGAGGGTT CGGAAGCTGG GCGAGCCTCA TTTCCAGAAA CTCACTGTGC TCTATTCGAC ATCGCACGAC TAATTGCGTT TGGTTTGACG ATCCCATACT GGGATCCCCA TCTTTATCAG CTATTCGACC CTGA
|
Protein sequence | MSYTKLLGRT IGTLIFLVSM GQIYNHERFN RTSDLSIHAA RLPFLNDSYD KGQRNISQVI PYTEETADFE KSSNNSGHRQ KSAPPLPSWV TEYFHWHADE RQKMTPESWE ERRYLVLRCL ESDKRCGGTA DRLHNLPVLL RLAQKSQRIL FIHWEKPAPL EDFLLPQPPE TSELRLDWRL PSWLKEPMQL GQIPISRLMN GTDGTGIIDS DERVVAMRSL HGSLFYDELK GPDEPSYNNI LKLLWYAVFV PAPIVRVRVQ LQLARLGLTP GEYASAHIRS LYIGDETHVD TLYVHAVTCA AQATNNASFP IFVTSDSPLV EEQAVVFGSA VHHLRIVTHN RSETLHLDRG RDFLTQGRSD SWKSIDMDGF YDVFVDLYLV ANSRCVSFGV GGFGRLGALL SADPSCAYKY APATRSDQCT VPKPIRNISA AKAVFTSSSL FSSPSILPIY NSTTIHSWNN TKMIPKWMKQ YFCWHQDARR LLQNGEKSAS DYKYLVLRCL TKNKKCSGAA DRLKSIPTAI RMAYDSQRLL FLKWERPCAL EHFLVPPRGG LDWRIPSTLE LDFEEKFSWR DKAIVLTESN NVEKALKSEE VIVSLKSVRD RKYFEEQREP GDYSFEEVYR EVWSSVFEPS PPVARLVSTV MEELGLRPGE YVAAHVRALY VQNTVKNREE INALNCASQL GPRATIFFAS DSAETTRLAL QYGRGKEATI VARIGESEPL HLDRGHVFLE QHGVVAGEHE PQDFYDTFVD LYILAESRCI TYGAGGFGSW ASLISRNSLC SIRHRTTNCV WFDDPILGSP SLSAIRP
|
| |