Gene Information |
Locus tag | PHATRDRAFT_42494 |
Symbol | |
ID | 7196054 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 211490 |
End bp | 214274 |
Gene Length | 2785 bp |
Protein Length | 813 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176546 |
Protein GI | 219109583 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATCAACGAGA AGCGCAGAGA CGATCCGTTT GTCGCAGAGG ATCGGTATCG AATCGAACCT AGACGGGTAG TTATTGTGCA CAAAACGTAC GCCCACCTCA CCCGTTGCTC GACTATCACA AGCTTACAGT TAGTTCCTTC ATCCGTTTAT CGTCCACCAA TAGATACGAT TCGCAATCGT AACAAAACAC TCTGCAAAAA TCCCATTGGT AGCACAGCTG TGATTTGGAA ACCTAAAGAA TGAAGATTTC GCAATCCGCA CTCAGTGTGT TGCTGTTGTG CACTTCGGCG GCGGCCTTCG TGCCGTCACG TCCGACGCTA TCCCGCATTT CCACGACGTC TACGATCGCC AACGCGGCGA CGAAGCAGGA CGCCGATATT GCGGTACCGT ACGACGCTGC AGCGCGTTTG GCCTACGACG AATGGCGCAA CAAATACAGT AAGGGAGATT TCGACGCTAC ACGATACGAA TTTTTCAAAA ACAACTACGA AACCATTACT ATTGCCAACG TTGTAGCCAA AAAGGAAGCC CGGGAGCAGA AGAAGGACGC CGTCCCACTC ATGACACTCA ACGAGTTTGG GGACTTTTCC GAAGAGGAAT TTAAAGCTCG CCAAAGTGGA GACAAACCAG CCGCCTCCGA TACGAAGACT ATATCGACGG GTGACGTGCT CGGCAAGGCC GTAGCTGCAG CGGAAAAGCA ATCGGAAGCC AGTGGAGCGC TGAAGGAAGC AGCCGACGCG TTGGCGGAAG ACGAAGAGGT GTGTGGTGAA TTGATTTCTT GTCTTCGTCC TCCTGTATAT TTTGTAATGC ACAGTCAATC GTTTTGTCCA ACATGAACCA TCCCATTCTC ACCCAATCTT AGGCTCTGGC ACAGAAGCTT GGACTGCAGT CGGTGGAGGA GTTGGAAGTG GCGATCGATA GTCTCGAAGG CATTGCTGCC GACGGTGGGG AGCTGGATAA GGAAAATCTG TCTCGCGAGG CCCGTGTACG AGCCGCCTAC TTGAACTGGT GTAAGGAATA CAAAAAGGAA CCAGACGAGG ATCGCTTTCT CGCCTTTTCC GACAATTTTC TCGTCATGGA AAAATTTGCA CAAGACACGG GTAAGGAAAT GGCACTTAAC GAGTACGCGG ACTTGACTGA AACAGAGTAC AAGGAAAAAT TTGCCACGAA GAGCAAACCT AAAAAGGAGA CCACGCCCGC CGTTAGTAAG AACGCTACTC CTTCCAGCAA AGTCAAAGTA GCGGAGGTTG TGAAAAATCT GTTCACCCCT GCCGTAGAGA CTCCGGCAGA GAAGGTGGAA CGGGAAACAC GCGCCAAGGC GATTGCCGAG CAGCGCAAAG CCCAGGAAGC TGCTGATGCG ACTCGTCGTC AGGACATTGA GAAGGAGCGT GCCGAGCGCC AAAAGCAGCA AGACCAGGAA CTTGCAAAAG CGAAGACAAC GACGGAGCAA GCTGTCATTG CAGCGGCGAA GGAAGAAGCG GAACGGGAAG CTCAGAAGAA CGCGGCACGT CGAAAGATTG AACAGCAAGC GGCGGAGCAA GCTCGACAAA AGGCACGGGA ATGGGACGAA AAACAAAAGA AGATGTCCGT CTCCTCCACG CAGTCGCTGA AGAAGACCGA GCCCAAGCCT GCACCCACTT TGGATTTGAG CTCTTTCTTT CCCACTCCGG CGCCGCAAAA GGAGGCAGCC CCTGATCCAG CAAAGGCCAA AGTAGAAGCC CCGAAGAAAA AGGCGCCCTC TACTCCCAGT TTTGGAAGCT TCTTCTCACC TCCATCCCCG AAAACGGTCC CCGCACCCAA 
AGCGGCTCCC GCTCCAGTGC CTACGCCTGT GAAAACTAAA AAACCTACCG TGAAATCTTC ACCAACATTC AACTTAGGTT CTTTCTTTGC GCCGTCCCCG GCGCCGAAAC AGAAGGAAGC TCCTCAGCCC GCCCCGATTG TGGAATCGGA AGAAGTTGAT GTTGCCGACC CTGTTACGGA GGCTTTCAAC TCTTTTTTTG GATCGGGAAA AAAGAAGACA GAGCCAAAGC CCTCTCCGGC TCCAGCCCCC GTCAAGAAAG CTCCTGTCAA GTCGAACCCA ACTTTTTCAT TTTTTTCTCC TCCCACACCA GCTCCCGCAC CGAAGCCAGC CCCAGCACCT GTACCGAGTA AAGCAAATCC AACATTTTCT TTCTTTTCCC CTCCTAAGCC GGCTCCCACC AAGAAAGAGG AGCCCAAGGT ACCCAACCGA CCCGGTACAC TGAGCTTATT CGATTTTTCT CCCAAGCCTG GCCCCGAGAA GGAGGATAAA CCTAAGATGT CCAATCGACC GGGTACTCTG AGTTTGTTTG GATCTGGCAC TCCCAATAAG GACCCTAAGC CGGAGAAAAG GGTACAACCA AAGGTCCCCA ACCGCAAAGG GACCATCAGT TTGTTTGGCG GTAACGCCAA AACGCCGACC CCCGCTCCGC AAAAAAAGAA GGAACCGGTT AAGAGACCAA CTCTTTCGCT ATTCGGTTCC CCCAAGAAGA AAGAACCGGA AATCGCTCCG GAGCCGGTCA AACAAAAACA GGCCTTTTCC TTTTTCGGTG GAGGCGGTGG TTCCAAATCT GCAGCAGCAC CCAAGGATAA GATTCCTGTG TTGAGTCGTT GGAAGCAAAA TCCCAACGGT TCCATTACTG GTATCATCAG CAATTCTCCC AATTTCCGCA GCGGGACGGA GATTACTACA TCCCCCGTCA AACGGGGAGC CAAGGCAGGG AGCGTAATCA AGACGGGGTC CGGTTCGCAG TATCGACTTT CGTAA
|
Protein sequence | MKISQSALSV LLLCTSAAAF VPSRPTLSRI STTSTIANAA TKQDADIAVP YDAAARLAYD EWRNKYSKGD FDATRYEFFK NNYETITIAN VVAKKEAREQ KKDAVPLMTL NEFGDFSEEE FKARQSGDKP AASDTKTIST GDVLGKAVAA AEKQSEASGA LKEAADALAE DEEALAQKLG LQSVEELEVA IDSLEGIAAD GGELDKENLS REARVRAAYL NWCKEYKKEP DEDRFLAFSD NFLVMEKFAQ DTGKEMALNE YADLTETEYK EKFATKSKPK KETTPAVSKN ATPSSKVKVA EVVKNLFTPA VETPAEKVER ETRAKAIAEQ RKAQEAADAT RRQDIEKERA ERQKQQDQEL AKAKTTTEQA VIAAAKEEAE REAQKNAARR KIEQQAAEQA RQKAREWDEK QKKMSVSSTQ SLKKTEPKPA PTLDLSSFFP TPAPQKEAAP DPAKAKVEAP KKKAPSTPSF GSFFSPPSPK TVPAPKAAPA PVPTPVKTKK PTVKSSPTFN LGSFFAPSPA PKQKEAPQPA PIVESEEVDV ADPVTEAFNS FFGSGKKKTE PKPSPAPAPV KKAPVKSNPT FSFFSPPTPA PAPKPAPAPV PSKANPTFSF FSPPKPAPTK KEEPKVPNRP GTLSLFDFSP KPGPEKEDKP KMSNRPGTLS LFGSGTPNKD PKPEKRVQPK VPNRKGTISL FGGNAKTPTP APQKKKEPVK RPTLSLFGSP KKKEPEIAPE PVKQKQAFSF FGGGGGSKSA AAPKDKIPVL SRWKQNPNGS ITGIISNSPN FRSGTEITTS PVKRGAKAGS VIKTGSGSQY RLS
|