Gene Information |
Locus tag | PHATRDRAFT_50430 |
Symbol | |
ID | 7199247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011698 |
Strand | - |
Start bp | 34323 |
End bp | 37294 |
Gene Length | 2972 bp |
Protein Length | 917 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185419 |
Protein GI | 219130536 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
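The length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end coordinates are 1-based and inclusive (an assumption, though it reproduces the listed 2972 bp) and that the protein length excludes the stop codon. The helper names `span_length` and `coding_length` are illustrative only and are not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a span given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(34323, 37294)   # Start bp / End bp from the record
cds_len = coding_length(917)           # Protein Length from the record
print(gene_len, cds_len, gene_len - cds_len)  # prints: 2972 2754 218
```

Under these assumptions the gene span is 218 bp longer than the 917 codons plus stop would require, which would be consistent with the listed gene sequence including some sequence outside the translated region.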
Plasmid Coverage Information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCAAAAGCAA CACCACCAAG CGTACCTGTC AACTTTCGTC CTCCACACCG TGCTTTTCTT GTCATTGATT TGTTCTGTGA CTCTAGCGAT TGACACCGAG AAGCCGTTCT TTCGCAAAGC ATCCCCGGAC AGCACGAGGA CGATTGCTGG ATCACTTTCG CGCCCACCAT CCTTCCGTTT ACGAGAACAG ACGCATCCCC ACACCATACG CCAGTGTCAT GTCCTCCTCG TTGACTCCCG CTTCACTCCG ATCCGACGCC TCAACGGAGT CGGGAACAGA CGAAGACACA CAGGCATACG ACAAGCTGGC TCTCGACGAT TGTGTGAACA GTCCCGTCCC CTTGAATCCG GACGAATCAC AAAACGCCGA AATCCGAGCG CTTGCGGCCA AACTCGCCCC TGATTTTGAA CAAGCGGACG CTCTCCTGCA GGTTCCATTC GAATTTCAGT CGCATCCCTC CTTCTCTGTT TTGGAAGCAG CCACTTCGGA AGACGATGAC ATGGATCGGG AAATGGAGAG TCTGCAATTC TCGGAAGCAC TCTTGCGACA AGAATTGGAA CTCGCCCAGG ACTTTTCCAC CTTATTTCAA ACAGTTTCGC ACAATATGTC AAACACTAGC GACGCAGTCG ACAAATTCAA TTTATCGGGA ACACCCTTTT GCGAGGGGAT GAAACGACAG CTCTTTGATC CGCCGGATCA CCAAAGTGCC GACGGTTCAC CTGCCCTCGC TCTGCAGTCG TCGTCCAGTC GAGAGGCCGA TATAGATCCC AGCGACGGTA ACAAGCAGGA AGAATGCAGC TTGGAAATTG ACGACGACGA TTGTCCAGAC AAAGTTCTGT CGCCAATCCG TATACCATCT CCAGTATCTT CGCCGAATGG GAAATCGACA GCCACGACTT CTTCGTCTCC CGAAGTTGAC CAAGGACCCA GGGATAGTTC CATACTGCTA ACGGCAGGTA CACCATCCCG TTCCGTAGCC AATCCTATTC AGCCCTACAC ACACAAGGAT CACTTTACCG CTTTGCGCCT TTCGTTGGAA GAACACGGCG GTTGGTATTC GGTCGACTTG ACTTCCTTCG TTGTACCACC CCGCGACACA ACTACCGGGA ATCTCATCGG TTTGAAAGAC TATTGTCTCG CCATTCCCGA ATCCAAACTT AAACATTTAT ATGTGGGTTT ACCAGATGCA GCCAAACCCG GACCTGATAC TACTGTTACT GGTTGTAGTC AATACACACT ACCCTTACCG GTTCGCACCT TGGCAATCCC AGTCCGACCT GACGTTCTCT GTGGAGCCAT AATGGATGCC GTACATCAGG TTCTGACTAG TACCGCCATA CATGCACGGA TTCTCAAACG ACAGGGCGGT CATTTACGAG GAATCATATC CGGGTGTGTC GTCCCACGAG ATCCGGATCG ACTTGTGGAA GAATCCTTTC ACAGTAAATC AAGTGCGGGC GAGTTGGAGG GTGTACCCAG TACATACCCA CCCTTTCTCA TCGATGCTCA ATTATGCACG GCCAAATCTG ATTCTTGCGA GCGAGTCTTG TTGCTACGCG TGTATCACTG TTGCAGCGAG TCTATTCAAC CCAGTGACGA TGTGGACAAT ATGAACACAT CTTGGGTCAC GGCACTGTCA CAGCAAAATC CGGCCGGCGA TCACGTATTA GGAACAAATC ACTTTGCGTT GGTAGAACAC CTTGATATTG AGGCATCCAC TCGTTTGCGC GAATGTTGTG CATTGGTGCA GCGGGTAGAA GCTCCCGAAT TGTCAAAAAG GATCCGCTCC CCGGGCAAAC 
GCTTCGACAA CCGTGAGTCC ATGCAGACTC TTGTCTCCAG TCATCTATTG GAGCATTACC GAGCGTGTCC GTCCGTCCGC GAAGGAAGCA TCACCTTACC GTCTCTGAAT TCGGACGACT GGCCAGTGAT TCAGTCTTCG TGGCGTTTCG TCCAAGCGAC GTGGGAAGAG CTAGAAACAC GGGACTTGAC TTACACGACA TTGACCACGG CGCGCTTTGG GGCTTTTCCG GCTTTGCCTA CCTTGGATGT ACACTACTGC TCGCAGATTC GACGGTTTTC ACGGGAAGTC ATGATAATGC AGCTACTGAA GAGCGCGAGC GAATTAGAAG AGTACGCGCG CGAGGCCGAG TACGCTTGTG CCAACATGAT TTCGCTACTA CAGCCAACAT TTGACGCATA CGGTATGGAA GCCCCATTGT TGCCCAAACC TGTGCCGCTC AACGAATATC CCTTGGACTT TACACCACCC CAACAGGCAT GTCCACCTTG GGGGCTGAGA GTGATGGAAG CTTTAAACGA GACTCAAGCT CTTACAAGTG ACGCCGGGCG AGACGAACCG ATTTTGTCGC CGACAACTCT GTATGCTATT GATGCGTCAG AGTCATTGGC CATGGCACGA CGTGCGGTCT CTCTTATCCT GAACGCATTT CAAATACAAG ACGACGAGGA GAAGGGTGCT CGGCTAGGCC GTAAGAACCT ACAAGTAATG GATAGGTTGG CCAAGATGCA AGCACATCAG CGCACTTTGA TTCAATCCTT ACAGAACGGT ATCGCATTGT CCGAGAAGGC AGCCAAAGCT GCAGATGATT TTCACAGGAA AGCGGGTGTC ATGGAAGTAC CTTTGTTAAA ATGGAATATT GTTGTTGGGG GGGCTTCCGG CACCTGCTCT GTAACGGCAA AACACCTTTT GTTTATCACT CAGCTCATTC CGGTGATTGG TGGCAGCCGG ACGGCCATCT TCCGGATAAG CGAAGTGGAC TTTGATGTGC AAGAATCAAC TCCCTCTATT CTTAATCCTT TACCAACAGT AGTAAGCGTG CGAAAAGACG GCCAGCAAAT ATACAGCTTT CGACCTTCAG CGGGTGGCAA GAGACTGAAG AGCGTTTTGG AAACAATCAA GGCAACGGCT CTGGACCAAG ATGCGCTTCC AGAATCACCC TCATCAGCAT AA
|
Protein sequence | MSSSLTPASL RSDASTESGT DEDTQAYDKL ALDDCVNSPV PLNPDESQNA EIRALAAKLA PDFEQADALL QVPFEFQSHP SFSVLEAATS EDDDMDREME SLQFSEALLR QELELAQDFS TLFQTVSHNM SNTSDAVDKF NLSGTPFCEG MKRQLFDPPD HQSADGSPAL ALQSSSSREA DIDPSDGNKQ EECSLEIDDD DCPDKVLSPI RIPSPVSSPN GKSTATTSSS PEVDQGPRDS SILLTAGTPS RSVANPIQPY THKDHFTALR LSLEEHGGWY SVDLTSFVVP PRDTTTGNLI GLKDYCLAIP ESKLKHLYVG LPDAAKPGPD TTVTGCSQYT LPLPVRTLAI PVRPDVLCGA IMDAVHQVLT STAIHARILK RQGGHLRGII SGCVVPRDPD RLVEESFHSK SSAGELEGVP STYPPFLIDA QLCTAKSDSC ERVLLLRVYH CCSESIQPSD DVDNMNTSWV TALSQQNPAG DHVLGTNHFA LVEHLDIEAS TRLRECCALV QRVEAPELSK RIRSPGKRFD NRESMQTLVS SHLLEHYRAC PSVREGSITL PSLNSDDWPV IQSSWRFVQA TWEELETRDL TYTTLTTARF GAFPALPTLD VHYCSQIRRF SREVMIMQLL KSASELEEYA REAEYACANM ISLLQPTFDA YGMEAPLLPK PVPLNEYPLD FTPPQQACPP WGLRVMEALN ETQALTSDAG RDEPILSPTT LYAIDASESL AMARRAVSLI LNAFQIQDDE EKGARLGRKN LQVMDRLAKM QAHQRTLIQS LQNGIALSEK AAKAADDFHR KAGVMEVPLL KWNIVVGGAS GTCSVTAKHL LFITQLIPVI GGSRTAIFRI SEVDFDVQES TPSILNPLPT VVSVRKDGQQ IYSFRPSAGG KRLKSVLETI KATALDQDAL PESPSSA
|
| |
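The sequence fields above are printed in space-separated blocks of ten characters. A minimal sketch of how such a field might be cleaned, scanned for GC content, and translated is given below. Because the Translation table field in this record is empty, the standard genetic code (NCBI table 1) is used here as an assumption; the helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed; the record leaves "Translation table" blank).
# Codons are enumerated in TCAG order, matching the amino-acid string below.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces/newlines between 10-character blocks and upper-case."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence (0.0 if empty)."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s) if s else 0.0


def translate(seq: str, start: int = 0) -> str:
    """Translate codon by codon from `start`, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(start, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Four consecutive blocks copied from the gene sequence field above.
fragment = "CCAGTGTCAT GTCCTCCTCG TTGACTCCCG CTTCACTCCG"
start = clean(fragment).find("ATG")
print(translate(fragment, start))  # prints: MSSSLTPASL
```

The printed peptide matches the first ten residues of the protein sequence field. Applying `gc_content` to the full gene sequence would give a value to compare against the listed GC content, and `translate` could be run on the full field from the same ATG to compare against the complete protein.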