Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_48672 |
Symbol | |
ID | 7194861 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | - |
Start bp | 551027 |
End bp | 554924 |
Gene Length | 3898 bp |
Protein Length | 1123 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183242 |
Protein GI | 219125971 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATGCT GCACATCCCA ATCCGCAAAT AGAAAAAGAG TATCGACGTG TTGGCTGTCG CCTTTCAAGT CGTCGCTTTT ACTTTTGATA TGGGCTTTGA CTGTATGTCT TTCGCTCGGC TTTTCCTCCC GTCCGTTACT GTACAGTCCG CGTTTTTTCG CTTCTTCTAT TGTGGTGCAA AAAGAACAGC GCATTACATC AACAAAACAG GCTTCAGACG ACGGTATCGA CGAAACCGCC AAATCTCTTT CTAAACCAGA TTCAGCTTCG AAAAGAGTAT CAGCCACAAC AGAAAATAAG CTTATTAGAA ATGCTCCCAA TCGACCTTCG AATGGAGCAT CGAAAAGACC CCTTCGGCGA AACACCCCTG CTGGTCGTCG TGGCAGTCCG GGCAGCAGTA TCTTGCAAAA CTCGAAACGA CTGAATCAAC TTCTAGTAGC CTGCGAGAGC GCTTCCGAGG TTTTGACACT ATTGCAAAAT ACAAAAGGTT CCTTGACACA AAAGGCCAGC GGTGGTACAA TGAACAGTGT AAATTTTTCC ACTTCGATCC ATCGTCTTTG TCGACATTCG CTTAACCAAC GCGATACCCG TGCAGCAACG CTAGCCGACC CCCGGTTCGC CTTGTTGCTA GCGTCGACGG CCGAAGCCAT GGTAACTATG CCATTCCAAT CACGTGAATT GTCGAACATT GGTTGGGCCT TGGCGAAACT GAAGATTGTA CCTCCATTGA CGGCCATGCC TTTTGAACAA TCCGACGACG AGGCCCTTAA AGCGGCCGCT CAAACAGTCC GTGACGGCGT TTTCAAAGCA GCCAAAGAGC GGCAAGAATC GGGAACACCC TCCAAGGCAT GGATTACTGC CCTTTCACAA CTGGCGGGTC AAATTTTGGA TCGCATATCG CAAAATGTGG TCTCGACACA AACCGACGGC TTTCGACTCC AAGAATGGGC AAATTTGATG TGGGCTTGGG CCACAGCAGA ACGAGCTGAT CCGGTAGCCT TTGGAGTGGC TGTGGACAAG ATGATTGATC AACAGCAAGA GGCGGATCGG ACGGGTGAAC CTAATCTTCG ACCGCAGGAA TGGACCAACT CTGTTTGGGC GTTTGCCACA GCACAGGTTT ACGGAAAACA CGAGAAGTTG TTGATATTCG TCGCGGAACT TATGGAACGA GAGTACGCAT TTGTGCAGAT GTTCAAACCT CAAGAATTAA GCAATACCGT TTGGGGAGTG GCAACCTTGC TCTCGAATAA GGAAGGAGCA TTAACAGATG CGGAACAAGA AGCGGCACTA AGCATTGTTC GAATAGTATC GAAAGCTTTG CTGAAGCGAT CAAACGAGTT CAAAACTCAA GAACTTAGCA ACACTTTATG GGCCTTTGCC ACTCTGGGCT TCGGCTTGAA GTCATCGGGA GAGCAGTCAT TGAACAACTA TGTCGTTTTA GCAAGCAATC AATTCGAAGA AGACAGAGAG CTCATGCAAC AGGCTGTTGA AGCTGTAGTA CTGGCAGCCT ACCCTCAACT CGACCGATTT CGCTCTCAGG AGCTGAACAA TCTTGCTTGG GCTCTCGCTC GTTTAGTGGA TCACAAATCG GCTCTTGTCG AAAATATTTT GAGAGGTATA GGAATGCAGC TCTGCGATCG AAAGCGATTT GTGACACCGC AAGATATTGG TGCCACTATC TGGAGCTTGG CTACTTTAGA ATTTTTTGAT GAAGAGATCT ATCGAGGCAT TGCATTTCGT CTCACTCCTG ACAAAGGGGG CAGTTGTAAA CCTCAAGAGT TGTCAAACAT AGTGTGGGCG ATTGCGACTG CCGAGGTCCA AGTGAAAGAT CGGGACGCTT TTGACACGAC GTTGGTTCCA GAATCGAAGC GCCAGCCCGT GCGTGACCCA ATAACCCGCA GTTTCGCCAT TGCGGCAACG GAGCTCATGC GAAGGCCTTC TCAATTTAAA TCTCAAGAAA TTAAAGACAT TTTGTGGGCA TTTTCAAAAA TTGGTATTCG CCACCCGAGC TTGTTCAAAA GTGTTTCCGA GCATCTTGTG GGGATAATCG GACCAGGAAA GCCTCGTGGG TTGACTGAAT TCTCACCTCA GGGGTTAGGA AATACAGCAT GGGCATTCGC AAGACAAGCG CAACTTAGTG AAGAAGCAGC CAATCGCCTT GGTGGTGCTT CACTGTTGCC TTCGAGCAAC GGTCGCCTTG CAATTTACAC AGCTTGCTAT TTTGATATTG GCGAGGAACT GATTCACCGA TTGTTTGCAG CTATCGCTGA GGCAGGCATC ACTAAGCATG TCAATTTGAC TAGTTTTAAA CCCCAAGATT TGTCGAACAC AGCATGGACA TTTGCAGTGC TTGGTTTACG ACATACAGCT TTTATGGAAG TCGCAATGCA CGAACTTGAG CGGAGATTAT CCCTGTTTCT AAAGGGAGAG CGGACGTCCA TTACGACCTT TAAAGGCCAA GAATTGGCAA ACTTACTGTG GGCGCTAGCA ACGCTGAACA TTCGAGTCGA AAACTCTCTT GAGATAGTAA CTCCGTATCT TCAAGAGGTT TGCTTTGAAG GCAGGACTGG AATGCCAGTA CAAGCGATAG CCCAAATTTT CAAACGCCAA GAACTTGCCA ATGTAGCTTG GAGCTGTGCT GTCTTTGGCA AGTATCCAAC GGCTTTAATG CAACTGTTAT ATGCTGGCTT GATTGGACTT GATAAAGAGT GTGATGCCGA GAAATTGTCA AACGTGTACG GAGACAAAGG TCTGCAATCG CAGGCATTGA TGAGTTTGAT CTATGTTCAG GCTTCTATGG ATCGCGCCGG CAAAAGTACG CTGGGGCTTC CGCCAAACTT TCCTGACGCC TGGCGACAGT CTACTCCCTC CGAGGATGGT CAACGCATGA CAGAAACGAA CATAGAACTT TCTCTGAGTA CAAGCAAAAT CCAAAGAGAC GTTTCCGCTG CTTTCAATCG CATCGGATTC AAGCACATAG AAGAGCACAC TATTTCCATG CAGGAAATGG TAGTCGAATA TGGGGTAAAT TTTGCTCCAC AACAACTTGA CATTTTGTCA ATTGATATTG CGAATGTACC AGAAAAGATT GCTATTGAAG TTGATGGACC TGCCCATTTC ATCAACCTTA TCGACAACGT TGACGAAAAC GACTACGGTT CTACGAAGGC GCCCAATGGG AAACTAGAGT ACCAGTTTCA ATGGACCGGT GACCGCCAGA TGATGAATGG CTCTACAAGC CTCAAGCATC GCCTTCTCGA ATCGCTCGGC TGGAGAGTAA TACATATTCC GTTTTGGGAA TGGTACCAAA TGGGGAGTGA CGAGGAGCAA GGCGAGTACT GTCGAGACGC TCTCGATACC CTTGGAGAAT AGCATGCGCC GACGAGCAAC GCGATTTCGC TGTATTCCTT AGTACCGCTA ATATATTCCA TAGACAACCA ATAGCGACTG TTCTATAGAG TGTAGAAAGG AGATGTAAAC CAGTCTTTTA GAGATCAAAC ACAGAAGCGC TGCATCTCTC TCCGCTACTT ATCAGCTTTA CGTTTCAAAA ACACCAACGC GGACGAACGG AGAATGATAA AAACTGCCAA TAAGGCCAAC CAGTACCACC AAACGTCATC CAGGTCAACG CTGGCATTGT CTAACACGCT GTCGCAATTC TGGTGGGCCT GATCCGAACC ACAGTCACGG TCGAATTCGC CAGCCAATTC CAATTTCACA GCGTACGTCA AAGGCATGAC GTATTGAAGC CAACGCAGCC AAACTTGAAT AAGTGACTGC GCAATGAAGA AACCCGAAAA CAAAATTTGT GGCACAAAAG TCATTGGAAG AAATTCAACA GCCAGTTTTG GATCTTCGAC GCTGGAACCA AGAAGCATTG ACATGGCAGT ACCGGACA
|
Protein sequence | MKCCTSQSAN RKRVSTCWLS PFKSSLLLLI WALTVCLSLG FSSRPLLYSP RFFASSIVVQ KEQRITSTKQ ASDDGIDETA KSLSKPDSAS KRVSATTENK LIRNAPNRPS NGASKRPLRR NTPAGRRGSP GSSILQNSKR LNQLLVACES ASEVLTLLQN TKGSLTQKAS GGTMNSVNFS TSIHRLCRHS LNQRDTRAAT LADPRFALLL ASTAEAMVTM PFQSRELSNI GWALAKLKIV PPLTAMPFEQ SDDEALKAAA QTVRDGVFKA AKERQESGTP SKAWITALSQ LAGQILDRIS QNVVSTQTDG FRLQEWANLM WAWATAERAD PVAFGVAVDK MIDQQQEADR TGEPNLRPQE WTNSVWAFAT AQVYGKHEKL LIFVAELMER EYAFVQMFKP QELSNTVWGV ATLLSNKEGA LTDAEQEAAL SIVRIVSKAL LKRSNEFKTQ ELSNTLWAFA TLGFGLKSSG EQSLNNYVVL ASNQFEEDRE LMQQAVEAVV LAAYPQLDRF RSQELNNLAW ALARLVDHKS ALVENILRGI GMQLCDRKRF VTPQDIGATI WSLATLEFFD EEIYRGIAFR LTPDKGGSCK PQELSNIVWA IATAEVQVKD RDAFDTTLVP ESKRQPVRDP ITRSFAIAAT ELMRRPSQFK SQEIKDILWA FSKIGIRHPS LFKSVSEHLV GIIGPGKPRG LTEFSPQGLG NTAWAFARQA QLSEEAANRL GGASLLPSSN GRLAIYTACY FDIGEELIHR LFAAIAEAGI TKHVNLTSFK PQDLSNTAWT FAVLGLRHTA FMEVAMHELE RRLSLFLKGE RTSITTFKGQ ELANLLWALA TLNIRVENSL EIVTPYLQEV CFEGRTGMPV QAIAQIFKRQ ELANVAWSCA VFGKYPTALM QLLYAGLIGL DKECDAEKLS NVYGDKGLQS QALMSLIYVQ ASMDRAGKST LGLPPNFPDA WRQSTPSEDG QRMTETNIEL SLSTSKIQRD VSAAFNRIGF KHIEEHTISM QEMVVEYGVN FAPQQLDILS IDIANVPEKI AIEVDGPAHF INLIDNVDEN DYGSTKAPNG KLEYQFQWTG DRQMMNGSTS LKHRLLESLG WRVIHIPFWE WYQMGSDEEQ GEYCRDALDT LGE
|
| |