Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_33648 |
Symbol | |
ID | 7197936 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 12638 |
End bp | 14491 |
Gene Length | 1854 bp |
Protein Length | 595 aa |
Translation table | |
GC content | 46% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178150 |
Protein GI | 219114709 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAGCATG CCTTGGTTAC TCGAACGCAT GGTCGATATT GGACCGTACG ACGATTGTCG GGTACATCAG TACTTACATT CCTGTTGCTG ATAGCAGTCG GTCTGAACAT AGGCCGGGTG CGTCAATTGA TTATTTACGA GGACCTGAAT GTATTGGCGT ACTTGGATAT CCCCCTTCAG GTTGAGACCA CTGAAGATGT AAGTGTAAAA GGGAGCAGTG GAAACACTTT AAATCCTAAT TACTGTTGGC TCTGATTTAC AAAGCATTTT TTATTTCAGC GAAACGCGGA AGAGCTGTCC ATGGACGAAT ACCCAGTCAA ATTCAATACA TTGAAAAGCA CAGATCGGCT GCAAAATGTC ACACCATCAC CCTTGTTGAC AAAGACAGAT GGTATCCTTT CCACGGAAAA GGGGTCCGTT GAAGCGGCTA TTGCGACAAA GGATGGTGCG TCGACCACAG TCGCTTTGTC TCGACCAACC GACCGTACGC TTCGTTTTCC TACAGTAGCT GAGCGGGTTC GGTTTTACAT GTCGTCTTGG TACGAACCAC CCTGCAGCGA AGTGGACCTT TTACAAGTTG TGCAGCATAC TGGTAGTAAG CTAAACAGCG AAATGGGGGA TACGACGATT GTCACGAACG GAGGCAACCC GGTCATTGAA TTTGGAGGCC ACGAGAAACG AGTATATCCT TCTTTTTCAC TGCAGCGTCG AATGCAGTCC ACGAACAACT CCCTCTCGGT GGTCTTGAGA GCAAGGGCAA TGGCTGGGGG AAATGTTATC TTTGCTTTAG ACAAGACGTC GCTTGATGTA TGCCAATTTG GTCCCCAACC AAAGAAAATT TTATGGCAGG TATACTGTCC AGAGCTCCGG GACAAGTTGT TGCTACCGTA TTTAGCTGCA AACCAAACCA AGTTGGCCGG CAAAGCAAAG AGTGGAGACG AAAAGACGCT CATACTCGCC CAAGTTGGTG ATGCTTTGGC TACCAGAGTT TTAAATGACT TAGGAGAAAT CAGGCATCAA TCGCCCAAAC CCTCTGTTCC TCATTTCAAA AAGGTACGAT CTGCCTGGGA AGACAAGGAT GGAAGAGAAT CCTTGTTGAA TGCATCACCA GTGGCTTGTT CCACGTTACA GAGCCGACGA ACCAACCACC AAAACTTGGA ACCAATTATT TGGAAAATGG AAATTGACCG GCATTACGGT GCAGTCCACC AAATCACAGA AGTTGATATT CCTTGGGATC AAAAACGGAA TGTAGGCGTT TTTCGCGGGG CCACCACCGG GAATGTGAAC CAACGTTTAC CAATGCGCGA GCGCTGCCTG GAAAACCAAC GTTGTCAACT TGTACTCATG TACCATAATT CATCCTTTGT GGATGCCAAG TTTACCAATG TCCTAAAGCA AAGCAAACTA CCAACCGAAT TTGATGGCAT AACAATGACC GGCAGTCGCT TTCAACTGGA TAAACTTTTG GAATTCAAGG TGTTGATATT TTTAGAAGGA AATGACGTTT CTTCTGGGCT CAAGTGGGGG TTGTACTCCA ATTCGGTTGT ACTGATAAAC AAGCCCTCCG TATCGTCATG GGCAATGGAA GAACTGTTAG AGCCGTGGGT GCATTATGTG CCTCTGAAAG ATGATCTTAC GGACGCAGAA ACCCAAATAA AGTGGGTCAT CGAGCACGAT AGAGAGGCAA AGGAAATTGC GATTCGGGGA CAGCTTTGGA TTCACGACCT TTTGTTTGAC AAGCATTCAG AAAGAGACAA CGCTGCAATC AACCAGGAAA TACTTAGCCG ATACGAGGCA CATTTTCGAC CAGGGATTGA ACAAGAGAAT GGGCAACCAA ACTTGGAGAA TTAG
|
Protein sequence | MEHALVTRTH GRYWTVRRLS GTSVLTFLLL IAVGLNIGRV RQLIIYEDLN VLAYLDIPLQ VETTEDHFLF QRNAEELSMD EYPVKFNTLK STDRLQNVTP SPLLTKTDGI LSTEKGSVEA AIATKDGAST TVALSRPTDR TLRFPTVAER VRFYMSSWYE PPCSEVDLLQ VVQHTGSKLN SEMGDTTIVT NGGNPVIEFG GHEKRVYPSF SLQRRMQSTN NSLSVVLRAR AMAGGNVIFA LDKTSLDVCQ FGPQPKKILW QVYCPELRDK LLLPYLAANQ TKLAGKAKSG DEKTLILAQV GDALATRVLN DLGEIRHQSP KPSVPHFKKV RSAWEDKDGR ESLLNASPVA CSTLQSRRTN HQNLEPIIWK MEIDRHYGAV HQITEVDIPW DQKRNVGVFR GATTGNVNQR LPMRERCLEN QRCQLVLMYH NSSFVDAKFT NVLKQSKLPT EFDGITMTGS RFQLDKLLEF KVLIFLEGND VSSGLKWGLY SNSVVLINKP SVSSWAMEEL LEPWVHYVPL KDDLTDAETQ IKWVIEHDRE AKEIAIRGQL WIHDLLFDKH SERDNAAINQ EILSRYEAHF RPGIEQENGQ PNLEN
|
| |