Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37043 |
Symbol | |
ID | 7202213 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 29004 |
End bp | 31002 |
Gene Length | 1999 bp |
Protein Length | 643 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181288 |
Protein GI | 219121886 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACACTG AAGAGAATGA GGAACCGCTC TGGGACCCCA TTCAACAAAT TTATACCGGA GGAGTACTAC CCCAAACAGT CGAAATCCAG GATTTGATTG CGGAGAACAA CGGCACCCTC CGACTGTTCG GCTACGGGTC TCTGTGTTGG AACCCCGGCA CCGGAGCCCT TGCCGATCCG TCAGTGCGAT ACGCGCCAGG CCAGGCACGA GGATACCGAC GCTGTTGGGC ACAAAAGTCG ACCGATCATC GCGGCCTTCC TTGCTTCCCT GGAATTGTGT GTACTTTGCT GAAAGACCAA GAATTTCGAG AATTTCTCTC GTCTGGCGTA GACGAGGAAA CGTTGACGGA AGGACTAATT TTCGAAGTCC CTCCGCCTTT AGTTGAAGAA TGCTTAGCGG AGCTAGATTT TCGAGAAAAA GGTGGCTACG CCAGAGACAT AATAGAAGTT GTCGAAGACA AGAGTGGTAA AGTGGTTCAG GCTTTACTGT ATAGAGGCAC CCCACAGAAT CCGGCGTTTT GGCCAAGAGC ATTGCGAGAT CTTCCGTTTG CAGCAGGCGA GTCCCTTCAG AGTGGCACTA CGTGCCAGTC AAAGCATTGA TACATCTCGC TCACTATCAT AATTGTCTTT TTTTTTAGCA ATCATGGCTA CGGCGATCGG TCCGAGTGGT GAGAATGAAG TTTATTTGAG CCGTTTGGAC CACTTTCTAG GAAAAGTAGC TTCAGGTTCC ACACTGAAGA AATACGACGA TACACTGGTG CTTGCATCTA TGACGAAACA GTTACGCAAT CAAAATTTGC ATTTCATGTT TGGCTCGGGG TCTAATCAGC GAAACCAGCT TTTACTGCAA ACAGAAAATA ATGCGGCCGC TCTATTTAAC AACGAAGATG CCCATGAAAT GAAAGAAATC GTTTTGTGCG CTGACCAAAC CGACAAAATC GAAGTGAATG AAAAAGTTAT TTCTTTATTT GCAGGAGGAG GACACAGTGC GATCCTTATG CAAAGCGGAA GGCTATTCCT ATTCGGGTCA AACGAACATA GTCAGTTGGG AGCAAGTGGA ATCACACAAT CCTCCTTTCC GCTCCCTATC CTTACTTGTC TCCGTGATTT GTTTATTTCC TATTGCTCGC TTGGTTTTTC TCATTCATTG GTGGTCGAAA AAGAGACAGG TCGTGTATAT TCCTTCGGGG ACAACGCAAG AGGCCAAGCT GACCCTGATA ACTGCGCCTC CACCATTCCA TTGCCAACTG CTCTACCGCT CAAGGAACAT ATTGTGGCCG TGTTCGCTGG AGTTTTCCAT TCTGCTGCCG TGAGCGAAGA TGGCGAACTC ATAACCTGGG GCTGTGGTCG ATTTGGACAG TGCCTTCCTG TTGTACGGCA TAGATTGTAC GGGCACTGGA AACCAGATGA CGGAAGCAGG GTGCTTGGTG TTGCTTGTGG ACGTCGCCAC ACAGTTACGT TTGACGATCG TGGGCGAGTG TGGAGTTTTG GTGAAAATAA ATACGGCCAG CTTGGACGCG ATCTTAAAGG TGAAAAATAC AGTAGGGTAC CATCGCTGGT GGATGGCGAT TGGGGGCTCG ACAGCTTGTC AGTCACCGGA GTGCACTGCG GCTGGTCTCA CACTATCCTT CAATTGGAAA ATGGCAAGGG AGAACTAATA TTATTTGGCT GGGGAAGGAA TGACAAAGGC CAGCTTGGGG TTGGCACAAG CAGCATCGTC TTTAATCCTG TACGGTTGTA TCCTTCGCAT AAAATTAGAC TTGTCGCTTG CGGATCCGAG TCTACTGCAA TCGTCGACAC TGACGGCGAA ATATGGAGCT GCGGTTGGAA TGAGCACGGA AACCTGGGTT TAGGACATGA CTTTGATGCA TTCGAACTAA CCAAAATCAA AGGGGCTCCG ATCACTTTGA CTCCAGGCTA TTCAGAAAAG AGTAGTTTAG GACTCGCCTT GGGTGGAGCT CATATGATTG CTATGCGACT CGCTAAAAAG ACAAGCTGA
|
Protein sequence | MDTEENEEPL WDPIQQIYTG GVLPQTVEIQ DLIAENNGTL RLFGYGSLCW NPGTGALADP SVRYAPGQAR GYRRCWAQKS TDHRGLPCFP GIVCTLLKDQ EFREFLSSGV DEETLTEGLI FEVPPPLVEE CLAELDFREK GGYARDIIEV VEDKSGKVVQ ALLYRGTPQN PAFWPRALRD LPFAAGESLQ TIMATAIGPS GENEVYLSRL DHFLGKVASG STLKKYDDTL VLASMTKQLR NQNLHFMFGS GSNQRNQLLL QTENNAAALF NNEDAHEMKE IVLCADQTDK IEVNEKVISL FAGGGHSAIL MQSGRLFLFG SNEHSQLGAS GITQSSFPLP ILTCLRDLFI SYCSLGFSHS LVVEKETGRV YSFGDNARGQ ADPDNCASTI PLPTALPLKE HIVAVFAGVF HSAAVSEDGE LITWGCGRFG QCLPVVRHRL YGHWKPDDGS RVLGVACGRR HTVTFDDRGR VWSFGENKYG QLGRDLKGEK YSRVPSLVDG DWGLDSLSVT GVHCGWSHTI LQLENGKGEL ILFGWGRNDK GQLGVGTSSI VFNPVRLYPS HKIRLVACGS ESTAIVDTDG EIWSCGWNEH GNLGLGHDFD AFELTKIKGA PITLTPGYSE KSSLGLALGG AHMIAMRLAK KTS
|
| |