Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37902 |
Symbol | |
ID | 7202835 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011682 |
Strand | - |
Start bp | 357337 |
End bp | 360325 |
Gene Length | 2989 bp |
Protein Length | 841 aa |
Translation table | |
GC content | 62% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002182054 |
Protein GI | 219123485 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0559986 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCCGA CCGCCGACTT CACCATTTCC GACTTTCCTC ACAAAGTCCT CGCTCCCATC GCCACCGACA CCACCGCTCC CTCGTATTCG TCGCTTCTCC TAGCCCAACG CCAGCTCTCC GCCAACGCGT CCGCCATTCC CAGCCTTAAC GGCGGCGGGG CCCATGGTCA CATGGCCCTC ACGCTCTCTG CCGAAGCGTA CGCCGAACTC TCCGACATCC CTTTTGTCAT CCCCGTTGCT CCCCCTGCCG ACCCTGAACC CGGCACCACG CAACCTCAAA TCACGGAGAA CAACCGACTC CACAAACGCG CTGTGGCCAT CCACAGCCTC TACGTGGCGG TCAACAACGC CCTTCGTCGC CAGATCCTCG ATGCCGTTCC TCGCGTCTAC GTTCGCGACC TAGAACACCC CCAGTTTGCC TACAGCCACG TTTCCTGTCG CGACCTCCTC GACCATCTCT GGCGCAACTT TGGTACCATC TCCGCTTCGG ACCTCAAAAC CAATATTCAG TCCATGTACA CCCCGTGGAA CCCTGCTGAC CCCATCGAGA CCATTTTCCA TCGCTTAACT GACGCCATCG CCTACTCCAC GGCGGGACAT GACCCCATCA CCGAAGCTGC CGCCGTTCGC GCCGGCTACG ACGTGCTCGA GCACTCTGGC CTGTTTCCCC GTGCCTGTGA AACCTGGCGT ACCGCCTCGC CGGATACCCA CACGCTTGCC AATCTGCGCA CCCTATTCAA GGTCGCCGAT ACCGACCGCA AGCGTACGGT TACCACCGGC TCCCTTGGGT ACGCCAACGT CCTTGCCGCC GCGCCATCGG TTCTCCCTTT GGTCTCGCCC GACTCGCTCA GCCTTCCTTT TTCTGCCCTC TCGGTGTCAC ATTCCTCTTC TGCCCTCTCT GAGCGAACTT ATTGCTGGAC CCATGGATCC AGCAATAACC GTCGGCACAC TAGTGCCACT TGCAAAAACA AGGCCCCTGG CCACCGCGAC GACGCGACGG CCACCAACAC CCTTGGCGGC TCCACCAAGG TTTGGACTGC CCCCAAGCCT CCTGAATAGG AAAGAGGGAC GGCTACGCCG ACGATTAACA CTAGTAATAC CGATTATCTA AATCATATTA CTAGTCTTAA CTCGTCTGTA GTCCCCTCCC CGCCTAGTAC CCACACCTCG GCCATTGCCG ACACCGGCTG CACCGGCCAC TACATTACCA TCAACTGCCC TCACACGCAC CGGCACCCAG CCAACCCCAG CCTCTCCGTC CGTGTCCCGA ACGGCTCTGT CCTCCGCTCC AGCCACGTTG CCACCCTGGA CCTCCCTGGT TTCTCCCCTG CCGCCTGCCA AGCCCACATT TTTCCTGGGC TCGCTTCCCA TCCGCTCCTC TCCATCGGTC AACTGTGCGA CGACGGCTGT ACGGCAACCT TCTCGGCCAC TCGCCTTGAC ATCCATCGCG ACGCCACCCT GCTGCTCTCT GGTGCCCGCT CCCCCCACAC TGGCCTCTGG CACCTTGATC TTACCCCTCC CAAGTCCCCT GCTACAGCCC ATGCTCTCGT TCCAACCACC CCCCTCGCCG ACCGCATCGC TTTTGTTCAC GCCTCGCTCT TCTCCCCGGC TCTCTCTACC TGGTGCCAGG CCCTCGACTC CGGCCATCTC GCGACCTTTC CAGACCTTTC CTCCCGCCAG GTCCGCAAGT ACCCACCCCG CTCCCCCGCG ATGATCAAAG GTCACCTCGA CCAACAACGC GCAAACCTGC GCTCCACCAA GCTTTCCCCT GCCTGTTCCC CTCTCTCGAC GGAACCCCCT GCCATCGCTG TGCCCGACCT CGATCCTCCT GACGCCCACC CTATCGCACG CACACACCAT GTTTTTGTTG CCCACCAACG GGTCACCGGT CAAATCTACA CGGACCAACC GGGCCGTTTC CTCACGCCCT CAAGTGCCGG ACACAACGAC ATGCTTGTGC TCTACGATTT TGATAGCAAT GCCATCCATG TCGAGCTCAT GAAGAACAAG TCCGGCCCCG AGATTCTTGC CGCCTACAAA CGCGCACACT CTCTCTTTAC CCAACGCGGC CTCCGTCCCC AGCTCCAACG CCTCGACAAC GAAGCCTCTA CAGCCCTCCA ATCCTTCATG ACCTCGGAAC ACGTCGACTT TCAGCTGGCA CCTCCCCATC TGCACCGTCG TAATGCCGCC GAACGAGCCA TCCGTACCTT CAAAAACCAC TTTATTGCTG GCCTCTGTAC CACTAACCCA GATTTTCCCC TCCATCTTTG GGACCGCCTC CTCCCCCAGG CCCTTATCAC CCTAAATCTT CTTCGTCGCT CCCGCATCAA TCCCAAGCTG TCCGCCCACG CCCAGCTTCA TGGTGCTTTC GATTACAACC GCACCCCGCT TGCTCCTCCC GGGACTCGCG TCCTAGTCCA CGTCAAGCCG TCCGTCCGCG AAACTTGGGC CCCCCATGCT GTCGAAGGTT GGTATCTCGG CCCCGCTCTC AACCATTATC GCTGCCATCG CGTATGGATC ACGGAAACAC GTGCCGAACG TGTTGCTGAC ACCCTTTCCT GGTTCCCGAC CCGCATTCCC ATGCCCGCCG CTTCGTCCAC CGACCGCGCC CTGGCCGCCG CCCGTGACCT GGTCCATGCC CTCCAGAATC CTTCCCCGGC GTCTCCGTTC GCCCCCCTCG ATGCCACACA GCACCAGGCA CTCACCGATC TTGCCACCCT CTTTGCCACT GTGGCCGCCC CCGCCGACGA CGTCCCTGCA CCTGCTCCCG TGCCTCCGGT CCGTCCCCCT GCCCCAGCAC CTCCCCTTGC TCAGGTCCGT TTTGCCGTTC CTCTTGTCAC GGCTGAACAT GCCCCGGCAC TTCCGAGGGT GCCCATTCCG GCCCCCGCAC TTCCGAGGGT GCCCACCCTG GCCACATATC ACTCTCGCAC CGGCAACCCA GGCCGTCGCC GTCGCAAAGC ACGCACACAA CCGGCACCCC CAACCCTAG
|
Protein sequence | MSPTADFTIS DFPHKVLAPI ATDTTAPSYS SLLLAQRQLS ANASAIPSLN GGGAHGHMAL TLSAEAYAEL SDIPFVIPVA PPADPEPGTT QPQITENNRL HKRAVAIHSL YVAVNNALRR QILDAVPRVY VRDLEHPQFA YSHVSCRDLL DHLWRNFGTI SASDLKTNIQ SMYTPWNPAD PIETIFHRLT DAIAYSTAGH DPITEAAAVR AGYDVLEHSG LFPRACETWR TASPDTHTLA NLRTLFKVAD TDRKRTVTTG SLGLNSSVVP SPPSTHTSAI ADTGCTGHYI TINCPHTHRH PANPSLSVRV PNGSVLRSSH VATLDLPGFS PAACQAHIFP GLASHPLLSI GQLCDDGCTA TFSATRLDIH RDATLLLSGA RSPHTGLWHL DLTPPKSPAT AHALVPTTPL ADRIAFVHAS LFSPALSTWC QALDSGHLAT FPDLSSRQVR KYPPRSPAMI KGHLDQQRAN LRSTKLSPAC SPLSTEPPAI AVPDLDPPDA HPIARTHHVF VAHQRVTGQI YTDQPGRFLT PSSAGHNDML VLYDFDSNAI HVELMKNKSG PEILAAYKRA HSLFTQRGLR PQLQRLDNEA STALQSFMTS EHVDFQLAPP HLHRRNAAER AIRTFKNHFI AGLCTTNPDF PLHLWDRLLP QALITLNLLR RSRINPKLSA HAQLHGAFDY NRTPLAPPGT RVLVHVKPSV RETWAPHAVE GWYLGPALNH YRCHRVWITE TRAERVADTL SWFPTRIPMP AASSTDRALA AARDLVHALQ NPSPASPFAP LDATQHQALT DLATLFATVA APADDVPAPA PVPPVRPPAP APPLAQAVAV AKHAHNRHPQ P
|
| |