Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_56506 |
Symbol | |
ID | 7201649 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011678 |
Strand | - |
Start bp | 207257 |
End bp | 209246 |
Gene Length | 1990 bp |
Protein Length | 594 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180962 |
Protein GI | 219120448 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.286813 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTGACCC GGTTGACGTA ATGGTTTGCG CCAAGGATGG AAGCCAAAAG AGTTTTCCAG AACCTTCCTT GTTGCAACGG AAACTCTCCC TATCGATCGA AATCCACATC TCGAGTTCGA ACGGCGAGAA CGCAAACCGG TTCCGTTGAC ACGTCTCCGA TTTGCGGGGG AAGTCGACGT TTTCCCGCTC CATGGCTCAC TGTCAGGGAC TTTCACAAAC AGGCGCTTTC TGGCATTCGC ATTCATTTTG TACAAACTAC CACGCTTCTC TCTTCTGCGG AAGGCCCCAG CCCATAATTG TGACGGTCAA GCAATGACGG ACTTGTCAAC AAATGCGACC GCCAAGGATA AGAAAGATGG ACCAAACTGG ATTCGAGGAG TCAATCTGGG TGGTTGGCTC GTCATGGAGC GATATATAGT TCCCTACCAG TTCGCCATCA CCACTTGTCA CGTCCAGGGC GATTTTTGCT GGTACCCTGG GGCTCTAAGT GCGCCTCCTT TGTTACACAA GGACTACAAG CTCTGCGACT TGACACGCTG CCACCCTTAT CGGAATTTGA CTATATTCAA TGGCACGGAC TATCCCATCG ACGAATGGAA TCTGGCCAAG GCTTTTGACA ACCAAACCTT GGCCACAAAA TGGTTTGACT ATCACTTCAA CAATTTCATT CGCAAAGAGG ATTTGGTGCG CATGAAAAAG GCTGGTTTGA CTCATTTACG GGTGCCGTTG CCGCACTGGA TTCGTGGCGA TATCCGGGAG AATGAGCCTT GGATTGCGGG AAATCGCTGG AAAGTTTTTG TGAGGCTTTG TCAGTGGTGT CGAAATATTG GCCTAGAAGT TTGGCCGAAT CTACACACGG CTCCCGGTTC TCAAAATGGC TTTGACAACT CCGGAATCGA AAGCTCGGTC TATACGTGTA AAGGCTGGGG GAGGCACCCA GAGAATATTG AGCGTACGTT GGATGTTATA CACGAAATTT CTGACGCGAT CGCCAAGGAC CACCTGTTGG ATGTGGTCAC GGGTTTTGGA CTTTTGAATG AGCCTTTCGG TGATTGCAAG TTGAATGGGT ACGAACGGTT TCTGGAGGAC GCGTTGGCTA TCACTCGTGC CAACATGGGC CCAAACGTTC ATATTTTCGT TTCGGACTTG TTTGGAGCCC CAAAATTCAA TGACGGATCC TGGTGGTTGG ATCCAGTCAA GTACCACAAC ACCTACTTGG ATACTCATTT TTACCACACG TTCGACAGCC ATACGCGCTC CATGAGTCCG AAAGAGCACA TCAATCACGT TTGTCACCCT GAGGAACTCC AAGCCGAAAT TACTAGCTGC TGCTACCAAG ACGCCCCCAG TACCAATTCC ACGCCCAGCA GAGGAGTCAA GCGCATTTCT GCCGAATGGA GCGCGGCCTA CGACGCCATG CCCGGAGAAC TTTTGCAGTT TGTCATGGAA GGCATTGCGG TCAACGGGAC GGCACCAGAT TTCTACCGTA TCCTGGAACC CGACCGCCGC GAATTCTTGA GAAAATTTGT CGAATCTCAA ATGGTGGCGT ACGAAGCAGC CGATTCCGAT TTGGGGCGGG GCTGGTTCTA CTGGACAATC AAAATGGAAG GCGGTGCTTT CGCTGAGTGG GACTTCTCTC GTGGTCTTGA GGAAGGCTGG ATTCCGCCCA TTGCAACCCC CGGTGTGGCG AGTCAAGATG TTTATGGGAA CTGTGACCGC ATTTTGGAAA GCACCAACAA TTCCATGGCT ATTGTCCACG CCTTTCCCTG GGGCGACGAA TCATACTGGA AGCAATATCC GATCGTTGAA TCTAGTGAAA CGCGGATACA TCGCAGCTTC GTTTGGCTTG GTATAGCATT TGTTCTAGCT TTTGCTCTGT CCAACCTTTT GCGCCGACGC CGTCAGCTTT TTGGAAAATA CACCAGCATT GACTCCGAAG TTGACGTGGA AGTACCAGGT AAAAGCAATC CGCAATATGT GCAGGAATAG
|
Protein sequence | MLTRLTRFLA FAFILYKLPR FSLLRKAPAH NCDGQAMTDL STNATAKDKK DGPNWIRGVN LGGWLVMERY IVPYQFAITT CHVQGDFCWY PGALSAPPLL HKDYKLCDLT RCHPYRNLTI FNGTDYPIDE WNLAKAFDNQ TLATKWFDYH FNNFIRKEDL VRMKKAGLTH LRVPLPHWIR GDIRENEPWI AGNRWKVFVR LCQWCRNIGL EVWPNLHTAP GSQNGFDNSG IESSVYTCKG WGRHPENIER TLDVIHEISD AIAKDHLLDV VTGFGLLNEP FGDCKLNGYE RFLEDALAIT RANMGPNVHI FVSDLFGAPK FNDGSWWLDP VKYHNTYLDT HFYHTFDSHT RSMSPKEHIN HVCHPEELQA EITSCCYQDA PSTNSTPSRG VKRISAEWSA AYDAMPGELL QFVMEGIAVN GTAPDFYRIL EPDRREFLRK FVESQMVAYE AADSDLGRGW FYWTIKMEGG AFAEWDFSRG LEEGWIPPIA TPGVASQDVY GNCDRILEST NNSMAIVHAF PWGDESYWKQ YPIVESSETR IHRSFVWLGI AFVLAFALSN LLRRRRQLFG KYTSIDSEVD VEVPGKSNPQ YVQE
|
| |