Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44482 |
Symbol | |
ID | 7197713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | - |
Start bp | 700014 |
End bp | 702003 |
Gene Length | 1990 bp |
Protein Length | 600 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178557 |
Protein GI | 219115523 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00564143 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCGGGTCGGA GAAAAACGCG TAACAGTTAA TCTGAAAGGC CAAATACTGT GCAATCCTAT ATAGTACACA GTCGAATAAG ATCGGCGATA ATCTATGCCC GTTGACGCAA GGAAAGCACC TCAAGTTCAG TTTCTGCAGA TTTAGATCAA CGGAATACCA TGTTATCAAA AAAAGTTCTG TCTAGTTTCC AGAAATCTTC GGTCGGAGCT TGTGTACTCT CGGGGACAAG GCGTGTATCT ACCGCACATG TCCGTTGCGT CTCGGTGGCT GCATTGAGCC ATTTCGCTGG CAACGATAAT GCTTCACTGT CAAGTAAATG CTCGTTAATG AATGTTGCTG CTGCTGCCGC TGCTGCGACC GCCACGATTC TAACCTGGGC CGGTGCCACT TCGCTTTGTG ACTCCAATGC AAAGCCCATG AGTACCGAAG ATTCACATTT GACTCCTTCA GAAGTCGGCA AGGAAGACTT TGAGGAGTTC CAGGCATCCC ACGATATCAA TAGCATGCCT GTCTACTCAC TGGAAGAGAT AGCCGAAAAG AATGGAGAAA ACGGAAACCC CATCTGGATG TCGTACGGTG GTGTCGTTTA CGATGTTACT GATTTCATCC CTAACCATCC CGGCGGATCG GAAAAGATTC TCACTGCAGC CGGGTCGGCA ATTGAGCCCT TCTGGTATCT CTATCGTCAA CACTTTGCCT CAGATCTCCC AATGCGTCTA ATGGAGCATA TGGCGATTGG ACGTCTGTCA GAGGAAGATC AAGAAAGGAT AGAAGAACAG ATGGCGACAT TGGAAGAAAC GGACCCATAC GCCAAAGAGC CATACCGACA CCGAGCTCTG CTGGTGCACT CAGATACACC CATGAACGCA GAATGTCCCA CGCGCTTCTT GACACAAAAC TTTCTGACGC CTGCTTCTAT TTTTTATATC CGCCATCACC ACCCGGTCCC GTTTTTGTCT GAGAAGCAGG TGAATGACTT TCGGCTGAAG GTCGATTTGA CAGCTTATGG GAAAGGCGTC GTGCTATATA GCGTGGATGA ACTAAGAAAA ATGAAAAAAG TGGAAATCAC TGCCACACTG CAGTGCAGCG GAAATCGCCG CAGTGGATTC AATCAGTTCC AGAGAACTTC TGGCACGCCA TGGGGCCAAG GAGCTATATC TACAGCCAAG TTCGGTGGAG TTCGCTTGAC AGACTTATTA AAGGCGTCCG GCCTGAAGGA TCCCATTGAG GCCGAAGAAA AGTTGGGCCT TGAGCACGTG CGCTTCCACA GTTTAGATGG TATGTCGGCA TCAATAGGAA TGGAGAAGGC AATGAATCCT TACGGCGATT GTATCGTTGC TTACGAAATG AACGACGAAC CGATCCCTAG AGACCACGGT TTTCCACTCC GAATTATTGT TCCCGGCTAT GCTGCGGTTC GGAATGTCAA GTGGTTAGAA AAAATTGAGC TAGCCAAAAC TGAAGCAGAA GGCCCTTGGC AACGTGGGCT CAACTACAAG ACACTCCCTC CAAACATGAC AAATGCAAAG AACGTTGACC TAAACAAAAT GCCCTCAATG ACGGAAGTAA GTTTATTTTC TGGTATTACA CAAGTCGAAA AGCCAGAAAT GAAAGAAGGC ATGAAGGCTG GTGATAAAAT CACTGTGAAG GCAACCGGTT GGGCGTGGGC AGGAGGAGGT CGCAACATTG TTCGTGTGGA CGTAACTGGA GACAACGGTG CATCCTGGGC AACCGCAATT CTAAAAGAAG GATCCGAGCA GCGCTTTGGT CGCGCTTGGG CATGGACATT TTGGGAATGC GACGTTCCGG CGATCGTACA AGAAGATGGC AATGTGCATT TAGCATCTAA AGCGGTCGAC CTGGCGTTTA ACGCGCAGCC AGAAGACGCG AATCATACAT GGAATGTCAG AGGACTCGGC AATAATAGTT GGTACCGCAC CAAGATCAGA ATATTGGAAT AAATGTTAAA AAGCGTCGCC ATTTATATTG
|
Protein sequence | MLSKKVLSSF QKSSVGACVL SGTRRVSTAH VRCVSVAALS HFAGNDNASL SSKCSLMNVA AAAAAATATI LTWAGATSLC DSNAKPMSTE DSHLTPSEVG KEDFEEFQAS HDINSMPVYS LEEIAEKNGE NGNPIWMSYG GVVYDVTDFI PNHPGGSEKI LTAAGSAIEP FWYLYRQHFA SDLPMRLMEH MAIGRLSEED QERIEEQMAT LEETDPYAKE PYRHRALLVH SDTPMNAECP TRFLTQNFLT PASIFYIRHH HPVPFLSEKQ VNDFRLKVDL TAYGKGVVLY SVDELRKMKK VEITATLQCS GNRRSGFNQF QRTSGTPWGQ GAISTAKFGG VRLTDLLKAS GLKDPIEAEE KLGLEHVRFH SLDGMSASIG MEKAMNPYGD CIVAYEMNDE PIPRDHGFPL RIIVPGYAAV RNVKWLEKIE LAKTEAEGPW QRGLNYKTLP PNMTNAKNVD LNKMPSMTEV SLFSGITQVE KPEMKEGMKA GDKITVKATG WAWAGGGRNI VRVDVTGDNG ASWATAILKE GSEQRFGRAW AWTFWECDVP AIVQEDGNVH LASKAVDLAF NAQPEDANHT WNVRGLGNNS WYRTKIRILE
|
| |