Gene Information |
Locus tag | PHATRDRAFT_35863 |
Symbol | |
ID | 7201062 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 923256 |
End bp | 925227 |
Gene Length | 1972 bp |
Protein Length | 603 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180347 |
Protein GI | 219119161 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
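The Gene Length field follows from the Start bp and End bp fields by simple arithmetic. A minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the 1972 bp listed above); the function names `gene_length` and `gc_percent` are hypothetical helpers for illustration, not part of any database API:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """GC content as a percentage; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above.
print(gene_length(923256, 925227))  # 1972, matching the Gene Length field
```

Passing the full Gene sequence field to `gc_percent` gives a value that can be compared against the GC content field; the record presumably rounds to a whole percent.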
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.186525 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTAT CCAACAAATC TTTCCACGAT ATTTTTCTGC AAGTCATGGT AGGGCACCCT GACGACGATA TACATAGGGA CGCCCACATG CAGGATACAC GGCGTTCCTC CCACCCCAAC GATACCTCAC CGCTCCCATC CGACGTCGTG ATCGGATGGC GTTCGCTGTG TGGTTCCGTC CTGCCGGAGA ATATCCGTGC CCACGCGGAA GCCTATTCTG CAACGGCCGT ATCACCACAC CATTCCAATA CTCGTCGAAC GACCCAGGCT ACCGCCAATT CTGTGACTAT TAGCAACCGC ACATTCCCGC TCGATTCGAA ACAATTGTCT GATCCTAGTA ACGTGCCTGC CTCCAACCAC AACGAAAACA GCTATCTATA CGAGCGTCAT AGTCCTGTGC CACTTGTGAG CCCATTTCTA CCCGATATGA CGGACTGGGA CAATTCTATG CAGCTTGGTT TACAACCAAT ATTTACGGAT CACGATTCAA CATTTGCCTC TGCTTTCGGC AAAACGTACG TATCTCCCGA AACCTTGCCG GTTCCTGCTC TCCCGGCGCC TCCCCCCGCT TTCGATCGGG CGCACTCCTT GGCCTTTTCG GTTTCCAGCG ACTGTACGAG CCAATCTGCT CTCTTTTTTC ATCCTATATC TAATCCCACA CACGATCCAC CCGTTTCACC ACTCAAACCA TGTTTCTTGG ATACACCAGA GCAGCCCGTG GTGCCAGACA TCGTCCACAA CTTGCCAATC GAACCCCGTC CCACCAAGAA GCTACGCACA ACGGAAACGA TCAATAATGA TGATCTGTGG CCTCGCGCGT TGTGGCCACA ACCTACACCG CAGCTAGCGA TCAACACAAC GTGTGCGCCA CTCGCCCCAC CACCGGCCCA GCACGACTCC CAACCAGTCG TATTGCTCCC CATGCCAATT GTTTGGCCCA AAACACCAAC AGCCTTGAAC ACAAACCTGA CTTTCGACAC GTCGTGGCTC TCCGCCGCTC AAGTGGCCAC CAAGAAAAGT GACCCCGTCG TGAAAAGGAA TAAGGACAGC GAACAGATTT CCGCCAGCAC ACTTGTCGCC TCGCGACCTA GCGTGGTCGA TCCACGATTG GCTACGTTCT TGGAACGCTT TGACAACGCC GAATGGCGAT TGCAAGCCCT GCAGGCCGAA AATGCCGAAC TCCAAGCCAA AGTGCAGGAA GCCGAACGGC AAAAACGCGC CATGCAACAG TTGGCCAGGT AATGCAAGTG CCTCGATAGA AATTCTCATA GATAGAATCG GCGCGGTGAT CTATAGTGTC GTAGCTATGC ACACGGGGCC CGTAGTAACA AAGGTCAGAG TTGAACGGTC TCGCTCGGTA GCTCAACATT GCATCGTTTG GACTACTCCG ACAATCAGAC GCCATTTGTT CCAAAGTGTT GTGAATGTGG CAGGAGTAGC GATCGTAGGG AAGCGTCCCA AAAATATGGC GTGGGTGACT TTGCGAGCTC GTCGACATGC TGGGACGATG TTGCACCACC ACTGCAACAG TAGCCGATTC CTCTGTACCG ATCGTGTTGT AATAGAGGAA AGGACAATGA CTCTGAATCC AAGGAATCGT CCGAACCGTC ACACGTTCCT TTGCGAATAC ATATCTGATG TGGATGCGTG CGGTCGTCGT CGCTGGAACC GGACAAATTC ACGGTCCATA CGGGCCGTCG CGAAGTCCAG ACGCACTGTG TGTGCGACTG CTGTCGTTGC TCCTCTCCTT CGGATACGTG AGCGGGATCT CGCACATTCT CTCCTTTCGA AAGGATTCCT 
ATCCCGGGCA GTCTCCAGGC GCATACCCCA CGAGCTAGAC ACAATTCGTC CAGCGATGGT AATGTTGGCA AGCCTTCCGC ACGGCATGGG TACAACATGG AGTCAAGGAA AGGTTGTTGA AGTTGTTCTA AAACACCGAT CCTGTGTTTG CGTCGAGAGT TGGAGAGCTT GA
|
Protein sequence | MDLSNKSFHD IFLQVMVGHP DDDIHRDAHM QDTRRSSHPN DTSPLPSDVV IGWRSLCGSV LPENIRAHAE AYSATAVSPH HSNTRRTTQA TANSVTISNR TFPLDSKQLS DPSNVPASNH NENSYLYERH SPVPLVSPFL PDMTDWDNSM QLGLQPIFTD HDSTFASAFG KTYVSPETLP VPALPAPPPA FDRAHSLAFS VSSDCTSQSA LFFHPISNPT HDPPVSPLKP CFLDTPEQPV VPDIVHNLPI EPRPTKKLRT TETINNDDLW PRALWPQPTP QLAINTTCAP LAPPPAQHDS QPVVLLPMPI VWPKTPTALN TNLTFDTSWL SAAQVATKKS DPVVKRNKDS EQISASTLVA SRPSVVDPRL ATFLERFDNA EWRLQALQAE NAELQAKVQE AERQKRAMQQ LARRHLFQSV VNVAGVAIVG KRPKNMAWVT LRARRHAGTM LHHHCNSSRF LCTDRVVIEE RTMTLNPRNR PNRHTFLCEY ISDVDACGRR RWNRTNSRSI RAVAKSRRTV CATAVVAPLL RIRERDLAHS LLSKGFLSRA VSRRIPHELD TIRPAMVMLA SLPHGMGTTW SQGKVVEVVL KHRSCVCVES WRA