Gene Information |
Locus tag | PHATRDRAFT_50128 |
Symbol | |
ID | 7198837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011695 |
Strand | + |
Start bp | 137745 |
End bp | 138956 |
Gene Length | 1212 bp |
Protein Length | 239 aa |
Translation table | |
GC content | 55% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184974 |
Protein GI | 219129604 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.969073 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCAAGTATT GTAGCCGTTG CTCATCTTAT TTGCCTACCT ACCTATCAGT CATACTACCT ACCTCCCTAG AATCCTTCTT ATCTATCTAT CTATCTATCG AACTAGTTGC CCACGCACCT ACCTACCGTA CAGGAGACGA CTCTCACAAC AACACGTAGA CTCACACTTC AGCGTCAGAT CATGCCTACT AGTCGTCTCC GCCGTCGTCG TCAGAGTTCG GCAAGCCGTC CGGTCCGGGC CCTCAACGGG GGCAAAGCCG GACACGAAGC CCGGAGCACC AGCACGCCAT CCGCCGGAAC GTCCAACAAG CTCCACACCT TTTGGCACCG CATCGACGAA GTACACTTTG AGGCCCAGTC TTCCGGTGCC GACCCCGTGC ACGCCCGCAC TCCTGGTCGG CGCCCTACTC TGGAACGGGG TCACTCCCAG AATAGTCCGA ATGCGGATAC CGGGAGAACC CAGAAGCGGG GTAGTCAACG CAAGGTACGG GAACGACAGT ATTCCTTCGG AAACCGACAG TTGACGGACC GGACACTCCC CATGGATTCT TGCTGGATGT CCGGAGACAG TGAAGACGCA GACAGTAGTT GTGATGAGCA GCCGTTGTCG AAAGCAGACA ATCCTAACAC CTCCGAGGCA TTCCTGAAGC ACAACGACGA CGATCGAGAT GAAGACGATC AGTACTCGGA GCAGAACGTC AGCAGCAGCA GCGAAGACGA AGCCGATGAT GAAGATGATC GCGTAGTGGC CTTTGTGTAT CACGAAATGG GTCGCAACGG TCCTTGGCCG CCACGGAAGA TCTTCACTCC GGCAGTACCC CCGCCACCCG GCCGTACCTC CAAACGTAGT CCCTTGGTAG CATTCTGGCG CAAACTGGGA TCCAACCACA AGACACCCTG AGTGAGCAAA GAGGGAGCTC CCTTTCATTC CACACTTGTG GATGAAAGAG GGACGCCGTG CTCGGAATTC TATGTCGATC GGAACTTGTG GCTGTGGCGG ACTGTGGGTA CCGTGTCCGT CACGCTTTCC TTCGGGAACA CAAGCGTATC TAACGGCCTC GCACCCATGG CACCGCGCAG GCCGAAGCAC CAATCTTCTG TTTTTACATT GCTGTTGATT GAAACCTTCT GATGAGGTAG GGTATACACA AACAATGAGA AATACACAAA CTTACATTAC CAACGCACTA CATAGTATCA CCTTTCTGTA CT
|
Protein sequence | MPTSRLRRRR QSSASRPVRA LNGGKAGHEA RSTSTPSAGT SNKLHTFWHR IDEVHFEAQS SGADPVHART PGRRPTLERG HSQNSPNADT GRTQKRGSQR KVRERQYSFG NRQLTDRTLP MDSCWMSGDS EDADSSCDEQ PLSKADNPNT SEAFLKHNDD DRDEDDQYSE QNVSSSSEDE ADDEDDRVVA FVYHEMGRNG PWPPRKIFTP AVPPPPGRTS KRSPLVAFWR KLGSNHKTP
|