Gene Information |
Locus tag | PHATRDRAFT_47447 |
Symbol | |
ID | 7202569 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | + |
Start bp | 643840 |
End bp | 645736 |
Gene Length | 1897 bp |
Protein Length | 613 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181604 |
Protein GI | 219122547 |
COG category | |
COG ID | |
TIGRFAM ID | |
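The Gene Length above is consistent with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That convention is an assumption here, not something the record states. A minimal sketch of the arithmetic, using a hypothetical helper named `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Start bp and End bp as listed in this record.
gene_length = span_length(643840, 645736)
print(gene_length)  # 1897
```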
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAACCTAG CCTTCCGTTG GGTCGCGGTG GGACTTGAAA TCGGTACCAG AATAAATCCC ATGCTCCTGC GTCTGCATAT GGTAAACAGC TCCCTGTACC TTCAGGCTAA ATGTGTTTCC AGCACACCAC CATCGACGGT GAGCTCCATA ATCCGCGATC TGGTTTCCGC GAGAGCGAGG CATGTGGTCG TCGGGACAGC GATGATCTAC CACAACATAC GAAACACTGC CGCCGTTGTA CTGTCTCTAG GCTACACCAT GACGAGGCCT TTATATGTAA AGGAAAGCTC AGCGTCGCCC AACCGGCGCT TAAAAAAATC TCGTGGAAGT TTGCGAAATC AGCCGTCTCT TCCGGAACAG TTGGTCACTT GCTATGCACA CAATCGACGA CTATGGGCCG CCTTGATAAT GGGTCTCTTT GTCATTTCGC AGTGGTCCTT CCAATCCTCT CCTTGGAATG TTGACGACGA CGACGAAAGA ATGTCGGGCG TTCCGGAGCT AGCAGAAGAT TCCCGCCCCA CGAGTGGGCG CCAGACTTGG CCCTACCACA AAGACTCCAG AGTCGAAAAT ATAGACAATG ATGGATTGGA GCGGCGGGCA CGACAAGCGG AACAATCCGC AAGAAATTTA GAAACGTACG ATTGTACGGA AAGTCCCGTC ATTCCCCAGT CAGTCTACCA ACCACAATCG GCTCTGCCCT TGCCTGTGGT TTTGTTAGGG AGTCGGAATG CAACAGAAGA CGAGTATCAC GGGAGCTTGC TAGACGGCAT TTCTCGTTCT CGTTATTTGT ACGTGTCTAC AATTGCTTTG TACGATCTTG GAGAACGAAC ATGGCAATCG GACACTTGGC AAGTACCTTC AGCAAAAGTA GTACTTCCAA CTGTATATAT CGTGGACTGG GCTGCCTTGG AGCGCGATTG TCACGCGCTG GAGCATATGA TGGCGGAAAC GCAAGTTCCA TCGAATGCGT TTTTTGTTTA TTTCGACTAC TCCAGTAGTG CTCGGACCAA GGTTTGCCCT TGCATTGAGG ATCATTTTAC CAAGGAACGG ATACGACTTG CCAAGCGAAG CCTGGTACAA GAACGATACT GGAATCAGTC GAACGACTGG GTGGAAATCG GCGAGGTTCC TAGCAATATT GGCAACACAA TTACAGGAGG ACCTGTTTTA CACGCGCCTC ACGCATTACG GGAGCCCTTT GTTGACCTTG TAAAACGTCA AGTCGGCACA ACGCCATCCG CGAAGAAATT GGTTAATCGG AATCGCAAGA CAGATGTTTC TTTTTTCTGG AGAAAAGGCG ACAACTCTCA CTACAGTTTC TTGCGACGCC TCGTAGGTTA TGTGGTAATG GATATGGGTA AGAAATACAG GTGGTTGGTA AAAATCCTAG GCGATGAAGA CGCGATCGAA TTCAATCTGG CTCAGCCCGA ATACGCTGAT CAACTGTTGA ATAGCAAAAT TGTGGTCGTC GCACAGCGCG ACGAGTGGGA AGATCACTAT CGCTTATTTG AATCAATGGC AAGTGGCGCT CTTGTCTTTA CGGACGTTAT GGTGGCGGCA CCACGGAATT TGCGAAACCG CACGAACGTT GTTATCTACG ACGGTCCTGA GTCACTGCAA AAATTGCTGC GCTACTACTT GAGTTCCAAG CAAGCCAAAA CAAGGCTTAC AATCGCGCGT CGTGGATACG AATATGTCAT GGGCAAGCAC CGTTCATGGC ATCGCCTAGA AGCGCTCCTT TTCGGAACGC AACTAACCCA GCCCAATCAG CCTGACATGG ATGCGCCTCT CCGAAAAAGG 
GGAAAATCGA GAAAACAGTA TAGTGATAGA GCCACTACTT GATCGAAGCG AAATTTAGTT ACTATGAAAC CAGGTAAGAC TACAAATTAT TAGAATT
|
Protein sequence | MNLAFRWVAV GLEIGTRINP MLLRLHMVNS SLYLQAKCVS STPPSTVSSI IRDLVSARAR HVVVGTAMIY HNIRNTAAVV LSLGYTMTRP LYVKESSASP NRRLKKSRGS LRNQPSLPEQ LVTCYAHNRR LWAALIMGLF VISQWSFQSS PWNVDDDDER MSGVPELAED SRPTSGRQTW PYHKDSRVEN IDNDGLERRA RQAEQSARNL ETYDCTESPV IPQSVYQPQS ALPLPVVLLG SRNATEDEYH GSLLDGISRS RYLYVSTIAL YDLGERTWQS DTWQVPSAKV VLPTVYIVDW AALERDCHAL EHMMAETQVP SNAFFVYFDY SSSARTKVCP CIEDHFTKER IRLAKRSLVQ ERYWNQSNDW VEIGEVPSNI GNTITGGPVL HAPHALREPF VDLVKRQVGT TPSAKKLVNR NRKTDVSFFW RKGDNSHYSF LRRLVGYVVM DMGKKYRWLV KILGDEDAIE FNLAQPEYAD QLLNSKIVVV AQRDEWEDHY RLFESMASGA LVFTDVMVAA PRNLRNRTNV VIYDGPESLQ KLLRYYLSSK QAKTRLTIAR RGYEYVMGKH RSWHRLEALL FGTQLTQPNQ PDMDAPLRKR GKSRKQYSDR ATT