Gene Information |
Locus tag | PHATRDRAFT_47119 |
Symbol | |
ID | 7202032 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011680 |
Strand | - |
Start bp | 510373 |
End bp | 511824 |
Gene Length | 1452 bp |
Protein Length | 459 aa |
Translation table | |
GC content | 49% |
IMG OID | |
Product | nucleotide-binding protein |
Protein accession | XP_002181390 |
Protein GI | 219122098 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAGCGCCTTC TGTAAGGGGT TACACAATAC GCCCGTGACA AAAATAAATG TCATCGCACC CTAACAACCC CTCAATTTTG TCGCTGTGGC AGCTTCGACA GAGCGGCCAT TGTACGGCAA GAGCATTTTC AAATGAAATC ATTCGGAAGC GTTCTATGAA GGCGCTGATT TCAACACCCA GTATACGGGC CCCTGATAAA TGCGTCACCT CGTTAGATGT GCACAAATTA GACTGTCGCT ATTTGCTATC AGGAAGTATG GAGGGGATCG TGTCGGTATA TGACTTGTCG CGGTGGGGAG CATCGCCTCT GGGGAAGACT GGTTCTGAAC AACAAGAGCT TGTGCATCAG CCAATCGCGC GCTCGTTCAT CGCAACTGCA ACCGGTCAAC AAAGTTGTGA AGTACCTAGG GGTCACTCCC TTGCTATTCT TAGTGCGCTC TGGTATCCGT TAGACAGTGG AATGTTTCTC TCGGCCTCTA CCGACGGCCA CGTTTTGCTG TGGGACACCA ACGAAATGAC TCCAGTCACA AGACTTGCCC CTTTTGAATT CGAAAACACT TCAATGGGAG AAGAAGTCTT CGCTCGGCCA AGCTGTATGG AGATATCTTT GTGGTCAGGT AATTTGGCGG CTTTAGGTTC GCGGCGGGTC AATGCCGTCA AACTCGTGGA CATTCGGACT GGTACGTCAT CACATTCCTT GACGGGACAT CGAGCAGGGA TATCTACCGT CCAATGGTGT CCAACGGCCG AGCACGTTTT AGCCAGTGGG GGTCAAGACG GATGTATACG CATATGGGAT ATTCGTAAAG CAGGAAGTAC CGGGTGTCTT ACTACTCTAG ACCGGGACGC ATGCCAAGAG CCTTCTGTTT CGAATCGCTT TAAGGCTGAC TACTCTCATT TAAAAATCAC AAAACAAAAA TTGGCTCCAA ATGATTACCA TCTCGCGGAG AGCAAGTCTA TATGGTCGGT CGAAGGATCA GTAACAGGGC TAGCTTTCAC GCCGGATGGT CATTCCATCG TTAGCACCAG TTCTGATGGA CAGATGCAAC TTTGGGATTT GCGAGGACCC GCCTCCATGG CGCCCCAACG TTTCACAAAT TCTGTTGGAG GACGACCCCT TGCCAAGTCG AAATCGACGA CTCACCGAAG ACCATTGTTA ATCACAAAGA ACGGAAACTC ATCAACAGTA TGGGTTCCGA ATGGCAGGGT TCTACTTGGA TTTTCGCTGG AAGACGATGG ATCCCCTACT CAGATTCTCA GCGGTCACCT ACAGGGAATT CAAGCAATCG CCGAGGTTCC GGATACAGGC GAAATAATCA CTTCCTCTAC GGATCGCTTG ATTTTGACTT GGGGCTATCC ACAACCACTG CATACATTTG CAAGCAGCAG GAAACGCCGA CTCGAAGATG ATAGAGACTC GTGGTAAACA TAAATACAAT GAGGTTACTT GT
|
Protein sequence | MSSHPNNPSI LSLWQLRQSG HCTARAFSNE IIRKRSMKAL ISTPSIRAPD KCVTSLDVHK LDCRYLLSGS MEGIVSVYDL SRWGASPLGK TGSEQQELVH QPIARSFIAT ATGQQSCEVP RGHSLAILSA LWYPLDSGMF LSASTDGHVL LWDTNEMTPV TRLAPFEFEN TSMGEEVFAR PSCMEISLWS GNLAALGSRR VNAVKLVDIR TGTSSHSLTG HRAGISTVQW CPTAEHVLAS GGQDGCIRIW DIRKAGSTGC LTTLDRDACQ EPSVSNRFKA DYSHLKITKQ KLAPNDYHLA ESKSIWSVEG SVTGLAFTPD GHSIVSTSSD GQMQLWDLRG PASMAPQRFT NSVGGRPLAK SKSTTHRRPL LITKNGNSST VWVPNGRVLL GFSLEDDGSP TQILSGHLQG IQAIAEVPDT GEIITSSTDR LILTWGYPQP LHTFASSRKR RLEDDRDSW
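The fields in this record can be cross-checked against each other. The sketch below is an illustration only, not part of the source database: it recomputes the gene length from the Start bp and End bp values, then translates the beginning of the listed gene sequence from its first ATG and compares the result with the start of the listed protein sequence. The Translation table field is empty in the record, so the standard genetic code is assumed here. The function names `span_length` and `translate` are hypothetical helpers introduced for this example.

```python
from itertools import product

# Values copied from the record above.
START_BP, END_BP = 510373, 511824
GENE_LENGTH_BP = 1452

# First 80 bases of the listed gene sequence (spaces removed) and the
# first 10 residues of the listed protein sequence.
GENE_PREFIX = (
    "GAGCGCCTTC TGTAAGGGGT TACACAATAC GCCCGTGACA AAAATAAATG "
    "TCATCGCACC CTAACAACCC CTCAATTTTG"
).replace(" ", "")
PROTEIN_PREFIX = "MSSHPNNPSI"

# ASSUMPTION: standard genetic code, because the record leaves
# "Translation table" blank. Codons are ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def translate(dna: str) -> str:
    """Translate complete codons of dna, reading from its first base."""
    codons = (dna[i:i + 3] for i in range(0, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# Offset (0-based) of the first ATG in the listed sequence.
orf_start = GENE_PREFIX.find("ATG")

print("length from coordinates:", span_length(START_BP, END_BP))
print("first ATG at offset:", orf_start)
print("translated prefix:", translate(GENE_PREFIX[orf_start:]))
```

Under these assumptions the coordinate span matches the listed Gene Length, and the translation from the first ATG begins with the same residues as the listed protein sequence. This suggests the gene sequence is shown in coding orientation even though the Strand field is `-`.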