Gene Information |
Locus tag | PHATRDRAFT_50003 |
Symbol | |
ID | 7198705 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | + |
Start bp | 110281 |
End bp | 111648 |
Gene Length | 1368 bp |
Protein Length | 455 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184826 |
Protein GI | 219129292 |
COG category | |
COG ID | |
TIGRFAM ID | |
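The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; both assumptions fit the values in this record but are not stated by it. The function names `gene_length` and `protein_length` are hypothetical helpers for illustration, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Amino-acid count for an unspliced CDS of the given length.

    Assumes the whole gene is coding (the record lists the gene as not spliced)
    and, by default, that the final codon is a stop codon.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 110281, End bp 111648.
length = gene_length(110281, 111648)
print(length)                  # 1368, matching "Gene Length"
print(protein_length(length))  # 455, matching "Protein Length"
```

The shortcut of dividing by three only holds because the gene is listed as unspliced; a spliced gene would need the summed exon lengths instead.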
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.1229 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTTGGTC TACGTTTTGC ACACGTTGCT GTGATGCTCG TTGTGGTCGT AAAAACGGTC CTGGTCAGCT TCCACAACGT GAAATCCATA AAGCCAGTAG TGTCGCAACA CATTCCCGGG CAGCCGGAGC GAGCGGCTGT TGTTTCTATC AACGAGGATA CGGGAGTAGG CGTTCCCAGG CGACACACCC CAACCGGAAA AGCGCCGTCC GCCAACTCAA CCAGGAGTGA CACAATAACC GGAAAAGCGC CGTCCGCCAA CTCAACCAGG AGCGACACCA TAGCCGGAGA TGCACCGTCC GCCAACTCAA CCAGGAGTGA CACCCTAACC GGAAAAAGAC CGTCCGCCAA CTCAACCAGG AGGAGCACAC GCATTACTAC TCAAAGAACT GCGACAGAAT TTAGTGTCAG TCCGACGACA CAGGAAATCG GTATGAAGCC TGATTGGACA AAGCTAATAC CGCAATGGTT GCACAATCGA AATACAACCA GCGTGAAACT GCTGAGAACT GGGGCACCAG ACATGCCTTC AGGTGCTTTC GTGCATATCG GTAAAACGGG TGGATCAACG CTGAGCTTGC ACTTGCGCAA TGGCTGTCAC TCCTTCATCA AACAGCCGTG TAATCAGCTG TCCCTGAACG AGGTGGAATC GCATGTATCC CGCTTGACTA CGTACTATCA TGTGCCTGAC TTTGCCAATC TTAGGCGAAC CCGTCACCAT CTGTTCTATG TCCTTAGTAT CCGCGACCCG TTAGCTCGGC TGTTGTCCGT CTTCACGTAC ATGCATCCGG ATAACATCAT TGCCCGCGGC GCTTGTCAAT GGAATATCTG TGAAACCGAA AGTATATGGA AATGTTTTCC AAATTTGGAT GAGTTTGCTA AAGCCCTCAA CTTATACCGA GAAGAAAGCA AAGGACAAGA GGCTGTGGCA GAGAGAAGAT GCGTACAGGG CGCTCAAAGC TTTATTCGTA GTAGCGACAA TGCACCGGAC CACTTTCGAC ACAGCCTCGA GGCGATTGTG GAGGAATACA TGCCACCAAA CGCCATAATG AATTCGACCA TTCTTGTGGT AAGAAATGAA CATATTTGGG ACGATTGGAT TTCCGTTAAC CAATGGCTTG GTCAGGAAGG AAAGGTGGAC GCATTTCCAG ATAACAATGT ACGTGATTTT TCACACATGA CGCTCCCCGT TTCCAAAAAC TTTTCCGAGG AATCTCGCAA AAGCTTATGT GCCGGTCTCC GCGATGAATA CCGAATCTAC ATGACCATCC TCTCAAAGGC CGCCAATATC GGGCCAGTTG AAATGGCGCA AAGCATCGAT TTGGCTCACA GAAACTGCCC GTCTCTTCAA TTTTCTTCTT TTGTTTGA
Protein sequence | MLGLRFAHVA VMLVVVVKTV LVSFHNVKSI KPVVSQHIPG QPERAAVVSI NEDTGVGVPR RHTPTGKAPS ANSTRSDTIT GKAPSANSTR SDTIAGDAPS ANSTRSDTLT GKRPSANSTR RSTRITTQRT ATEFSVSPTT QEIGMKPDWT KLIPQWLHNR NTTSVKLLRT GAPDMPSGAF VHIGKTGGST LSLHLRNGCH SFIKQPCNQL SLNEVESHVS RLTTYYHVPD FANLRRTRHH LFYVLSIRDP LARLLSVFTY MHPDNIIARG ACQWNICETE SIWKCFPNLD EFAKALNLYR EESKGQEAVA ERRCVQGAQS FIRSSDNAPD HFRHSLEAIV EEYMPPNAIM NSTILVVRNE HIWDDWISVN QWLGQEGKVD AFPDNNVRDF SHMTLPVSKN FSEESRKSLC AGLRDEYRIY MTILSKAANI GPVEMAQSID LAHRNCPSLQ FSSFV
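Both sequences are printed in space-separated groups of ten characters, so they need cleaning before use. The following is a minimal sketch of how one might strip the grouping, compute GC content, and translate the coding sequence. It assumes the standard genetic code (NCBI translation table 1), because the "Translation table" field in this record is empty. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and introduced only for illustration.

```python
# Standard genetic code (NCBI table 1) in the conventional TCAG ordering.
# Assumption: the record leaves "Translation table" blank, so table 1 is a guess.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three groups of the gene sequence from the record.
fragment = clean("ATGCTTGGTC TACGTTTTGC ACACGTTGCT")
print(translate(fragment))  # MLGLRFAHVA, the first group of the protein sequence
```

Running `gc_fraction(clean(...))` on the full gene sequence gives a value to compare against the "GC content" field, and `translate` on the full sequence can be compared against the listed protein.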