Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_39221 |
Symbol | |
ID | 7194924 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 638895 |
End bp | 640688 |
Gene Length | 1794 bp |
Protein Length | 567 aa |
Translation table | |
GC content | 47% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183132 |
Protein GI | 219125741 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGAAAA ACATTGTTCG AGAACATGCT TTTGATTTTG ACGCCCAAGT TGTATTTGCA AAAATCGTTA AACATTACAC GGCATCCACA GCCGCGAAAA TCAGCTCCGG CACTACTCTC TCATACTTGA CCTCTGCAAA ATACGGCAGC TCCTGGACCG GCACTGCGGA AGGTTTATCT TGCATTGGAA AACCATCTAC GCATCTACAA CAATACCGTG CCAACTACGG AACATTTGCC ACCGCAACTC TGCCTCAGTT TGCTCGAGTC CTCTGTTCGC GACGTCTCCG AACTACGTCA AGTCAACACT ACCGCGAATT TAGATTTAGC TAAAGGGGGG TCTCCTTTTA ACTATGAAAA TTATCTAAGT CTACTTCTCG CTGCAGCAAC TTATACGACA AAGGGAACAA CCTTTCCAAC TCTCGTAGCC CTAAGACCAA GCGTAGTGCC TTTGTTGCCG AAACCATCTT CCCTGACAAC GACTACGGCG TTGATTACGA CATTGATTTA TCTCCGTCCA TTCTGTACGA AGCGAATGCT CACAACCGCA GAGCAGGAGA TCAAAATCGA GACCGCCAGG GCAATGTCAA CCGTGAACAA CCGTATATCC CCCGTGAGAC ATGGGATAAA CTGTCCGAGG ATGCAAAGGC GATTCTCCGA GGCATGTCTT CTCCCGCGGA AGGTCAAGCC TCGCCTAACA GCAAGTCAAC ACCCGCATTT CATGCCAATT CTCATTCTCT AGCCGACATG GGACACCCCT CCCCAACCAA CAACTCGTTG AATGAAAGCG ACAACGAAAA ATTCCACGAT TGTGGAAACG ATTCGGAGTT ACTTGTCCAC CTTACTGATT GTTCCAGTCC TATGGCAAAT GGAGACATTC GCAAAGTCCT TGCCTCTGCC TCTTCCCACA AGAAAAATGA AAACAACTGC CTCCAGTCAA ACATGCTTGA GTACACCATT TCACGGCACT CCATCATTGG AACCACATCT TCTCTCATTG ACAGAGGTGC CAATGGCGGA CTCGCTGGAA GCGATGTTAA GGTTATCAAC AAAACCGGCC GTTCGGCAAG CATCACCGGT ATCAACGACC ACACTCTGCC TGATTTAGAT ATTGTCACCA CTGCTGGTCT TGTTGAATCC CAGAACGGAC CTATTATTGT CGTACTACAT CAATATGCCC ATCATGGAAA AGGAAAAACT ATCCATTCTA GTGCACAACT AGAGTACTAC AAGAACACTG TCAAAGACCG ATCTTGTGTA CTTGGAGGTA AACAACGCAT TGTAACTCTA GATGACTATG TTATTCTTTT ACAAGTTCGT CAGGGACTTG CATACATGGA CATGCGCCCT CCTTCCGACG CAGAGTTTGA TACACTTCCC CACGTTGTAC TTACTTCCCA TGTTGATTGG GATCCGTCCA TCATTGACAA TGAGATTGAC CTTGCCACGG ATTGGTATGA CGCCGTTCAG GATCTCCCGA ACGACCCATA TGTCGAACCT CGTTTCAATT CAACTGGGGA CTACTGGCAT AGACATGTTG CGAATTTTGA CATATTTTTG TCATCTGAGA TCATTGCCCA TTCCACCGCT ATTGACAATA TACTCTCGTC CAATAAGCAC AACATGGTTC GAAATGAACG CAATTACGAA GCCTTGCGCC CTTGTCTTGG CTGGGTCTCT ACTGACACAG TCAAGAAAAC TATCCTGGCC ACCACGCAAT TTGCTCGAGA AGTATATAAT GCGCCCATGC ATAAACATTT CAAGTCCCGC TTCCCAGCGC TTAATGTTCA TTGA
|
Protein sequence | MGKNIVREHA FDFDAQVVFA KIVKHYTAST AAKISSGTTL SYLTSAKYGS SWTGTAEGLS CIGKPSTHLQ QYRANYGTFA TATLPQFARV LCSRRLRTTN LYDKGNNLSN SRSPKTKRSA FVAETIFPDN DYGVDYDIDL SPSILYEANA HNRRAGDQNR DRQGNVNREQ PYIPRETWDK LSEDAKAILR GMSSPAEGQA SPNSKSTPAF HANSHSLADM GHPSPTNNSL NESDNEKFHD CGNDSELLVH LTDCSSPMAN GDIRKVLASA SSHKKNENNC LQSNMLEYTI SRHSIIGTTS SLIDRGANGG LAGSDVKVIN KTGRSASITG INDHTLPDLD IVTTAGLVES QNGPIIVVLH QYAHHGKGKT IHSSAQLEYY KNTVKDRSCV LGGKQRIVTL DDYVILLQVR QGLAYMDMRP PSDAEFDTLP HVVLTSHVDW DPSIIDNEID LATDWYDAVQ DLPNDPYVEP RFNSTGDYWH RHVANFDIFL SSEIIAHSTA IDNILSSNKH NMVRNERNYE ALRPCLGWVS TDTVKKTILA TTQFAREVYN APMHKHFKSR FPALNVH
|
| |