Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44980 |
Symbol | |
ID | 7199502 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | + |
Start bp | 879632 |
End bp | 881395 |
Gene Length | 1764 bp |
Protein Length | 492 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178867 |
Protein GI | 219116144 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACCAA AGTTTCCTCA AGAAATGACG CGACGTTTGC GTGCAAGTTG AGTTTGCAGG GTTCACAATT TTGTTCCTGG TACCAGTGAT TCTTTCTGTT TTGACATCTG TCTACTACTC GATGACAGCT TTTGCGAGAC GTTCGCTCAA GTTCTTACTC ATCGCCGCTT TACTGAACGG CAGTGTTGCA TCTGCGTTTG TTAGCCCAAC GTCGTCAATA CTGCAATTGC AATCCTCGCG TGCCACCAAT CCATCGACGA TGCAAACGAT AGTATCACAG CAAGCAGACT CTGATTCGGA GACCGTTCGG ATTCGGGGCG GGGGGCCATT GCCTCCTTCC AACCCGAAAG ATCCGGCTGC CAAAAATAAA AAAATTCGAA TTTCCTCCTT TGATTCGATG CGCTTTTTCC TCATTACTCA GATTGTGTTG GGGCACTTTA TCCGTTTCGC CGAGCCGTCG GACGTCGTCT TCAAGTTTTT TTCGCACCAC AACGTGATTG TGGGCTTCTT TTTCGTTCTG AGTGGATACG TGACGGCCTA CACGACGACG GAGAATGCGC AGCGCATCGC CTCTCCGCGC CTGACGGAAA CACCTTCGCA GAAATGGATC CTGAGCAAAA TCTTTGGTTA CCTACCTCTG CATTTGTTCG TGTTGGCCTT GTTCAGCCCA CTTTTTCTGT ACGCTGATGT CAAGTACAAC GGATGGGTAC GTGATTTTTA TGTCTTTTGG AAGTACGGCT GTCCGGGTCT CTTATTTGTT CGCTCTTTTG TTTCTTGATT CATTGCACTC TAGCCAACGG CTGTCTGGCA TGGCTTCCTA TCGGGCACCA TGACACAAGC ATGGTTTCCG ATGCATGCCG AGATCTGGAA CGCTCCAACT TGGTTTTTGT CAGCCTTGAC TTTTGCGACA GCTCTTTTCC CGTTCTGTTT ACCTAAGATT GCTCAAATGG ACAAGAAGCA ACTGCGAAAG ACAGTAGGAT GGCTCTGGAT CATCAATCTT TTGCCCAAGA TTGGATATTG TTATGACTTG AACGTTTGGA AGCTGGCGGA AGGCGTTTCT TCTGCCAAGG CTCATCCCAG TATGGCAATG TTCAATATGG TCCGCTTCAG TCCTTTGCTA CAAGTTGCCG AAGTCTTAAT TGGTGCAGTT GCTTGCCGCC TCGTCATGTT GGACGGTACC GAAGGTGACG ACACCAAGAC CAATGCCTTG AGTACTTTCG TTCCGCTCGC CGGTATCATC GGTCTTATGT TAGCACGCGC CACTGGTCTT GTCGAGATCA GCGACATGCT AGCTCGTGCT GTAGTGTTTG TTCCGCTTTT TCTACGATTC ATGATGGCGG CTCATCGCAA CACAGTCAAG GGTGTCAAAG ATCCATTGGT ATCATTTTTG TCGAGTAAGT TCTTGGGTTC GTTGGGCGCT TTGGCATTCC CCATATTTGT GCTCCACGGA CCGATTGGAC AAGTGTTTTA CAAGAAGCTG ATCGCAACCA GGGTCTTTGG CAAAGTAATG ACGGGTCCCC AGAACTTTGG TTTGTACCTA TTGACGACTG TAGCCTCGGC TTGGATTGTT CAAAAGACAT TTCTGTCCAG CAAGGCGGTG GCCAATTGGT CAAAGAATAG CGTGGATAAA TTGTCGTCGT GGGTATAAGG GAACGCAGGT AGCTGTGCAT GTCATGAATA TAAGTGTTTC GAGAAGGTGG TTTTGAGGAA GACCTACATA TTTGCCTACG CATGCCTTGA TAAACAGAAC TAGTGGCGTA ACTC
|
Protein sequence | MTPKFPQEMT RRLRATFARR SLKFLLIAAL LNGSVASAFV SPTSSILQLQ SSRATNPSTM QTIVSQQADS DSETVRIRGG GPLPPSNPKD PAAKNKKIRI SSFDSMRFFL ITQIVLGHFI RFAEPSDVVF KFFSHHNVIV GFFFVLSGYV TAYTTTENAQ RIASPRLTET PSQKWILSKI FGYLPLHLFV LALFSPLFLY ADVKYNGWPT AVWHGFLSGT MTQAWFPMHA EIWNAPTWFL SALTFATALF PFCLPKIAQM DKKQLRKTVG WLWIINLLPK IGYCYDLNVW KLAEGVSSAK AHPSMAMFNM VRFSPLLQVA EVLIGAVACR LVMLDGTEGD DTKTNALSTF VPLAGIIGLM LARATGLVEI SDMLARAVVF VPLFLRFMMA AHRNTVKGVK DPLVSFLSSK FLGSLGALAF PIFVLHGPIG QVFYKKLIAT RVFGKVMTGP QNFGLYLLTT VASAWIVQKT FLSSKAVANW SKNSVDKLSS WV
|
| |