Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_12174 |
Symbol | |
ID | 7200679 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011675 |
Strand | + |
Start bp | 148944 |
End bp | 150800 |
Gene Length | 1857 bp |
Protein Length | 585 aa |
Translation table | |
GC content | 54% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179590 |
Protein GI | 219117595 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCGT TTGATCTCTT CCAGTCACAA CTCGAATCCA ACTCGACCGA AGCCGAAATT GACGCCATGA AGCGTTTAGC CGTTGTTGCC ATTACTATGG GAAAAGATGA CGCCCAAGCT ACGCTCATAC CCTATCTAAC ACAGATAGGT ACCGCGCAGC CGCTTCCTTC GGACGAACTC TTGCTTATTC TCGGACAGGA ACTGCCCGCC GTCGCCAAGT TCATCGGCCC CGCTTGTGTT GTGGACTTTT TGCCCCTTCT CGAACGTCTC GCCGCGGTAG AGGAAACAGT CGTTCGCGAT CAAGCCGTCG TGGCGCTGTG CGAACTCCTT GGACAGGCAG GGACCGGGCT GGACGCCATT CCCTGGACGG CACTCGCCAA ACGTTTGGGC TCGGCTGACT GGTTCACCGC CAAAGTCTCC GCTTGCGGCG TCGTAGCTTC TATTCTCCAA CTCAATAACA GTAATTCGGA AGAGTTACTC GCGCTTTACA AAGATCTCTG TCAGGACGAG ACACCCATGG TGCGGCGGGC AGCAGCCAAG CATATTGGCA AAGTTCTCGG TGTCGCTGGG TACGAGCAAC GTGATTTTTG CACCGCCACC TTGCCCGTAC TCTGTCGGGA CGAACAAGAC TCGGTCCGAC TCTTGGCGAT CGGGTCCTTG GCCGATGCGG GATCCAGCTT TTCTGTGCAT CCGTCGTGGA CTGCCACAAA TTGGTTGCCC TTGGTCAAGG ACGGATCCAC CGATATGAGT TGGTACGTGT AAAAAAGCAG AGAGAGCTCG CCTTGCAGCG TATGCAGCCC TCCCAATGGT CGCTTTCCCT GTCTCACCGT TTGTGATTTC TTGACTCGTA GGCGTGTGCG AAACAATTTG GCCAAGAATT TCGCCAACGT TGCCAACAAC CTTGGTTTTC AAAACGATCC TGACCAGCAG ACCGAGCAAA GTGTCGTTAT GGCTTGCTTC GTGGCTCTCT TAATGGACTC GGAAGCGGAA GTCCGAGCGG CCTCCGTCGG TCACCTTTCC AAAATGGTGT CTTGGGGCGG AGCGACTCAC TTTTCGAGCC ATCTCCAATC CTTGTTGCCG GCGTTAGCCG ACGATGTGGT CATGGAAGTC CGCAGCAAGT GTGCTCTCGC ACTCATGAGC GCCGCGCACA GCGGCGTCCT CGATGATGCG GTCATTCTCC AGAGCTTCGG TCCCTTGCTC GAAAGCTTTT TACAAGACGA ATTCCAAGAA GTCCAGTTAC AAGTATTGAC CAATCTCGAC AAGATTGCAC ATTTGCTGCC CGCACTGTCG GGCGTTGTGA CCAGTTTGCT GCAAATGTCC AAGGCCAGCA ATTGGCGCGT ACGGGAAGCC GTCGCCCGGC TTTTGCCGCA TTTGGCCCAA ACTCGTGGGC TCGACTTTTT TGCCAATGTT CTTTTGGAGC CCGCTTGGTT GACTCTCCTA CTGGACCCGG TCGCCACTGT CCGCAATGCC ATTGTCCGCG GTATGCCATT GTTGGTAAGC GCAACCGGGG AAGAATGGTT GACGTCCAAA TTGATACCGG AGCACGTACA AATTTTCAAC CAAAATTCAT CGTCCTACCT CATTCGTATG ACAATTATAC AAGGTCACGT GGAAGCAGCC GTGGCGCTGA AGGATGGCCC CCTGTGGAAT GAATTAATGG TGCTGCTACT GCGCGGCCTC AATGATCGCG TTCCCAATGT ACGCATGGTG GCAGCGCAAG GCCTGGCTCA AGTTATGCGT GAAGGCGATT CAAGTGTGAT CGAAGCTAAG CTCCGCCCTG CGTTGGAGAA GCGGTTGCAA GAAGATAATG ATGAGGATTG CCGGCGTTGT ATTTCTCTAG CTCTGGAAGT GGAATAA
|
Protein sequence | MSAFDLFQSQ LESNSTEAEI DAMKRLAVVA ITMGKDDAQA TLIPYLTQIG TAQPLPSDEL LLILGQELPA VAKFIGPACV VDFLPLLERL AAVEETVVRD QAVVALCELL GQAGTGLDAI PWTALAKRLG SADWFTAKVS ACGVVASILQ LNNSNSEELL ALYKDLCQDE TPMVRRAAAK HIGKVLGVAG YEQRDFCTAT LPVLCRDEQD SVRLLAIGSL ADAGSSFSVH PSWTATNWLP LVKDGSTDMS WRVRNNLAKN FANVANNLGF QNDPDQQTEQ SVVMACFVAL LMDSEAEVRA ASVGHLSKMV SWGGATHFSS HLQSLLPALA DDVVMEVRSK CALALMSAAH SGVLDDAVIL QSFGPLLESF LQDEFQEVQL QVLTNLDKIA HLLPALSGVV TSLLQMSKAS NWRVREAVAR LLPHLAQTRG LDFFANVLLE PAWLTLLLDP VATVRNAIVR GMPLLVSATG EEWLTSKLIP EHVQIFNQNS SSYLIRMTII QGHVEAAVAL KDGPLWNELM VLLLRGLNDR VPNVRMVAAQ GLAQVMREGD SSVIEAKLRP ALEKRLQEDN DEDCRRCISL ALEVE
|
| |