Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49791 |
Symbol | |
ID | 7198459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011692 |
Strand | + |
Start bp | 310600 |
End bp | 312356 |
Gene Length | 1757 bp |
Protein Length | 560 aa |
Translation table | |
GC content | 59% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184523 |
Protein GI | 219128655 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGACACTTAC AACATACACA CTCCCCCGGA ATCTCCCCGT CAATCCTAAT TATTCCCACA TATTCATTGT TCGCATGGTA TCGGTAACCC GCAAACGCAT TCAGGGCAAT CACCGCCGCG GATTGCGGCG TCCCGGCAGC GACGCGAGTC GCGAAACCCT CGGTGCCTTG GGCCTGGCGT TCCTCAGTGT CGTCTTTATC GCTTACGCCG TTCTGGGTAC GACGGACGAG ACAACACCGG TGACGCCGGT CCACGCCACT CCCGACACTG TCCCTGGAGA ACGGGCGGTA GCCTGGGGGC GGGGGGATCC GGAACATCCG CGATTGCGCA TCGCATCGTC CGCGGCCGAT CCGCTCCGGA CGGTACCCGA CGACGCCAAC GGCGAAGCTT ACAACGCCGT GGCGTTGGAT ATTCTCCAAG TTCTCAATTG TCAACAACTT TTGAATACCA CGAGTTCTCC GAACGACGAA GTCGTGGACG ATGCCTCGTG GCGCCGGCGT CTCGAACAAG TCACCACGGA TTCGCGAGAC GAAGAAGAAG AAGAACGCAT CGAGTCCGAG CTGGAGGACC AAGACGACGA CTGGCACGGT GGGTTGGTGT CCAATCCCAC CGGGATGCAT CTCTTTTGTT TGGCAGCCTT GGACGAGTCC GAGTCCAACG CCGCTATTGC GCAAGTCTGG AAGTCCAAGA TTCACTGTGA CGCCACGCAC ACCAAACAAA GGGCGTTACT AGATCTATGG TCCACCGCAC GCACCGAACT CAGCAAGATC GTCCTGGAAC AAACCCTCCG CTTGGTGGAA GAAACCGAAC GCGACCTTTT GGGGACCAAC CTACACCTCT GGGCGCCCGC ACGCGACTCC GGCATCGACT ACACTCTACG GATCCTCAAC GACCCGGACG CCGGATCGGC CGACCGGGGT GGGGTGCTGG GTCTCGACGA AAATCTCGGC GTCGACAAGC TCTGGGTCGA CGTTGGTTCC GGATTGGGAC TCACGTCCAT GGCCATTGCC CTACTCTACC CCGGCACGCA CATTGTTACC GTGGAAGCCG CCAAACCCAA CTGGCTCCTC CAAAACATGA ATTGGGAGTG CAACGACTTT CCCCATCAGA GTCGGCGGGA TGTCTTGTTG GCCGGCGTCG GGCCTAGTAC GCACACTTCG CTCCTCGCCA AGTTCATCTG GCGACCCACC GCCACCACTT CTACGCGTTC CTGGACCCCT AGCTCGGAAC GAACCCCCGA CGACCTCGAA CTCGCCGTCA AACTCCGCCC CTGGACTCAA ATCCTTCGCG AAGCCGATAT ACCCCGCGAC AAAATCGACG TCCTCAACGT GGACTGCGAA GGTTGCGAGT ACAATCTCAT CCCCTCCCTG GACGACGCCG ACCTCGATAA CATTTCCACC ATTATGGGTA ACGTACACTG GGGCTACATT CCGCTCGCCA AAAAGCCTTC CTCGAGTCGG GCCCGACTCA CCCACCAACG CCTCTGCGTA CACGAAAACT TTGCCGCTCT CGCCAAGGAA TGTTGCGCCT TTCCCGATCT ACCCGTCGTG TCCTCCGTAG CGGGAGAAAT ACTCGTACAC GAACGAGACA ATGCCGACCA GGCCTCCCCA GTTTTCCCCG CCAAAGCCTC CACCGTGGTC GACGTGGCCG GTCCCCTCTG CGACGGGTTC GACGATTGGG CCCGTGAAAA TGACCTCGAC ACGGTGGAAT CTGATTGGGG ATGGTTCCAG ATTACCAGCA TGGCTGAGGA AGATTAA
|
Protein sequence | MVSVTRKRIQ GNHRRGLRRP GSDASRETLG ALGLAFLSVV FIAYAVLGTT DETTPVTPVH ATPDTVPGER AVAWGRGDPE HPRLRIASSA ADPLRTVPDD ANGEAYNAVA LDILQVLNCQ QLLNTTSSPN DEVVDDASWR RRLEQVTTDS RDEEEEERIE SELEDQDDDW HGGLVSNPTG MHLFCLAALD ESESNAAIAQ VWKSKIHCDA THTKQRALLD LWSTARTELS KIVLEQTLRL VEETERDLLG TNLHLWAPAR DSGIDYTLRI LNDPDAGSAD RGGVLGLDEN LGVDKLWVDV GSGLGLTSMA IALLYPGTHI VTVEAAKPNW LLQNMNWECN DFPHQSRRDV LLAGVGPSTH TSLLAKFIWR PTATTSTRSW TPSSERTPDD LELAVKLRPW TQILREADIP RDKIDVLNVD CEGCEYNLIP SLDDADLDNI STIMGNVHWG YIPLAKKPSS SRARLTHQRL CVHENFAALA KECCAFPDLP VVSSVAGEIL VHERDNADQA SPVFPAKAST VVDVAGPLCD GFDDWAREND LDTVESDWGW FQITSMAEED
|
| |