Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_29702 |
Symbol | |
ID | 7194881 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011686 |
Strand | + |
Start bp | 383713 |
End bp | 386719 |
Gene Length | 3007 bp |
Protein Length | 878 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183086 |
Protein GI | 219125646 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAGTATCTCG AGCGCTATTG ACATTGATAG CGCTCAGAAA TACCAACTAC AGACTGAAGT TTACTGAAGG TTTGAGTTAA AAGGACAGTA AGATGCTCCG CTTAATCCAA AGGACCGCTT TCTCCCGCTT GGTGTCGCGA AAGCCGTCGG CGTCACCCAC AGTTTCGTTA CGCACGGCAG TCCCTGTCGT CGGTACCACC ACGACAACAT TTCGTCGTGA ACTGCACTTA TCGCCCCGTG AAGCAGAGCA CTTGCAACTC CACCAAGTCG GTCGGCTGGC TCAATACCGT CTCGCTCGAG GCGTTCGTTT AAACTACGTG GAAGCCGTCG CGCTTATTAG CATGCAAATG ATGGAAAAGA TTCGGGACGG ACAGGATTCG GTTGCTGATC TGATGACGAT GGGACAGTCA CTCATCGGAC GGAATCAAGT AATGCCGGGT GTCGCTAAAA TGATTGGGCA AGTGCAAGTG GAAGCGACCT TTCTGGATGG GACAAAGCTG TTGACGATAC ACAACCCTGT CTCGGCGCAA GATGGAAATC TCGAACGGGC TCTGGACGGA TCCTTCCTAC CCGTTCCCAA TCTCAGCATC TTTACCAAGG GAACCGACGA GGAAGAAAAT CTTGTTCCCG GTCAAGTCAT GACGCAAGGG GATCCCATTA CCATCAACGC CAATCGTGAG TTGATTGAAC TGTCCGTCAC CAACACGGGC GATCGCCCTA TTCAGGTTGG CTCGCACTAT GCCTTTGTTG AAACCAACAA GGCGTTGTCC TTCGACCGCT CCGCGTCCAT CGGTAAGCGC TTGAACGTAC CGTCGGGAGC GTCTGTACGT TTCGAACCAG GCGAGCGCAA AACGGTCACC CTCTGCGCGC TCGGTGGAAT CCAACGGGTC GTTTCTGGCA ATCGACTCAC AGACGGGGAT GCCCGAGACC CTGCCCGACA CGCCGCTATT CTCGAACGCG TCACGTCCCA AGGATTTCAG CACGAACCCG TCGACCCGGC GGATATACCC AAGGGGCGTG CGTACGTCAT GGAGCGATCG TCCTACGCCG ACATGTACGG CCCTACAGTC GGGGATAGAA TCGCGCTCGG CGACACTGGC CTGGTCGTTC GTGTCGAACG GGACTATACC GTCTACGGCG ACGAATGCAA ATTCGGCGGA GGCAAGACAT TACGGGAAGG AATGGGACAG GCAACGGGAC CAACATCCGA CGATGCATTG GATGTGGTTA TTACCAATGC TTTGATTATC GATCCCTGTA TTGGTATCGT TAAGGCCGAT GTGGGCATAA AGGGTACTTC TATAGTGGGT ATAGGCAAGG CCGGCAATCC CGACATGATG GACGGAGTGA CGCCCAATAT GATCGTGGGA AACACCACAG ATGTCATTGC CGGCGAAAAG CTAATTTTGA CAGCCGGTGG CATCGATACA CACGTTCATT ACATTTGCCC CCAACAGATT GAGGAAGCGA TTTCGAGTGG GGTGACGACC ATGTTTGGAG GGGGCACTGG ACCGGTACGT TGCTAAAATT TGTCCAGCTC ATTATCCGAT CTTCCGCAAG GTAGGATACT GACACTTGTT TCTCTCCTAC TTTGGAATTT GCATGTTAAT ACGTTTCAAT TCCATGTTGT TGACAGTCTG CCGGATCGAA TGCTACAACC TGCACTCCGG CTCCGAGTCA AGTTGAAATA ATGCTCAAAG CGACCGATAA ATACCCTTTG AATTTTGGAT TTTCCGGCAA GGGGAACACG AGCGATACAA AAGCTTTAGA GAACGTACTC AAGGCTGGCG CGGCAGGGTT CAAACTTCAC GAAGATTGGG GCACCACTCC GAGTTCTATT GACGCCGCCT TAGACTTTGC CGACGAGCAC GATGTGGCGA TCACAATCCA TTCCGATACA CTCAACGAGT CCGGCTTTGT GGATGATTCC ATCGCAGCCA TGAAAGGCCG CACTATCCAT ACGTATCATA CTGAAGGGGC CGGTGGTGGT CACGCTCCGG ACATTATCAA AATTGTCGGC GAAAACCACG TGTTGCCGAG TTCCACGAAT CCGACGCGTC CGTTTACTGT GAACACGATT GACGAACATC TTGACATGCT CATGGTATGC CATCACCTCG ACAGTAGCAT TCCGGAAGAT GTGGCGTTTG CGGAATCTCG CATTCGTGCC GAAACAATCG CTGCCGAAGA CATTTTACAC GATACCGGTG CAATCAGTAT GATCTCGTCC GATAGTCAAG CCATGGGCCG AGTCGGCGAG GTCATTACTC GCACCTGGCA AACAGCCGAC AAGATGAAAG CTCAGCGTGG CGCGCTACCA GAAGATTCTG CTGGCGACGA CAATGTACGC GTCAAGCGAT ACATAGCGAA GTATACAATT AATCCAGCGA TAACCCATGG GATGAGTCAT ATGATCGGCT CCATCGAAGT GGGTAAGATG GCTGATCTAG TCTTGTGGAA GCCCTGTATG TTTGGTGCCA AACCGGAAAT GATCGTCAAG GGCGGAACTA TCGCGTACGC CCAAATGGGG GATCCCAATG CCTCTATACC AACGCCGCAG CCCGTCAAGA TGCGCCCCAT GTTCGGTAAC ACATCAGCCG GTATGAATTC GGTTGTTTTT GTATCACAGG CCGCCATTCA TGCTGACACT GCCGGCAAAT TGGGTTTGCA GAAAGCCGCA GCGGGCGTCG TGCGGTGCCG GGCGGTAACG AAGAAAGACA TGGTCTGGAA CGATCATACA CCAAACATCA CTGTAAATCC CGAAACCTTC GAGGTGGTAG TAGACGGGGA ATTGCTGCGC TGTGATCCGA TTGACAAGGT TTCTTTGGGA CAACGTTTTT TCCTTTTTTA AAGCAAGGCA AATGTGGCTC TGGAGAAGGA CGGATCAGCC TACAAAAATC GAACTTACAA TTTAACGCGC CGCCGCAATC ATGCTCCGGT TCTGCATCAA CACAATACAC TTACGAGTTC CAACTAACCA TAAGAAATAC CTTATTAATG TAACACA
|
Protein sequence | MLRLIQRTAF SRLVSRKPSA SPTVSLRTAV PVVGTTTTTF RRELHLSPRE AEHLQLHQVG RLAQYRLARG VRLNYVEAVA LISMQMMEKI RDGQDSVADL MTMGQSLIGR NQVMPGVAKM IGQVQVEATF LDGTKLLTIH NPVSAQDGNL ERALDGSFLP VPNLSIFTKG TDEEENLVPG QVMTQGDPIT INANRELIEL SVTNTGDRPI QVGSHYAFVE TNKALSFDRS ASIGKRLNVP SGASVRFEPG ERKTVTLCAL GGIQRVVSGN RLTDGDARDP ARHAAILERV TSQGFQHEPV DPADIPKGRA YVMERSSYAD MYGPTVGDRI ALGDTGLVVR VERDYTVYGD ECKFGGGKTL REGMGQATGP TSDDALDVVI TNALIIDPCI GIVKADVGIK GTSIVGIGKA GNPDMMDGVT PNMIVGNTTD VIAGEKLILT AGGIDTHVHY ICPQQIEEAI SSGVTTMFGG GTGPSAGSNA TTCTPAPSQV EIMLKATDKY PLNFGFSGKG NTSDTKALEN VLKAGAAGFK LHEDWGTTPS SIDAALDFAD EHDVAITIHS DTLNESGFVD DSIAAMKGRT IHTYHTEGAG GGHAPDIIKI VGENHVLPSS TNPTRPFTVN TIDEHLDMLM VCHHLDSSIP EDVAFAESRI RAETIAAEDI LHDTGAISMI SSDSQAMGRV GEVITRTWQT ADKMKAQRGA LPEDSAGDDN VRVKRYIAKY TINPAITHGM SHMIGSIEVG KMADLVLWKP CMFGAKPEMI VKGGTIAYAQ MGDPNASIPT PQPVKMRPMF GNTSAGMNSV VFVSQAAIHA DTAGKLGLQK AAAGVVRCRA VTKKDMVWND HTPNITVNPE TFEVVVDGEL LRCDPIDKVS LGQRFFLF
|
| |