Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_37749 |
Symbol | |
ID | 7202290 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011681 |
Strand | - |
Start bp | 861848 |
End bp | 863587 |
Gene Length | 1740 bp |
Protein Length | 579 aa |
Translation table | |
GC content | 53% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002181819 |
Protein GI | 219122993 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCCGA CACGGACGAG TATCCTGCTG TGTGTCGTGA TGTTGTTGTC GAATGCACGT GGGGAACACA GAACCGCCTG GCAGGCTCTG CGTAGCAGGT CTGTGCGTCC GCAGCCCGTG CCATTGCTTG CGCCACCACC GTCGGCACCA TTCCGATTCT CGTCTGGCGC AAAGTCGAGT ACGATTCTCT GTGCTAAGAA ACCAAACACA AAAGCCAAGC CGTCCGTTGC ACCTCTCCAA TCGTTGTCGC GGAAAAATCG CATTCAAAGC GTATTGGATT GGGCACAAAG AGCGGACGTT CAAGTAAGCA AGGAAATAGC GTTGGATTCT CGAGTGGCCG AGTACGGGCT CGGCTGGTAC GCCTCCACCA ATATTCCCAC CAATCAAGTT TTGCTGAGTG TGCCCTCCAA TCGAGCCTTG ACAGTGGAAA TTCCCGGTGA GGGACCGGAC GATCGCTCCG TTCTGGACTT GGTGGCGAGC TCGGACAGTG GCAGCAAGAC AGAGGTACGG GCCTTGCCCT GGTTTGTGCA AATGAGTCTG TACATCTATA AATTGGACCA AGTCGATGCG GACAAAGAAG GTGTTGATAT GCGCCCCTGG TTGGATTCGC TACCGAGGTC TTTTGATACC GTCATACATT GGTCCGAGGC AAATCGGCAA GAGTTACAGT ACGATTCTAT GGTAACTGCC GTGGCCAGTC AAGAACAAGA TTGGAAACGG TACTACCAAT CGCTCTTGCA AGCTGGAGCC TCATCGTCGT CCTTGACATG GGAGCAGTTC CTGTGGGGTT GTGAGATTGC TCGATCACGA GCCTTCTCCG GAGGATTTAC AGGATCCGCC TTCAATCCAG GAGTATACGC CTTTACGCTC TTGCTCGTCA CAATCTATGT GGGTCTGGGT GTGGGTAGCC TCGAACAAGC AGCCAACGGA GCTGGTGTGG TCTTTTCCGC AAGTATACTC AAGGACTTTG TGTTGCCCAA ACTCTTCAAA AAGAGGCGAT ACGTAATTTG TCCCATGATT GATATGGCCA ACCACCAGTC GGTTAAATTT GCTGGCCAAG TCTCCTTTGA GTACTTTGCT AATGCTTACA GTTTAGCCAC GGATCAAGCT ATTCCGTCCG GTGACGAAGT TTACATTTCC TACGGGCCGC GATCCAACGA TCAGCTATTG CAGTACTACG GATTTGTTGA GCGCAACAAT CCAAACGATG TGTATGTCAT GCCACCTCTA CGAGAGTGGG ATATTGAAGC CTTGGAACGG GCCACGGATC GCAAGTTTGC GGTGGGACGG TTGGAAAAAC TCAATCGTGC CGGATTGTTG GGGAGTGCAA CGACGGTACT TTCAGACAAA AAGTACGACG AGACGGAGGT TGCCAACGCC AATGGGGGCG TTGTGATAAC GCGCGTGTTG GGCCTAGACC CGGCCATTCT TCAAGCCTTG CGAGCACTCG TGTCGACAGA GGACGAATGG AATGCCGCGG GCCAAGCAGT CGGCAGTTTT GCGGAAGAAG GGTCGGGCGG AGCCGCCAAC GAGGCAGCCG CTCGGCTAGC GGCGCGAACG GCGGTCGGAA TGGAGCTCCA ATCAAAAGAG ACCACCCTGC AAGAAGATGA AGCCCTACTC CAACGAATGG ACACTGTGAA AAGTATGGAT GCTAGCAGGG AAGAGAAATT GGCGGTCCAA TTTCGGATCG AAAAGAAAAA GTTGTTGTCC GAAACGCTGG ACAAGTTGTC AGTAAGGTAA
|
Protein sequence | MWPTRTSILL CVVMLLSNAR GEHRTAWQAL RSRSVRPQPV PLLAPPPSAP FRFSSGAKSS TILCAKKPNT KAKPSVAPLQ SLSRKNRIQS VLDWAQRADV QVSKEIALDS RVAEYGLGWY ASTNIPTNQV LLSVPSNRAL TVEIPGEGPD DRSVLDLVAS SDSGSKTEVR ALPWFVQMSL YIYKLDQVDA DKEGVDMRPW LDSLPRSFDT VIHWSEANRQ ELQYDSMVTA VASQEQDWKR YYQSLLQAGA SSSSLTWEQF LWGCEIARSR AFSGGFTGSA FNPGVYAFTL LLVTIYVGLG VGSLEQAANG AGVVFSASIL KDFVLPKLFK KRRYVICPMI DMANHQSVKF AGQVSFEYFA NAYSLATDQA IPSGDEVYIS YGPRSNDQLL QYYGFVERNN PNDVYVMPPL REWDIEALER ATDRKFAVGR LEKLNRAGLL GSATTVLSDK KYDETEVANA NGGVVITRVL GLDPAILQAL RALVSTEDEW NAAGQAVGSF AEEGSGGAAN EAAARLAART AVGMELQSKE TTLQEDEALL QRMDTVKSMD ASREEKLAVQ FRIEKKKLLS ETLDKLSVR
|
| |