Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_31764 |
Symbol | |
ID | 7196114 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 870442 |
End bp | 872170 |
Gene Length | 1729 bp |
Protein Length | 548 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176671 |
Protein GI | 219109836 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACCGC CGGTTCCTAC TCGGCATCGA AAGGTGTGTC TTGAGCCGAG AAAATACGAA AATACAGGCG AACGTTTCTG TGCGGCTGGG GGAGAGTATT ATTCTCCTCC ACACACGACA CGAATGGACT GCATCGGACC GGTGCAGCCT CCGGGATATC GCACGTATCG CGATTCATCG CCGTATCCGT TTCCCCATCC AACTCATTCC ACACCATCTG GTTACTACAA CACAACCCCG ACACTGGTAT CTAGTTGTCA CACGCGCACA CATACACATA TACAAACACA TACCCCTGAA CTACGCGACG CAGCGTGGAG TGAAGTCGTG AGGTCGCAAT CGGTGCAATC GAATGAGAGA ATGCCCCCAC CGCAGCATCT TCCGTACAGT GGCAGAGAGT ACGGTATCTT GCCCCCACGG ATGCCTCCGC GCACCTCTTC CAGCCCGTGT CGTGCCGGAG TGCCTACGTA CCAGTCGACT CCGGTCAGCA CCATGCCCGA GCACTACCGG GGCGAATATC ACGGGTTCAC AGCCGAAGAA CGATCGGGGT ATTACACAGG CTACCGGCGA GCGTCAAATG CGTTGAATTG CCCTCCCACG AAAACAGTGG TAGCGTCCAC CAGCAAAGTA ACGGATGATG GGGCCTCGTT CACGTCGGAA TCGTGGGATG AGCAGCATCA GCTTTTCGTA AGGGCGTCCA TGAAAAAGAG AAGAGCTCCG TCTACTACAT CCTTCCCGTC CAAATTACAC AAAATTATAT CCAACCCTCT CCACCGAGAA TTTATAGACT GGCTGCCCCA TGGTAGAGCA TGGAGAATTT TGAAGCCAAA GATGTTTGAG AAGGATGTGA TTCCTAAGTT CTTTCGTTCG GAGCGATACG CATCCTTTAT GCGACAGGTA AGTCGATGGG GGGTCACATC GGTATCTTCC GGACCTGCAA ACACTGGCTC ATACTGATAT TCTTCCCTTC CTTTATCAGG TAAACGGCTG GGGTTTCAAA CGTATCACGG AGGGTCCGGA TCTTAACTCC TACTACCACG AGCTATTCCT ACGTGGGCTC CCTGATATCT GCCTTAAAAT GCAGCGTGTC ACCTGCAAAG CGAAACCCAC TGACGGTGCG GAGTTTGGCG AGTGTCCAGA CTTTTACAAA ATCAGTATGT TTGCTCCGCT ACCTGACCCT GACCTGCAAG ACGAAGAAAC AGCAGCAGCA ACCCCGAAGA CAATTGTCAC CAAAGATTCT TCGACCAAGC TATACAAGCG ACCAAGCTCT CCATCATCAC TAAGTACTAT GAGCGGATCT GCCGGGTTGC ACGATGACAT GGCCATGACG AACGCTATGG AGCCTCTTAC ACCAGTTCGT TCCGTGCATT CCCCCGCTCC AAATACGTCT CCTCTACCTT TCCATGCTTC TTTGGCCAAT TCTCCGTCGC TTGGATATAA TAGTAGCAAC GTTAGCTTGA GTTCGTTCGG GGCAACGCAC GAAGAGCTGA TGTGGGGCAC TGGACCTTTT TCATCCTTGC ATCATCGTGC CGAATCTCTC GGGGCTAGCG GTGGCCATGT GACACCTCCG TCAAGTAGAC AATTCTATCA GTCAACGCGG TCTGAATCAC ACCACGAAGA TACGAGCGGC CTTTCCGCAG CAGACTTATG CTATTTGACC CATCAGAACC GGGTTCTTTT ACACCAAGCA AAAGGTTTCC GTAACGACAA TGAGTATCAG GGAGTTTGA
|
Protein sequence | MAPPVPTRHR KVCLEPRKYE NTGERFCAAG GEYYSPPHTT RMDCIGPVQP PGYRTYRDSS PYPFPHPTHS TPSGYYNTTP TLVSSCHTRT HTHIQTHTPE LRDAAWSEVV RSQSVQSNER MPPPQHLPYS GREYGILPPR MPPRTSSSPC RAGVPTYQST PVSTMPEHYR GEYHGFTAEE RSGYYTGYRR ASNALNCPPT KTVVASTSKV TDDGASFTSE SWDEQHQLFV RASMKKRRAP STTSFPSKLH KIISNPLHRE FIDWLPHGRA WRILKPKMFE KDVIPKFFRS ERYASFMRQV NGWGFKRITE GPDLNSYYHE LFLRGLPDIC LKMQRVTCKA KPTDGAEFGE CPDFYKISMF APLPDPDLQD EETAAATPKT IVTKDSSTKL YKRPSSPSSL STMSGSAGLH DDMAMTNAME PLTPVRSVHS PAPNTSPLPF HASLANSPSL GYNSSNVSLS SFGATHEELM WGTGPFSSLH HRAESLGASG GHVTPPSSRQ FYQSTRSESH HEDTSGLSAA DLCYLTHQNR VLLHQAKGFR NDNEYQGV
|
| |