Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATR_44088 |
Symbol | |
ID | 7204027 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011671 |
Strand | - |
Start bp | 930302 |
End bp | 932392 |
Gene Length | 2091 bp |
Protein Length | 583 aa |
Translation table | |
GC content | 48% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002186436 |
Protein GI | 219113705 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGATTC CAACATACGG TGCAAATCAG GGAGCCGAAA TCGATGCTTA CATTTCGCCA TACGCCAAGG TACGTTCGCT AGGAGCGAGT AACCGAATTT TGTAGCCGGT GTCTTGCACC GCATACCCTC GAAAATCTTA ACCTCTTCTC TTGATCAAGG AGGAACCAAT GTCTCTTGAT CCTCCGAAAA TAGCAGTACA AGGTCAAGAA GTTGAGGATG CGCAATCGAT ATCAATTGGT TCAACCTTGC ACCGGTCATC CTCGGATTTC TGTAAGAGTC AACCGATTGA CGAGCCGAGT CTTCGCCGCG TTAGTAGTCG CCGTAAAGCA GAGCATCCGT TGAAAACAAG CAAATCCCCG TCGAGGCCGC CATCGGCTCA AAAGTTGCGC AAAAACTTCG TCGGCGTTTT GGAGTCTTTC GTCAGACACG TAGCATCGGT AGAGCCGAAT GTGGTTTCAT GGACTGGCGA TGGTTCCGGT TTCTTCTTGA ATGAGTTAGA GGATCCCCAA GCTTTGAACG TTGCCATTTC CAAGTTCTTT TGGTGTACGT AGGTAGCAAC ACATGTTTGC AGTCACACAG ATGCTGATCT ACATCTCACT AATTCATTTT GGACTTTCAG ATGGACGCTA CCCCTCGCTT CGCCGCCAGC TTAATGTCTA TGGATTCAAA AAATATAAAA GGAGTCACAG GTACGATTTG TGCGTTTTCG TTCTCGAGCT GGACATCCCA TCAGATGACA GGCCCTCATA TACCATTGTA CTCTGTTTCA GATATAAGGG AGCATTTCAT CACCCGTCAA TACACCGCAA CATGACCAAT TGGCAATCCC TAATGCTAAA ACCCATTGTC CGTCCAAGTA GAGCTTCTAA ATCGAAGAAT GCGCCATTTC TTCGGCACGG TAAACCGGTA GATGAAAAGC CATCACCAGC ATCACCACCA AATCGCATGG GAGAAAAAGC GGCTATATCC TCCCTGACGG GAAAGTGTGC ACTCGAGAAG CTCAATATTC CAAGCTCCGA TGGTTCTTAC GCTGGGCTTC CGGTTGTATC AGGACGGACC GACCGGGCAG CCCTCACGTC AGCCTCATCA GGAACGGAAG CCGCTTTGGA AGAGTACGGT ATGTACGACA AATATCACAA CTTGATTTCA GGAGGCAATG AAGGTGCAGA TGTGGGGGAA ACTGAAGAGA CTTCCTTTTC AATGGAATAT ACCACACCAT CAGAAATAAA GCCACAAATA GGATTGTTTG ATCCTGTTTC GTATCTCGAA GGGGGCGGGT TTTCCGAGAA GCGAGACCTC CGGACTCTTG CCGATGCTGC GGAGCTAGCG GCGGCTAAGG GCTGGAATAC CAATTTGACA AAGTATTCGC CTGTGATCCG AGCTGACTGT AAAGAGAACG ACACTCCTAT TCACGCAATA TATACAAGAG ACCTGGGCTC ATTGTGGGAC AAGAGTCCCG CTGCCGTGCG AATGCAGTTT GATTCAACGA CGCCCTTGGT TTCAGCCTCG CCTATTGATC CTGGAGTCAG GAAAAATATT GGCAAGTATC CGCATCCTGG TATGCCCGAC TACTCGCCTT CGTTCATAAC CTCAGTCTTG GAACGATCTG GAGCTACATT GTGGACACCG AATTTTGATG ACAAGGAAAT CGACATTCCG TCGCCTCCAA TGGAACAGTG GTTCAGTCCC CCTAAATTTC CATCTCTTCC CATGCACAAA TCATTCAGTC CGGGGAAAAA CCTGGATCTC GATACACTCG AAACTTACGT CACAAGGATT GACATTGAGT CGAAAAAACG CAAAAGAAGC ACTGTGCTTT GTTCTCCCCC GTATCTGTCT CCTGATTCCC CTACAATGAC TTTTGAAGAC TTTTGTAGTG TGTCTAGCGA AATTTCATTC TCAACGGAAA GATTGAGCGG CGAGGCTGAA ATCTGTACGG CATCTAGCTA GAGAGCTTCC AACCTTGGTG TTGGGTGTTC TTCTTTCAGC ATGCTTTATA TTTGTGTTTG CCAGATGGTG CCAACATCCA AAATGTTGCC TGTCAAGGAC GCGTTCCGCT AAACTTTGAG ATACAATCTT TCAGTCAGCC C
|
Protein sequence | MSIPTYGANQ GAEIDAYISP YAKEEPMSLD PPKIAVQGQE VEDAQSISIG STLHRSSSDF CKSQPIDEPS LRRVSSRRKA EHPLKTSKSP SRPPSAQKLR KNFVGVLESF VRHVASVEPN VVSWTGDGSG FFLNELEDPQ ALNVAISKFF WFTQMLIYIS LIHFGLSDGR YPSLRRQLNV YGFKKYKRSH RYKGAFHHPS IHRNMTNWQS LMLKPIVRPS RASKSKNAPF LRHGKPVDEK PSPASPPNRM GEKAAISSLT GKCALEKLNI PSSDGSYAGL PVVSGRTDRA ALTSASSGTE AALEEYGMYD KYHNLISGGN EGADVGETEE TSFSMEYTTP SEIKPQIGLF DPVSYLEGGG FSEKRDLRTL ADAAELAAAK GWNTNLTKYS PVIRADCKEN DTPIHAIYTR DLGSLWDKSP AAVRMQFDST TPLVSASPID PGVRKNIGKY PHPGMPDYSP SFITSVLERS GATLWTPNFD DKEIDIPSPP MEQWFSPPKF PSLPMHKSFS PGKNLDLDTL ETYVTRIDIE SKKRKRSTVL CSPPYLSPDS PTMTFEDFCS VSSEISFSTE RLSGEAEICT ASS
|
| |