Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_55138 |
Symbol | |
ID | 7198762 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011694 |
Strand | - |
Start bp | 365729 |
End bp | 367502 |
Gene Length | 1774 bp |
Protein Length | 489 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | cell surface protein |
Protein accession | XP_002184948 |
Protein GI | 219129547 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.177713 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAAAAGGCTG ACACAAAAAT TCAAGTCGTC TCCAGCAATC AAGCACATTT CAACCAACAA AGCAAGCCAC TGCTGTACCC ACACCATTTT ACCGGGCGGA TATGATGAAA CCTTTTAACA ATCTTGTCGC CTTTGTCGTT CTATCGACGT TGGGAAATGC GACCGCCTTT GAGCAACTGT CATTCTCTTC GCCTGCTGCT TCTCAGCTTT CCGGCTCGGA AATCATCATA CGTGATCTCG CCGACTTTGA TTGGGAAATC ACGCCTCTTG ATGGATATCC CATCATTGCC TTCAACACTA CTGAAGGCAA CGACGAGCTT GTGTTTCGCT ACAACTATAC CGGCGACTTG GTCGAGGGCG AGAAGCAGCT GACCGCCAGA CTCTTGCTAC CAGACTGCAT CAACGACGGA GACAACTCTG TTGTCCGCAG TCCTCCCAGT GCACTCAGCA ACAACGAATA TGAAGTTGGT GTTGATATTG TCAAAGACAC TATCAGTGGC TCCTCCTTTT ACAGCTCCCT GAATGATACT GCCGCATCCA TATCCTTTTG CCTTCGCGTC GACTACGAGC TTGTTGCAGG CAATGACCAG GAGTCCGTCA ACTTTCATGA AACAGTCGTG ACTGTCACTG TTGATTTGAC GGCTGACTTT CGGCTTTCGG CCATCGACAT TACCCGGGAG GGGGCCGCCA ACGCCACTGA GACGGCGTCG CTTGACTACC CGGTGAATGC CTACTTCTGC AACGACGGCG ATGTCCAGGT GCCCGGCCAG CTTTTGTCGC AAGGAAGCGC TCTCCAGTTC TGCGTGAAAA TCGACGACAG CGTTCTTGAC GCCGTCTTCG TCGAGGACAT TTTGGAGGCC ACCATATCGC AGCCCAACGC TAACGGCCGT GTCGGGGCCA GTGCATCGAA CATTATTACC GGAACCGTCG CCGACTCCTT GACGGAAAAG GCGTGTCGAG AACTGGGTAT CTGTAACGTC AAATCCCAAC TTCTTTCCAA ATTCTTTGCC GACGCGGATC CTCTCGACCT TGAGGTAACT GGAACCGCCC TGTTGTCGTT TGGATCGGCC TCAATGATGC CCAGCTCCTC GCCTGTGGCC GATACCCGAC GCCGCCTACG TGTCCCCATT ACCGCCACCA TCAACTCGAA GGAGTTGAAA GCCATAATTG ACGCTCAGGC GGGGGAGGCC ACTGGTAGCC ACTTCGCCCG AAGCAACAAT GTAGCGAGCC TCGGTCGCGG ACTTCAAGAC GACAGTGCAC AGTCGGAGTT TGGCCTCATC GTTGGTCTTG TCAACGAAGA CGCGGGACCT TCTGACACTA TCGAAAATGA TGGCGGCATT GATGTGATTG TGATTGTCGG TATTGTGCTT GTCATTTCAC TGCTGTCTGG GTGCTGCCTT TTCTTTTTCT GCTTCATGAA GAAGCGCAAG GACAAGGAAG AGAAGGTGGA TGAGAAAACC GTGATCCAGT ATGACATGGA GGAGAACAAG CGTGACGATC AGAAACATCT TTCGTCCAAT CGAGGAAGTA CTGGTGCTTC GCGCGGTACA CCCTACCGTC CCGGAAACTA AACGATACCG TCCTATTCCA GCCGCGGCTT ACATCAACGA CTTTGGCAAA TGAGATGAAG AACGAAGCAA GAAAGCAACG ACCCCGTACA AAGTTGCAAC TCATTTTAGA ATATAAACAA AAAGAAAAAG TTCACGGAAA CAGTACCTGG TAGATAGAGA ATCGTCCATT TCGTCGCAGA GCATATCCGG ATTGTTGTAA CCCA
|
Protein sequence | MMKPFNNLVA FVVLSTLGNA TAFEQLSFSS PAASQLSGSE IIIRDLADFD WEITPLDGYP IIAFNTTEGN DELVFRYNYT GDLVEGEKQL TARLLLPDCI NDGDNSVVRS PPSALSNNEY EVGVDIVKDT ISGSSFYSSL NDTAASISFC LRVDYELVAG NDQESVNFHE TVVTVTVDLT ADFRLSAIDI TREGAANATE TASLDYPVNA YFCNDGDVQV PGQLLSQGSA LQFCVKIDDS VLDAVFVEDI LEATISQPNA NGRVGASASN IITGTVADSL TEKACRELGI CNVKSQLLSK FFADADPLDL EVTGTALLSF GSASMMPSSS PVADTRRRLR VPITATINSK ELKAIIDAQA GEATGSHFAR SNNVASLGRG LQDDSAQSEF GLIVGLVNED AGPSDTIEND GGIDVIVIVG IVLVISLLSG CCLFFFCFMK KRKDKEEKVD EKTVIQYDME ENKRDDQKHL SSNRGSTGAS RGTPYRPGN
|
| |