Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49902 |
Symbol | |
ID | 7198527 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011693 |
Strand | - |
Start bp | 247975 |
End bp | 249836 |
Gene Length | 1862 bp |
Protein Length | 593 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002184758 |
Protein GI | 219129148 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGGATG TTTTGGTTCC GCAGCCGGTA GATATTGACA TTGGTGTATC CAACATCGTC GACGACGACG AAGAAGGGCA ACAACATTCC GGGACGCATT CCTGCGGCAG CTGCGTGTTG GCCTGGAACG GGGAAGTCTT TCGCGTCGTC ACCCAAACGC CACTCGGAAC GGACACTGAC GACCTATCGG GGTCACACCA TTCCGACACT ACGCTCGTGG CCGACTGGAT ACGTCAGGAA TTGACGCAGA CTGTCTCTAT CTGCCGTGAA GGTACCACCA TACGCAGCGT ACAACTCCAA CAACAAATGG CTCTGGCACG CGTCCTGGAA CGTTTGGTGG ACGCTGATTT TGCCGTGACG CTCGTCACAC CACACTCCGT ATTCTACGCA CGTGATCGTT TCGGCAAACG TTCCCTCCTA GTACAAGACG GTCTATCGAC AGTCACGGCT GCCGGGACCG CTACCGAGAC CAGTCGTTCC TGGAAATTGT CCTCCGTCAC TGACGGGACG GAATCGGCAG TGTGGACCGA AGTCGCACCG GAAATGGTAC ATTGCTACTG CGTGGCTACG AAACAGTGTC TGCCCCCGCT GTCCTACCAT CAACGCGAGA CTATGGATGT TGCACCGGCG CAAGTACTCC ACCAACCCTT GCAGGATGAG TACGACAAAG ACAATATCCG AAGAGCGTGG ACTACTCGGC GCATCAATGA CCAATCCTTC TCGTCGGAAC CTTGTACGGC TACATCTTTT GCTCCCTTAC CTTCACAGCC TACAAAGCTG GAGACAGCCG TCGCCACGTT GTACGTTTTA TTGCGCGACG CGGTACGCCG GCGTGTCACG GGTCCACGAG TGGCTGTCTT GTTTTCCGGT GGCTTGGATT CCGTCGTGCT GGCCGCCTTG GCTTTGGAAA TTCTGTTGGA ACGCTATGAA CACACCCACG AACTCGTTTT GTGCAATGTT TCGTTTGTAG AAGATGCCGC GCCGGGCGCT ACAGACTTTG CACGGACGGA TGCCTCAGCG CTGCCGAGCC GTACCGACGC ACCAATACCA CCGCAAGCAG CCGATACTCG CGCAGCGATG GTGTCGTACC GGGAGCTGGA ACGCCTGTTT CCCCAAGCGC GTATCTGCTT TTTGGCCAAA CAGGCGACGT GGGGGGACAT TGTCCGCAAC GAAGCACACG TACGTCAATT GGTGCATCCC CAGACGACGA CCATGGATTT GAACATTGGA ATGGCTCTAT GGTTTGCGTC TTTGCAATCC ACGGATAGCA AGTCCCAGCA CGTTTCCGCG AAAAAAGTAC CGTTGGGTGA CGACTGTCGT GTGTTGCTAT CTGGTTTGGG TGCCGACGAA CTCATGGGTG GCTACGGTCG TCACCGCCAG GCTTGGAAGG ATGGTGGCAA CGAGCAACTA CGTCGGGAGC TGGATTTGGA TTTGACCCGA CTGTGGTACC GCAATTTGGG ACGCGATGAT CGAGTGTTGA GTGATACCGG CCGAGAAGCG CGCTTCCCGT ATTTGGATAC GGCCGTTGTG CAGTTCCTGT CGAGGTTGGA CTTGGACGTG GTGTGTGACT TTAAGCGGCC ACCGGGAGAA GGAGATAAAC GCATTCTACG GGTTCTGGCG GCGCAGATGT TGGGCCTGGA GGCGGCGAGT ACGGCCGTCA AGCGAGCGAT TCAATTCGGG AGCCGGATTG CGCACGTGAG TGACAAACGA CGATTCGGTT CGCGGCGGCG AGCCTCCGGA ACGGCCCGCG CCATCCACCA GACCGCGGTT ACCGATTTCT GAGCACCAGA TATTTCCCGA GACTTGTGTA CAAAGTTGTT GACTGTTGAT GGGAGTACCG TAATCACCTT ACTACTATTC TT
|
Protein sequence | MRDVLVPQPV DIDIGVSNIV DDDEEGQQHS GTHSCGSCVL AWNGEVFRVV TQTPLGTDTD DLSGSHHSDT TLVADWIRQE LTQTVSICRE GTTIRSVQLQ QQMALARVLE RLVDADFAVT LVTPHSVFYA RDRFGKRSLL VQDGLSTVTA AGTATETSRS WKLSSVTDGT ESAVWTEVAP EMVHCYCVAT KQCLPPLSYH QRETMDVAPA QVLHQPLQDE YDKDNIRRAW TTRRINDQSF SSEPCTATSF APLPSQPTKL ETAVATLYVL LRDAVRRRVT GPRVAVLFSG GLDSVVLAAL ALEILLERYE HTHELVLCNV SFVEDAAPGA TDFARTDASA LPSRTDAPIP PQAADTRAAM VSYRELERLF PQARICFLAK QATWGDIVRN EAHVRQLVHP QTTTMDLNIG MALWFASLQS TDSKSQHVSA KKVPLGDDCR VLLSGLGADE LMGGYGRHRQ AWKDGGNEQL RRELDLDLTR LWYRNLGRDD RVLSDTGREA RFPYLDTAVV QFLSRLDLDV VCDFKRPPGE GDKRILRVLA AQMLGLEAAS TAVKRAIQFG SRIAHVSDKR RFGSRRRASG TARAIHQTAV TDF
|
| |