Gene Information |
Locus tag | PHATRDRAFT_43187 |
Symbol | |
ID | 7196563 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011669 |
Strand | + |
Start bp | 2265293 |
End bp | 2267272 |
Gene Length | 1980 bp |
Protein Length | 557 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002176946 |
Protein GI | 219110389 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
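The listed gene length can be checked against the start and end coordinates, and the GC content can be recomputed from the gene sequence given further down. The sketch below assumes the coordinates are 1-based and inclusive, which matches the values in this record; the function names `gene_length` and `gc_percent` are hypothetical helpers and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the record above (Start bp, End bp).
print(gene_length(2265293, 2267272))  # 1980
```

Passing the full gene sequence string to `gc_percent` gives a value to compare with the rounded GC content in the table.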
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.140423 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCGATACGCA TCGCATCATC TATACATTTC ACCACTACAC AAAGGACTTA CAGTTAACAA ACAAATGCAT TGAGTGGGCA TGTAACGAGC TCAAATTTTG TGGGGAAGCT TCTTTTCATT CTCCGACATG CATAGTTCCC TCTTCTTTTA CGCAGCTTTC TACTCTGTGT ACAGAGGCGT TGTCGATAAT GTGTCTGCCC TACAGTTAAT CAACCCGATT TCACTTGTTA GACGAAACGG TTCTTCGAGT TCAAGTAATA GGCGTTTCCT TAGTCGGTGT TCAAAAAGTT CCGAGAGGGA ACCTTGTAGA GTGGTTGTGG TTGGAGGAGG TTGGGCTGGT TTCACCGCAG CAGATACTTT GGCAAGAGCA TCTTCGAATG TATCAGTTAC CCTCTTGGAT GCGTCACAGA GGGGCCCCGG AGGACTAGCA GGTGGCTGGA GAACGCCCAA GACTGGCCGG CCCGTCGAAG CAGGAATTCA TGGCTTTTGG AGAGAGTATC GTAATACGTT TGCCATGATT GAAAACATTG GTTTAGACCT TGACGATGTC TTGACAAATT TCACACCGTC AATTTTGGTT TCGGAAAATG GACGCGTCGC CCTCGCACCC GTCCTCGGCA ATACAATGGA CAAGCCAAAT CATCTACAAG CCAACAACCT GGACTGGTCT AATCCTCGCA GCCTGTTGGA GCAAATAGCG CCTTTGTTGC CGTCGCCTTT AGATGTTGCT TTGTTGGCTG ATTTCAATCC CGACTCGGAA CTCTCCATTG CCGACAGAAT CAGCGGTATC GGGTTGCTTG GTGCATGGGC CGACTTTGTA CAAGAAGACC GCGACTCTTG GGAAAGATAC GACAAGATAT CAGCTGAAAA CTTGTTTCGC TCGATTGCAT CGATTTCACC AAACTTATAC CGCGACTTGG TCGCTCCTCT TCTTCATGTT TTACCAATGA CCCCAGGATA CGACTGCTCC GCAGCTGCAG CGCTATCCTG CTTCCACTTC TTTGCACTAC AATCTCGAGG TGCGTTTGAT GTACGCTGGT GCCGTGGGAG CATTTCGGAA CGCATCTTCA ATCCCTGGGT TGAGAAGCTG AGGGAAAGCG GCAATGTTTG CATTCAAGGC TCCGCTCGGG TGACATCCAT TGAACACTAT GGCGGAGAGT ACACGGTAAT GATCAACAAC AAGGTGTCCG TTACATGTGA TGCTGTTGTT CTTGCCGTTG GCGCCACAGC GGCAGGAAGA CTGATTGATT CGTGTCTACC ACTGCAGAAG ATTCCGAATC TAGCTTCAAA GTGGAAAGAA CTTCGTGGTG TGAGCTGTGT AGCGGTTCGT CTCTTCTTTG CAGAACTTCC GCCGAGTCTT GCCTCTGCTA TGAGCGACTC TCCTGTTGTC GTATGCGGGC CGAATATTGG GGGAGCCCCG CAACTCGTTG AGACTGGCTT TTGCATCTAC GATCTGTCTC GATTGCAAGA TGACTTCAAG GGCGGCGGTT TTGTGGGATT GGAGGTCGAT TTTTTCCGGG CGGATGCGAT TGCAAAAATG AGAGATGGCG ACATTATCAA GCTTACATTG GATGCTGTCC AAACAGCACT TGGAGTCGAA AGCATCGACA TGGAGCTCGT CGAAGATTCG GCGGTTATCA GAGCCCTAAA TGCTGTATCG CATTTTTGCG TCGGATCGGC AAGCAAGTCA CCTCCCGTGA GGATTACGAA TGGGCTCTAC ATTTGTGGCG ATTGGGTTGA TCGAAGCGGG CATGCGTCGT GGAGCACCGA GAAGGCTGTG GTTACTGGTT TACAGGTGGC 
CAACGCAATT GACAGAGACT TTGGATTGGA TTGTAGGCAA GCAGTTATTC CCGCTGCTGC CGATACCCCG CAACTCATGG GCTTGCGAAA GGCTGCCAAG GCATGGCGCA ATGCATCACC TCCGCGATCG TTCCCAGTGG CACCGTGGTT ACCCTTAAAG CAGTTTCGCC GTCGGATTGA
|
Protein sequence | MHSSLFFYAA FYSVYRGVVD NVSALQLINP ISLVRRNGSS SSSNRRFLSR CSKSSEREPC RVVVVGGGWA GFTAADTLAR ASSNVSVTLL DASQRGPGGL AGGWRTPKTG RPVEAGIHGF WREYRNTFAM IENIGLDLDD VLTNFTPSIL VSENGRVALA PVLGNTMDKP NHLQANNLDW SNPRSLLEQI APLLPSPLDV ALLADFNPDS ELSIADRISG IGLLGAWADF VQEDRDSWER YDKISAENLF RSIASISPNL YRDLVAPLLH VLPMTPGYDC SAAAALSCFH FFALQSRGAF DVRWCRGSIS ERIFNPWVEK LRESGNVCIQ GSARVTSIEH YGGEYTVMIN NKVSVTCDAV VLAVGATAAG RLIDSCLPLQ KIPNLASKWK ELRGVSCVAV RLFFAELPPS LASAMSDSPV VVCGPNIGGA PQLVETGFCI YDLSRLQDDF KGGGFVGLEV DFFRADAIAK MRDGDIIKLT LDAVQTALGV ESIDMELVED SAVIRALNAV SHFCVGSASK SPPVRITNGL YICGDWVDRS GHASGTVVTL KAVSPSD
|
| |