Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_51970 |
Symbol | |
ID | 7201046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011676 |
Strand | - |
Start bp | 842786 |
End bp | 844453 |
Gene Length | 1668 bp |
Protein Length | 555 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180331 |
Protein GI | 219119129 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0997978 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTCAC GACTTCCCGT TACGGGCGTC GAAGTCGTTC GCGATACTAT TCGATATTCT GGGGTATCCA CTCCGATTTC CGAACGGGAA GGCTTGGGGA TTCGAGGCTT GGTACCCGCT GCCTTTCTCC CACTCGAACT CGACGTCGAA CGATGCATGT TGCAAATGCG GTCCAAAGAA TCGCCACTGG AAAAGTACAT TTATCTGCAC AACATTCAGG ATGTCTCGGA ACGTCTCTTT TATGCTATTC TTTGCAAGTA CACGTCCGAA GTCATGCCGT TGGTCTACAC ACCCACCGTC GGTGAAGCAT GTCAGAACTT TTCGGCCATT TATCGGGGTA CGCTTCGCGG CATGTACTTT TCGCTAGAAG ATTCGGGCAA GATTCGTACA CTCCTCGACA ATTGGTTTAC CTCCAAGATT ACTACAATTG TTGTCACGGA TGGTGAACGG ATTCTAGGAT TGGGTGATCT CGGTGTCAAC GGTATGGGTA TTCCCATTGG GAAATTGGCC CTGTACACGG CCTGCGGTGG CATCGATCCG GCCAAGGTCT TGCCGGTACA CATTGACGTG GGCACCAACA ACGAGGAAAA TCTGAACGAT CCGTACTACC TCGGTCTTCG ACGGCCTCGG GAGCGAGGGC AAGCCTACGA TGATTTGATT GCCGAGTTCT TTGAAGCTGC TCAGAACAAA TTCGGAGCCA ATGTGATGAT TCAATTCGAG GACTTTGGTA ACTTGAACGC CTTCCGGCTA CTAAGTGCGT GGCAAGACAA GGCCTGCACT TTCAATGACG ATATTCAAGG AACGGCAGCC GTGGCTCTGG CCGGTTTGCT TGCTTCCAAC CGACTCACTG GCAAAGACTT GATTGATCAC ATTTTTTTGT TTGCCGGCGC GGGGGAAGCC GGGACCGGTA TTGCTGAACT ATTGGCGCTC GCCATTGCCG AGAAGGGCCA CTTACCAATT GAACAAGCTC GGAAGAAGAT CTTTCTCGTC GATTCGAAGG GTTTGGTGAC CAAATCGCGT TTGGATAGCC TACAGCACCA CAAGGTCGAT TTTGCGCACG ATGTGGACGA CTGCCCAAAC TTGTTAGCAG CAATCGACAT GCTCAAGCCT ACCGGATTGA TCGGTGTATC CGCCATTCCG AATTCGTTTA CGAAAGAAAT TTGCGAAAAC ATGGCTGCCC ACAACAAAAT TCCGGTCATT TTTGCGTTGA GCAATCCTAC GTCCAAGGCG GAATGCACGG CGCAAGAAGC CTATGAATGG ACCGATGGGC GTGCAATTTT TTGCAGCGGC AGTCCATTTG ATCCGGTGAC GTTGCAGGAT GGGCGCCAAC GTGTCCCGGG GCAGGGCAAC AACGCATACA TTTTCCCGGG CATTGGGCTT GGCGTATTGG CGGCCGGATC TACTCGCATT ACAAATTACG ATATGCTGTT GGCAGCGGAA ACGTTGGCGG CGGAAGTAGG TCCCGAAGAG TTGGACGTCG GTTGCATGTA TCCTCCACTG TCTCGGATCA GACAGGTTTC GAAAAACATT GCAATCGCCG TTGCCAATCA GGCGCACGAA ACGGGAGTAG CAACCGAGCA GAGACCGGTG GATATGGGAA AGTACGTGGA ATCACTCATG TACGATCCAT TTGAGGAGGT TGACGTTCAC TTGGGATCCA AGAAGTAG
|
Protein sequence | MVSRLPVTGV EVVRDTIRYS GVSTPISERE GLGIRGLVPA AFLPLELDVE RCMLQMRSKE SPLEKYIYLH NIQDVSERLF YAILCKYTSE VMPLVYTPTV GEACQNFSAI YRGTLRGMYF SLEDSGKIRT LLDNWFTSKI TTIVVTDGER ILGLGDLGVN GMGIPIGKLA LYTACGGIDP AKVLPVHIDV GTNNEENLND PYYLGLRRPR ERGQAYDDLI AEFFEAAQNK FGANVMIQFE DFGNLNAFRL LSAWQDKACT FNDDIQGTAA VALAGLLASN RLTGKDLIDH IFLFAGAGEA GTGIAELLAL AIAEKGHLPI EQARKKIFLV DSKGLVTKSR LDSLQHHKVD FAHDVDDCPN LLAAIDMLKP TGLIGVSAIP NSFTKEICEN MAAHNKIPVI FALSNPTSKA ECTAQEAYEW TDGRAIFCSG SPFDPVTLQD GRQRVPGQGN NAYIFPGIGL GVLAAGSTRI TNYDMLLAAE TLAAEVGPEE LDVGCMYPPL SRIRQVSKNI AIAVANQAHE TGVATEQRPV DMGKYVESLM YDPFEEVDVH LGSKK
|
| |