Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_36013 |
Symbol | |
ID | 7201351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011677 |
Strand | + |
Start bp | 343560 |
End bp | 345281 |
Gene Length | 1722 bp |
Protein Length | 573 aa |
Translation table | |
GC content | 52% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002180416 |
Protein GI | 219119306 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGACAC CCGCCACCAA GACGGAATTC GGAGACTACC AAGTCAATGC CGCCATGGGC TTGGCCAAAG CACTCAACCT CAGTCCCCGC GAATGCGCCG CACAAATCGT CAAAGCCCTG CAACCCAAAA TTCAATCGTT CATGGAAGAG CCGGAAATTG CCGGCCCGGG ATTCGTCAAT TTGCGCTTTC GAACATCGTA CTTGACGCAG GCCGTCTCGA GCATGGCTGG GGACGCTCAA GGGCGTTTGG CGGTTCCCCG CACGGCGCAA ACACAAAAAA TTGTCGTCGA CTTTTCGTCC CCCAACATTG CCAAAGAAAT GCACGTGGGC CATTTGCGAT CCACCATTAT TGGTGATACC CTGTGCAACG TCCTTGAGTT CGCTGGACAC GAAGTGACCA GACTTAATCA CGTCGGCGAT TGGGGAACAC AGTTTGGTAT GCTGGTTGAG CATTTGCGGG ACGAGTATCC GGCAGCGCTG CGATCAGACA CCGCCGACGA TGTGGATCTG GGAGACTTGG TGCAACTTTA CAAGGCAGCC AAGAAACGTT TCGACGAAGA CGATGAATTC AAGACGCGAG CACGCGAAGG GGTCGTGAAG CTGCAAGCGG GAAACGAAGA GGAGCTGGCC GCCTGGGAGT CCTTGTGCGC AGCGAGTCGC AAGGAATATC AAAAGATTTA CGATCGCCTA CAAATTGAAG GCCTGGTTGA GCGAGGCGAG TCGTTTTACA ATCCGTTTCT GAAGGACGTC GTCGACGAAC TTGTCGAAAA GGGTTTGGCC GTCGAGAGTG ACGGAGCCTT AGTGGTATAT CTGGAAGGAT ACACCAATCG TGACGGTTCT CCGTTGCCCA TGATTGTGCG CAAATCCGAT GGTGGCTTTA ATTACGCAAC CACCGATCTG GCGGCCATGC GACACCGTAC GTTGATGCCC CGAGCAGAGT CGGGAGAAAG AGCAGACCGG GTCCTCTACG TCACGGATGC GGGACAGGCA CAACATTTCG AAATGGTGTT TGAAGCCGGA AAAGTTGCCG GATTCTGTAG GGAGGGTGCT TCCCTGGAAC ATGTACCCTT TGGCCTAGTC CAAGGGGAAG ATGGCAAGAA ATTTGCCACC AGGTCCGGTG AGACTGTCAA GCTAAAGGAT CTTCTGGACG AAGCTGTCCG GATTTCTGGT GCGGATTTGA AGAAGCGCAA CGAAAACGTC GATCAGGAAT TCCTGGACCG TGTGGACAAT GTCGCGCGTA TTGTAGGTAT CGGTGCTGTG AAATATGCCG ATCTTTCCAT GAATCGCGAG TCGAATTACC GCTTCAGCTA CGATCGCATG CTGAGTCTGA ACGGGAACAC TGCCCCATAC ATGCTCTACG CCTACGCCCG TGTATGTGGT ATCATCCGCA AGGCCAGTGG GCAAGAAGGA ACCGGGGCCA TTGATTGGCC AAAGGCTTCC GAAATAATGA TCACGCACGA GTCTGAGTTG GAGTTAATAC GGAATCTAGT CAAGTTACCC GACGTGTTGA ACGAAGTTGA ACGAGAACTG TATCCAAACA GAATGTGTGA CTATCTTTTC GAGACGTCAC AAAAGTTTAA TCAATTTTAC GAGAGTTGCT CGGTCAACAA AGCGGAAAGC GAAGAGATCA AAGCAAGTCG TCTTTCCCTG TGTACAGCAA CTGCGGGCAC TATTCGCTTA CTTTTGACTT TGCTCGGCAT CGAAACATTG GAAAAAATGT AG
|
Protein sequence | MVTPATKTEF GDYQVNAAMG LAKALNLSPR ECAAQIVKAL QPKIQSFMEE PEIAGPGFVN LRFRTSYLTQ AVSSMAGDAQ GRLAVPRTAQ TQKIVVDFSS PNIAKEMHVG HLRSTIIGDT LCNVLEFAGH EVTRLNHVGD WGTQFGMLVE HLRDEYPAAL RSDTADDVDL GDLVQLYKAA KKRFDEDDEF KTRAREGVVK LQAGNEEELA AWESLCAASR KEYQKIYDRL QIEGLVERGE SFYNPFLKDV VDELVEKGLA VESDGALVVY LEGYTNRDGS PLPMIVRKSD GGFNYATTDL AAMRHRTLMP RAESGERADR VLYVTDAGQA QHFEMVFEAG KVAGFCREGA SLEHVPFGLV QGEDGKKFAT RSGETVKLKD LLDEAVRISG ADLKKRNENV DQEFLDRVDN VARIVGIGAV KYADLSMNRE SNYRFSYDRM LSLNGNTAPY MLYAYARVCG IIRKASGQEG TGAIDWPKAS EIMITHESEL ELIRNLVKLP DVLNEVEREL YPNRMCDYLF ETSQKFNQFY ESCSVNKAES EEIKASRLSL CTATAGTIRL LLTLLGIETL EKM
|
| |