Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44876 |
Symbol | cupD |
ID | 7199802 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011673 |
Strand | - |
Start bp | 499783 |
End bp | 502846 |
Gene Length | 3064 bp |
Protein Length | 933 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002179013 |
Protein GI | 219116436 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCTCT CTCCACAAGC CGCCCGTATG GGTACCGTGT GCACGACACT CCGCAAGATA GTTTCGACGG CGCGTCAGCG GTCTCTCTCG CATTCGGCGT TCGCGTCGGT CCGTAGCGAG ACCGCGTTTG GAGCCCGGGC GTTTTCGACG CGTGCGAAAT CTACTCTAGC GGCACTCGAA GACTCGGATG ACGAGGATCT CACCTTTCGT AGTCAGGGAC ACGCCGCAGC CGCCGCGGCT TTGAGCAAGG CAGGATTGAA CAAGTCTCAC GATGAGGCCT GGATGATCAA CGTCAATCGC AACGACGACA ACGAATGGCT GAACGGGCCG CGCAGTGCCG AATGGTTTAC GGGCATTCAC CCTAGTCAAT GTCCTGGTAA GTTTACAAGA GCAATTGCCA AAACTTGTGG CGAACTTGCA CGCGCACGCG AACGGAAGTT GTGCTCACTC AGAGGGATCT AAATTATATT TGTATATCCA ACCAGGAGCC GACCAAGCGG GCACCATCCG CTCGCTTTCA CTTCCGAATC TGTCGGCCGT TACCCGTGAA GCTGCAAAAG AATACTTTGA CAACTCATGG ACGCTCTACG AGACATTGTT TGCCGGTCTC AAAGGAGAAG AAGGATTTTA TCGGTGAGTT TGAGTTGTGT GGTGATGTTT CGTTTTGTAT GACCGTTGTC TAATTTCACC GTCCCACCGG ACTGCAGCCC GCCAGTCCAC GGTCTCCGCC ACCCCCAGAT ATTCTACTAC GGACACACTG CTTGTCTCTA CATCAACAAG CTCCGCGTCA GTAAAGTCTT ACCCAAACCT GTGAACGCCT ATTTCGAGTC CATCTTCGAA GTCGGTGTAG ACGAAATGCT CTGGGATGAC ATGAACAAGA ATGATATGCT TTGGCCCACA GTTTCGGAGG TGCACGAGTA TCGACAGCAA GTATACAAAA CGGTTGTGGA CGCTATTTTG AATCATCCTA GTCTTGATCA AAGGAACGGT CCAGTGAAAG TCGATCAGGA TCATCCAATG TGGGCATTGT TCATGGGCTT CGAACACGAG CGGATCCATA TGGAAACGAG TTCAGTCTTG TTCCGTGAGA CGCCGTACCA TTTGGTCCAA ACACCCCAGC ATTGGCCTCC GATTCATCCG TCGGCTTTCA ACGATGCCTC GCCCACAAGC AATCCGATAG AAACTTTGGA TTACCCCGCG AACCGCATGA TTGCCGTGGA CAATGGAACC GTCGATCTCG GAAAGCCTGC CGACTTTCCT TCCTTTGGAT GGGACAATGA GTACGGTGAA CGCAATATGG ATGTGCCTCC ATTCTTCGCT AGTGAACACA TGATCACAAA TGGAGAATAC TGGCAGTTTG TCGACAATGG TGGCTATCGA AATCGAGAGT ATTGGTGCGA CGACGGCTGG GCGTGGCGCA GTCATCGCAA TCTTAAGTGG CCTTTTTTTT GGGAGCCCGC AGGACCCGCT GGGTCCAATA AGTTTTCGTT GCGAACCATT TTCAAGATCG TTCCCATGCC GTGGAGTTGG CCTGTTGATG TAAATTACTA CGAAGCGCAA GCCTTCTGTC GATGGAAGAC CGAGAAAGAA GGATCTCCGA CTTCAAAACC GTATAGAATT CTCACCGAAG CGGAGCATCA CATCATTCGA AACCACGATC ACAACTTGGA GGCTGCTCGT AGAGACGTTT CGGCGGATAA GGTGATGGTG ACTTCAGGGC AAGCGTTTCC CAAAGGATCG GCTGGATCAA ATTTGAACCT GGCATTCTCT AGCCAAAACC CCGTCGATTT CTTTGAGCCG TCCCAAACTG GCCACCGTGA TACCACCGGA AGTGCCTGGG AATGGACGGA AGACCACTTC AACCCTTTAA AGGGATTTGA AGTCCACCAC GTGTACGATG ATTTTTCCAC TCCATGTTTT GATGGCAAGC ACTCTATCAT TGTGGGGGGA TCTTTTATAA GTACTGGCGA CGAGGCATCA GTTTTTGCAC GATTCCATTT CCGACCCCAT TTCCTACAAC ATTCTGGTTT CCGTCTGGTT GCATCAGATC ACGATGCTCC TGCTACGCAC CTTTTTGCCG GAAATTTCGA TGGTCAAGTT GCCGCACGCG ATGCCGCGGT CGCGCAGGAA GAATCCAAGC CGAGACAGTC TTCATTAGGA AGCGGCAGTG GCAGCGGCAA TGTTTACGAG ACGGATGACA GCTTGCATAT GTATCTTGGC CTTCATTACC CTAATTCTGG CGAGAAGGAA GGCGTTGCCC CGATCCTTCC TCACGACAAC TCTCCAAACC ATGGAACTGG CTTCCCGCAA CGAGTGGCAG GTCTTCTGTC CTCACTGAAA CCCGAGTTCA ATAACAATCG CGCATTGGAT ATTGGCTGTG CTGTTGGAGG GGCGTCTTTT GAACTTGCTA AGACTTTCGA TCACGTGGAC GCCTTTGATT TCAGTGGATC TTTCGTGAGC GCTGCCAAGC GAATGCAATC GGCAGAAAAT ATCAAGTTTC GGGTTCCCGT GGAGGCTGAA CTATATGAAG ACCTTCAGGC TATCCACGAA CATGGTGTGA CGGATTCGGT GCGCTCCAAA GTGCAGTTCT TTACAGGGGA CGCCTGCCGA CTCATCGATA TGAAAGAAGA CGGAATTCTT GGCTCATATG ATGGCGTGGT CATGTCGAAT TTACTCTGCC GCCTCCCAGA TCCAATGGCA TGCCTCGCCG GACTTCCAGA GATTATAAAT CCCGGTGGAG TGGTCGTGAT GGTGACGCCA TTTTCGTGGT TGACTGAGTT CACCCCCCGG GGCAAATGGC TAGGAGGATT TTACGACCCC GTAACAAACG AAGCTATCTA TTCGAAGGAC ATCCTGCGCC AAATTATGGC CTCGAATGGG TTCGAGAAGA TTCATGAGGT TCAGATGCCG CTCGTCATTC GGGAGCACCA ACGTAAATAC CAGTATATTG TCAGCGAAGC TACAGGTTGG CGTAAGACAG GATGATGCAT TTCTGTTTGC ATTGTCTATA CATCCTTTAC ATTAATGATT TTTTCCATTT AAACTAGCAT ATTCTATTGA CTAG
|
Protein sequence | MMLSPQAARM GTVCTTLRKI VSTARQRSLS HSAFASVRSE TAFGARAFST RAKSTLAALE DSDDEDLTFR SQGHAAAAAA LSKAGLNKSH DEAWMINVNR NDDNEWLNGP RSAEWFTGIH PSQCPGADQA GTIRSLSLPN LSAVTREAAK EYFDNSWTLY ETLFAGLKGE EGFYRPPVHG LRHPQIFYYG HTACLYINKL RVSKVLPKPV NAYFESIFEV GVDEMLWDDM NKNDMLWPTV SEVHEYRQQV YKTVVDAILN HPSLDQRNGP VKVDQDHPMW ALFMGFEHER IHMETSSVLF RETPYHLVQT PQHWPPIHPS AFNDASPTSN PIETLDYPAN RMIAVDNGTV DLGKPADFPS FGWDNEYGER NMDVPPFFAS EHMITNGEYW QFVDNGGYRN REYWCDDGWA WRSHRNLKWP FFWEPAGPAG SNKFSLRTIF KIVPMPWSWP VDVNYYEAQA FCRWKTEKEG SPTSKPYRIL TEAEHHIIRN HDHNLEAARR DVSADKVMVT SGQAFPKGSA GSNLNLAFSS QNPVDFFEPS QTGHRDTTGS AWEWTEDHFN PLKGFEVHHV YDDFSTPCFD GKHSIIVGGS FISTGDEASV FARFHFRPHF LQHSGFRLVA SDHDAPATHL FAGNFDGQVA ARDAAVAQEE SKPRQSSLGS GSGSGNVYET DDSLHMYLGL HYPNSGEKEG VAPILPHDNS PNHGTGFPQR VAGLLSSLKP EFNNNRALDI GCAVGGASFE LAKTFDHVDA FDFSGSFVSA AKRMQSAENI KFRVPVEAEL YEDLQAIHEH GVTDSVRSKV QFFTGDACRL IDMKEDGILG SYDGVVMSNL LCRLPDPMAC LAGLPEIINP GGVVVMVTPF SWLTEFTPRG KWLGGFYDPV TNEAIYSKDI LRQIMASNGF EKIHEVQMPL VIREHQRKYQ YIVSEATGWR KTG
|
| |