Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_50546 |
Symbol | |
ID | 7199380 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011699 |
Strand | + |
Start bp | 76603 |
End bp | 78910 |
Gene Length | 2308 bp |
Protein Length | 705 aa |
Translation table | |
GC content | 56% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002185482 |
Protein GI | 219130669 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCAACA CACCCAAGCT TGCCTTTGTG TGGGTTTTGG GCGCCGCCTG TGTGGTTTGG TCGTCCCACG CGTGGACCTC GTCGACGACA ACACGACACC CACAACGACG CCATCGTACA AGTCCCGCAC GGCCGTATCC GCCAGTACCA CTACCGACGC GACGTCCACC GATCCCATCC ACACTCCGGA TGAACTCTTG ACGCCCGACA GCATTTCCAA ACTCCGGTTT CGGGAACTCA AACGCGAACT CCAAGCCCGC TCCTTGACCT TGGAAGGGAC GACCGGACAA TTACGTGTAC GCTTGCGACA GGCCGTCGGA TTGCCGGATC CTGAATGTAT CGTCAACGAA GACGGCATTG AAGACGATTG TCAAACGGTA CGTTCACCGG CATGGCTATG GTACCACCCC AACGCATCCT TCGATCGTTT CCGACCTGAC TTGGCTCACA CGGTTTCTCT GTTTGGGTGT GGAACAAATA TTGTACGTAT GCGCAGGAAT ACGAAATGGC CCGTATCGTC ACGTTTCGCG ACGAGTCGGA TCCGGAATAC GAAGTCAAAG AATTAATGGC GCAGATTGAA GAAAAGGCCG CCCTCGGACA CTGGAAGGCT GCCACCCGGA AACTCAAGAC CTTGACGAGG CGCTTTGCCA ACACAACCGT TCCGGAATCC GTATACCTCA CGACTCTCGA AGCCTGCATG GCCAATCGCT TGCAGGGTGC CCGTGCCAGT GAACCCGCTC GCAAAATACT CGAAAAAATG GTGGAGCAGG GATACGCCAT CCCCGATACT TTCGGAAACT TTTGCGTCAA AACCTGTCTC GGGGAACAAG GTACGGACTC GACGCACCAG GGCTTTGGTG GAATCGACAC CGCCCTCGCT ATTGTTGCCG CCATGGAACA GGCCAACACA CCACTCCAAC TCGAAACCTA CGACAAACTC ATCCAGGCTT TGGTCAAGGA AGGTTCCGTG GACCACGCCC TGGCACTCCT CCGAACCGTC GTCGTCGAGC AGGCCCAAAC GCCGTCTTTG GAAACCTTTG ATCGCGTGGC ACGCTGCGCG GTTTCCCGTG CCGTCCACGA CGATGAAGCC GTCCTCAGCG TCCTCACCAT TTGCAAGGCC TCCGGGTACG ATTTAGATAC CATTGCGGCC ACCGAAGCTG GTCGCGCGAT ACTCGCCTGC GGGGTCATCG CCGCCGAGCG ACTCGGCAAC GATGCCCTCG CCTTCCGGCT ACTCACCGCC GCCAGCAAAG CCAAGGGCGT CGCACCCGAT CGGGGAGACA TCATGATTGC ACTCGGATCC TCCACGGCGC AACGGGCCTG TACCCTTATC CACAAACGGG CCATTAATAA GGCCGTCGAA GACGGCCAGT GGCAGCTCTC CGTCAAGGTG CTGGAACTCA TGTTGGAGCG ATCGTTGAAA CCGTCTAACT GGGTGTGGCG GAACGTCGTC ACGTGCTGTG CCAAAGCGAA GAAGAGCAAA AAGGCGACGG CCGTCTTGCT CGACTGGGTC AAGCTCTCGG AAGACCTCAA GGCGGACAAA CCTCCACTGT CGGTCTTCAA CACGGTCGTC AACGTGTGCG AGATTTGCGA CGAACAGGAA TTGACGCTGT TGGTCCTCGA CAAGATGAAA CAAACGCACG ATACGGAAGG AAATATTATT ACCTTTAACA TTGCCCTCAA GCGACTCGCC AAACAAGGAA ACTACCAGGC ATGTGAAGGT ATCATTCTCG GCATGTTGCA GGCAGGGGTA GAGCCCTCAG TGGTTTCTTA TACGACGGCC ATTGCCTCGT GTGCCAGCGC GGAAGAAAGG CAGCCCACAA TGGCGTATGA GTGGTTGAAG CGTATGCGAT CACGAAACGT CAATCCCAAC GTCTTGACCT ACAACACGGC CATGGCGGCG TGCCTTGACG GCAAGCTTGA AAGCAGCTTT ATCGGCAGCA AACTAGCCAA GGAAATGCTG GATGACGTAA ATATGCAGTT GCAGCAAGGT GACGAAAGTA GTGAAGTGAA CGCCTACACG AACGTGATTC CCGATAGTGC CACGAAGAAT ATGGCACGGC AGCTCATGAC GCAGCTCAAG TCGAACTGGA AAGAAGGAGC GATTGATAAG CGAGTAGCAA CGGATACTGT CCGTGTTCCT TTGAAGGAAC TTGTCAATTT CTCTCGGTCG GAAGCCGCCG ACCGGGCTCG CCAAGAGACG GCGAAGCGGA CCATTGTGGA CGACGATCAG GCTGCGTCGA CCGCTCTCGA CGAAATCGAG CTGGAGTACA CTGCGGCCTC GAGCACGCAT CGATCCGCGG AAGTGTAA
|
Protein sequence | MRNTPKLAFV WVLGAACSRT AVSASTTTDA TSTDPIHTPD ELLTPDSISK LRFRELKREL QARSLTLEGT TGQLRVRLRQ AVGLPDPECI VNEDGIEDDC QTEYEMARIV TFRDESDPEY EVKELMAQIE EKAALGHWKA ATRKLKTLTR RFANTTVPES VYLTTLEACM ANRLQGARAS EPARKILEKM VEQGYAIPDT FGNFCVKTCL GEQGTDSTHQ GFGGIDTALA IVAAMEQANT PLQLETYDKL IQALVKEGSV DHALALLRTV VVEQAQTPSL ETFDRVARCA VSRAVHDDEA VLSVLTICKA SGYDLDTIAA TEAGRAILAC GVIAAERLGN DALAFRLLTA ASKAKGVAPD RGDIMIALGS STAQRACTLI HKRAINKAVE DGQWQLSVKV LELMLERSLK PSNWVWRNVV TCCAKAKKSK KATAVLLDWV KLSEDLKADK PPLSVFNTVV NVCEICDEQE LTLLVLDKMK QTHDTEGNII TFNIALKRLA KQGNYQACEG IILGMLQAGV EPSVVSYTTA IASCASAEER QPTMAYEWLK RMRSRNVNPN VLTYNTAMAA CLDGKLESSF IGSKLAKEML DDVNMQLQQG DESSEVNAYT NVIPDSATKN MARQLMTQLK SNWKEGAIDK RVATDTVRVP LKELVNFSRS EAADRARQET AKRTIVDDDQ AASTALDEIE LEYTAASSTH RSAEV
|
| |