Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_49215 |
Symbol | |
ID | 7195685 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011689 |
Strand | - |
Start bp | 247627 |
End bp | 249742 |
Gene Length | 2116 bp |
Protein Length | 582 aa |
Translation table | |
GC content | 51% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002183955 |
Protein GI | 219127465 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.108223 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAACGCAACA AAAACTAGTC TACTTGTCGC TACCTCGAGC AGCAGAGCTC ACAACATATC TAACTAACTG CGAGGCGGAC CTCTGAGAGC AAACGTCAGA ATTTCTGTAT GTGCACAAAG CACAACAATA CTGCGTACCG AATCAGTCCG CCACGTCGTC AACGATCGAC AGTGCTTCAC ATCAGTCGCG CAACCTCGTC CAAGGTGTGA GGGACTTCAC CAATAGTTGT CACAAAACTT CTGGGCTCAA AAAAAGATTG GAAAATCGTA AGACAGGACA ATGAAGACTT ATGGTTTTGC ATCCGTACTC ATACATCTTT CGTGGGGGCT TATATTCACC TTAGCCACCG AAGCCGACTT TGATGCCCTA ATAGCGTCGA TGACAGCCGA CGCAATCGAG CTTGCCCGAC AAGTTGAACT CCTCTATCAA AAGCGCTGCA ACGAGATTTC TTTGCGACAA TGTGCTCGTG GCAGTTACAA CGAGTGCACA TCACTTTACC CCAACCAGAC ATGCCCCGGT GGCGAAGATT TGAACGTTGC CCAATGCGGC GACGGAGTCT CCTGCAGCGG TCTCTGGGAC TATTCCATTT CGAATACGCG TTTGCATCAG AACTTGGTAG ATTCTGCTGA TGGCAACCCG TCCGATCCTA ATGTCATCGA GACCATATGC TTCACGCAGC AGCTAGACGA GTTCTTTGTT CAGAAGCGAG CTGAGCAAAA GCCATATTGG GACAGCCTAG GGCTGCGTAC TCCACAAATG TACTTCGGGT CACAGAATGG CGCCTTTCGG ATATACCCGG CTAGACAGTC GGAAACTTGC GGAGTATACG ATCCCCGTCT CCGGCCCTGG TATATTGCTG CCAGTAGCGG ACCCAAGAAC GTCGTACTTG TTCTCGATAC CAGCGGCAGT ATGACCGACG GAAACCGTCT CAGTCTTCTC AAGCAAGCTG CAAAGCAAGT CATCGAAACA TTGACAGTTG GAGACCGCGT TGCCATTGTT GAGTTTTCGT CGCAGGCGAA GCTGTTTGCC CAAGACAACA AGTTTCTCTT CACGGCAACG CAGAAGAACA AGGAACTTCT AGCAACCCAC ATCGACAGCT TCACGGCAGC AGGGGCAACG AACTTCCTGG ATGCGTTCAC AGCGGCCTTT GCAGTCCTGA ACGACTCCAT CGATCAGGAA TATCACGTGG GATGTACCAC AGCCATCCTC TTTTTAACCG ACGGTGAAAT GACGCAGCCA GAAAATGTAC AGGAAGCCGA TGTACTGGAC CTGGTCAACA CTGGCATTTC CAATTTGGAG GCGCGACTAG GTCGATCAGT CTTCTTGTTC ACCTTCAGTA TCTCCGACAA CAACAATGTC CATGCCTTCC CAAAGCAGAT TGCCTGCTCC ACAGGCGACA ATGGTATTTG GTCCAAGATC GTCGACGAGC GCGAAATATT TGACTCGTTG ACCAGCTACT ATCGTTTGTT GGCCATTGGA CTGGGCAGAG ACGGGAACGA GAACTTCGCG GCGTGGGTAG AGCCGTATCA ATTTGCTTCG GGCGAAATCT GGGGCACTAC AGTGTCAGTG CCCGTTTATG ATCGATCGGT GACTCCGAAC TTGTTTCTCG GTGTGGTTGG TGTCGACTTC ACCTTGACAG CTGCGGACAG AGCACTCGGG GTCCCCGATG GAAGCAAGGA GAGCATTAAT CGAATCGTAA GGCAGTCGAC TGCTGTTTGC CCGACAATTG ATCTCGCATT GTGCGAGCTC GAGAGCTTCC GCCGCCGCAG CAGTGCTGGT GACGAGGCGC TGTGCACGAC GAATTGCACC GCCAGTGAGT ATTTTGAGAT CGAGGAGCAG GCCTGTTCCT TTGTGAATGA TTATCCACGT GACCTCTTTA TCAACAACAT GCTCCAGGGC CTCTCTTACG AGGAACGCGT GTGCTGCGTT GTTGGAGAAA ATACTGTTCA TACAGCAGAT CAATGCGTGG CTGGCCCCGA CGAACTGAAT ATAGGCTTAA TTGTTGGAAC TGTGATTGGA GGAATTTTGG TCGTCCTGTT AGTGATAGCA GCTTGGATCT ACTTCAAGCA TTTTTATGGC AAAAAGGGAT CCTTTCGCGA CAGCCACGAG TCGGCT
|
Protein sequence | MKTYGFASVL IHLSWGLIFT LATEADFDAL IASMTADAIE LARQVELLYQ KRCNEISLRQ CARGSYNECT SLYPNQTCPG GEDLNVAQCG DGVSCSGLWD YSISNTRLHQ NLVDSADGNP SDPNVIETIC FTQQLDEFFV QKRAEQKPYW DSLGLRTPQM YFGSQNGAFR IYPARQSETC GVYDPRLRPW YIAASSGPKN VVLVLDTSGS MTDGNRLSLL KQAAKQVIET LTVGDRVAIV EFSSQAKLFA QDNKFLFTAT QKNKELLATH IDSFTAAGAT NFLDAFTAAF AVLNDSIDQE YHVGCTTAIL FLTDGEMTQP ENVQEADVLD LVNTGISNLE ARLGRSVFLF TFSISDNNNV HAFPKQIACS TGDNGIWSKI VDEREIFDSL TSYYRLLAIG LGRDGNENFA AWVEPYQFAS GEIWGTTVSV PVYDRSVTPN LFLGVVGVDF TLTAADRALG VPDGSKESIN RIVRQSTAVC PTIDLALCEL ESFRRRSSAG DEALCTTNCT ASEYFEIEEQ ACSFVNDYPR DLFINNMLQG LSYEERVCCV VGENTVHTAD QWDLLQAFLW QKGILSRQPR VG
|
| |