Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PHATRDRAFT_44445 |
Symbol | |
ID | 7197739 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Phaeodactylum tricornutum CCAP 1055/1 |
Kingdom | Eukaryota |
Replicon accession | NC_011672 |
Strand | + |
Start bp | 578096 |
End bp | 582029 |
Gene Length | 3934 bp |
Protein Length | 1163 aa |
Translation table | |
GC content | 50% |
IMG OID | |
Product | predicted protein |
Protein accession | XP_002178256 |
Protein GI | 219114921 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.671968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGTTCGCT CCCAGACTCT CGTCCTTGGT GTTTGTTGGT TCCGTGGCTG CTTCTTTCTA GATCGGTTCT CTCTGACTAT CTTTGATGAC TTCTGTTGCT GACCAGGAAG AGAACAGAAG AAAACGGACT TTGGGTAGCT CTTCGAGATG AATCGACGGC AGCGGCGCGG AGTTGTCTGG GCTGTTCTCG TTCTGCCATT GTTCCTTTCT CTCGGTCGTT CTGTATTCGG AGCACACAAG TCTTCTCGAA CGGAAGCTGG TGCCGAAGCA TCCACAACGG AGTCATTGTT TGTTCTATGC ACCGCCGACG GAACCATTTA CTCCATTGAC GCTTGGACGG GAGAATTCCA ATCAGTCGTG TTGACTGGAT CACCGTTGAT CTCACACCAC CAGCACGCAG CTCGAGACGA GCATGCGGAA AGTTCACACA GCATGATGGT ACCCGGCTTG GACGGAACCC TTTACTGGAA AGAAGATGGC GGGGCACTTC AAGCTTTGCC GCTCACGATG GAATCAATTC TGGAACATCC GGTACGGTCC TGTGATCCGG AAGACCAACA CTGTGGCATT CTCACAGCGA CAGCCCATAC ATCGCTTATT GCCCTAGATG AGTGGGGAAA GCTAGCGTGG AAAACGATTC CCGGATCGAC GGACGACCCG CTTCAGGCTC CGACGACTTC TTCCAATGCC GATGAGAACA CGACACCTCG TCCGAACGAA GAACAAGCGC AGGCATCGTC GGTACTGTTA CAGCGTAAAG ACTACTGGAT CCAACATATC TCCTCCGAAA CAGGTCGGCA AGTTTGGAAC GCGACACTAG GAATGTACCA GGCATTGGAA TTTGATGCCG ATGAGCAAGA CGAAGAAGGT CGACTCTTGC CTGGGGCCAG TACCTCTAGA CACTCCGGCG CTACACACTT TCCAGGCGTG GTTTTTTCGA ATGCGGGACG GACATTGGCC GGCGTGGATC TTCAAACACA ATCCATCTTG TGGCAGATCG ACACACCTAC CGTATTGGCG ACCGTATTTG GTATGCATGC GGGTCAATGG AAACCTGTAC ACGTGTTGCA ATCTCACCCA TTGGAATCTG AAACAATGCC CAAAGTGGTA CAGCAACGAG CCTTGCCGGA TCGCGCCTCG CCGGTAACAC AGTATGTAGA TTACGCGGAA CTTTGGAGAC AGCACCAATC GTCGGCGGAG CAAACGCATC GATGGTGTCC CCGAACGACC TGTCTCCCCA ATCAACCTTG TGAATGTTCG GAATTCAATG ACGACACTTG TGTGGATACC GAACCTTCTC GTCTCGAGCT ACCATCGCCC ACTTCTGCAG TGCCCATGTT GGTTTCTCAG GTTGACGGTT TGCTGTTGTC CTGGCGTGTC GTGTCGCTTA TCGTTGGGAG TTTGCTTGGA CTAGTGGTTT GTGGACAGTT TTGGTACATT CGCAAAAAAG AAAAGTGGTC GCAAGCGACG AAGACGTCCC GATCGCGTTC TTCTTCTTTT GCCACAAACT CGCATGAGGA TCAAGTTGAC GGACCCATGA TTGGAATGAA ACGCACCATG AGCTTACCGG CGATTCTAAA GCCTGAAAAT TCTACCGCTC TGCCGACGCA ATCTTCGGTT GATATTGTCC CGAGTGCTCC AGCATATTCG AGTCCTGCAC CCTCGTCGAA AGACGCCAGC GGTAGCATTC CGTTGGTGCG GTATTCTCGG TACGCGTCCG AATTTCAAGA GCTGCGGGCT TTAGGGAAAG GCGGATTTGG TACTGTCTTT CAGTGTCAAA ACTCCTTGGA CGGTCGAGAT TATGCCATTA AGAAGATATT GATCCGAGGA AATGACCTTA ATTTCCAAAC TCGTTTGGAG CGAGTTTTGC GAGAAGTCAA AATCTTGGCG GTGTTGGACC ATCCGCATAT TGTTCGATAC TACACGGCGT GGTTGGAACT AGAAGAAAGC AATCATGACG AGAGTTTGAA CGTGGACGAC GACGAAACAT TGTCTCGAAA ATTTTCGAGC TCACTCTTGA CAGAACAAAC AGATTGGAAT GCCCACAAAA CAGGACACAA CAAATCGCCA AGGCCAAGAA CACAAGGGCT AATAATGGGC GGTGAAGAAA ACGGTTGGCG AGACGATTTT GGACTTTCCG AAGAACCGTC ATTCATTGTG CGCCGGACTT TTCATAGGGA AATCTCCGAC TGCGGCTTTA TTTTCGAAAA CAGTAACGAA ACGGGAACGG GCACTTCTCC GGACGAGGTT ACAGCAAATC AAGACACAGA AGCGGTCGAG GATGTCTGTT CGCATCATCA AGACCTGAAG TCGCTGGATA AAATTGCGGA TCCTGTTGTC ACCGGCGATC TTCCTCGATC CCCGTCGAGT AGCTCAGACA TAAGCGCGAG TGTGGTCATT GCACCCGAAC TCGCGAAGGC GTTGAAAAAG TCTAGCGACC CACCTTTACG GACTGTCCGG CATATTTTAT ATATTCAAAT GCAGCTTTGC AGTCAAGAAA CGGTCAATGA TTTCTTGACT GACGCGAATG CGCGAAAGGG GGTAGCGCCG GAAGGCGTCG ACGTTCCATC GGCGTTGAAG CTATTTTTCC AAATCGCGCA AGCTGTCCAG CATGTACATG GACGGGGTTT AATCCATCGC GATCTGAAAC CCAGCAACTG CTTTATCGAT GGATCAGGCA ACGTGAAAGT CGGTGATTTT GGCTTGAGCC GCGAATCAAC AGACAAAGAC GAAGGGGAGG CTTCTTTTCC TCAATCTCGG CAAGAGGATC GAGTTTTCGA CAATCACACT GCCGGAATTG GGACCCGATC ATACGCGAGT CCGGAACAAA TGAATGGCTC AGACTATGAT TCAAGTACAG ATGTTTACTC GTTGGGAATT ATCTTGTTTG AATTGTGCTA TCCCATGTAC ACCGGTATGG AACGCAATAT CTGTTTGAGC CAGCTACGAT GTTTGCGCTT TCCAGAGACG TGGCATGCCA CAGTGGGACG AGGGTTTCCG ACACTTCAAA ATCTGATCAA ATCTATGCTC AGTCCCAATC CTAATGAACG CCCCACAGCA GGGGTCGTCG CGCAGCACAT TCAGTCGATT CTCGGGGAAT TTACGATTCA ATCTTTGGAT CTCCAGGACG CGCCCGGTAC AATCTTGTTG CGCGTCGAAG CAGAGCATCG AGATGATGTC CTGCGGTACA CAATGCAATG TATTCAAAAC GTGGCGGAGG AGGACGATGC AGCAGAGGGC AAGATAGAAA TTGTACAGTA TGGTCTTCGT AGCTCAAATC GCAACAAAGA TAAACCTACC GCCGTAATGG AGTTTGCCCT TCGATCCAGT CTTCCCGGGA CAATGTTTGT TGAGTACTTG CGGAAACGGC CGGAGGTATT CGTTGTGAGG CAAGTGTCCC ACTCTTCGGG ATCTAGCGAC TCGAGGAGTC ACTGACACCG AAAGATGAGG ATTGCTCAAG TGACTTTTCT CGCGTTCTCG CTCGTAGTTC CTTCATTCTA AGGCTTCCTC CTTACGCCAC TCATAAACCT ACCATCTACT GACCTGCATA GAAGAGTAGA AATTTGGAGC AAGGAGAATT TTGGTTCTTC GCTCCAAATG AAGAAGTAAG ATGCTCAAAT TCTGGTGAAC ATACCACCAC TAGGCCATTG GGACACTTTA ACGGAGCAAC TCGGCTCCAT GCCAAGCCAA ATCCATGTCT TTCATCGCGA TAAGTATAAT ACAGGAACAA CAAGCCTTTT TTTGGGTCTG TGTTCGACAG TTTAGTCTAC ATTGCAAATT TCCAAAAATC GGATCTGTAA CGCAGTTTAT CTTTACCAAA AGTGCAGCGC AGCCATTGGA TCTTCAATTG AGCACATCAA AGTCTGCAGC ACGACACATG ATGAGATGTA CCAACGAAGA TGGCTTTGCT TGGGGTGGGG AACCACCGGG ATGA
|
Protein sequence | MNRRQRRGVV WAVLVLPLFL SLGRSVFGAH KSSRTEAGAE ASTTESLFVL CTADGTIYSI DAWTGEFQSV VLTGSPLISH HQHAARDEHA ESSHSMMVPG LDGTLYWKED GGALQALPLT MESILEHPVR SCDPEDQHCG ILTATAHTSL IALDEWGKLA WKTIPGSTDD PLQAPTTSSN ADENTTPRPN EEQAQASSVL LQRKDYWIQH ISSETGRQVW NATLGMYQAL EFDADEQDEE GRLLPGASTS RHSGATHFPG VVFSNAGRTL AGVDLQTQSI LWQIDTPTVL ATVFGMHAGQ WKPVHVLQSH PLESETMPKV VQQRALPDRA SPVTQYVDYA ELWRQHQSSA EQTHRWCPRT TCLPNQPCEC SEFNDDTCVD TEPSRLELPS PTSAVPMLVS QVDGLLLSWR VVSLIVGSLL GLVVCGQFWY IRKKEKWSQA TKTSRSRSSS FATNSHEDQV DGPMIGMKRT MSLPAILKPE NSTALPTQSS VDIVPSAPAY SSPAPSSKDA SGSIPLVRYS RYASEFQELR ALGKGGFGTV FQCQNSLDGR DYAIKKILIR GNDLNFQTRL ERVLREVKIL AVLDHPHIVR YYTAWLELEE SNHDESLNVD DDETLSRKFS SSLLTEQTDW NAHKTGHNKS PRPRTQGLIM GGEENGWRDD FGLSEEPSFI VRRTFHREIS DCGFIFENSN ETGTGTSPDE VTANQDTEAV EDVCSHHQDL KSLDKIADPV VTGDLPRSPS SSSDISASVV IAPELAKALK KSSDPPLRTV RHILYIQMQL CSQETVNDFL TDANARKGVA PEGVDVPSAL KLFFQIAQAV QHVHGRGLIH RDLKPSNCFI DGSGNVKVGD FGLSRESTDK DEGEASFPQS RQEDRVFDNH TAGIGTRSYA SPEQMNGSDY DSSTDVYSLG IILFELCYPM YTGMERNICL SQLRCLRFPE TWHATVGRGF PTLQNLIKSM LSPNPNERPT AGVVAQHIQS ILGEFTIQSL DLQDAPGTIL LRVEAEHRDD VLRYTMQCIQ NVAEEDDAAE GKIEIVQYGL RSSNRNKDKP TAVMEFALRS SLPGTMFVEY LRKRPEKSRN LEQGEFWFFA PNEEEQQAFF WVCVRQFSLH CKFPKIGSVT QFIFTKSAAQ PLDLQLSTSK SAARHMMRCT NEDGFAWGGE PPG
|
| |