Gene PHATRDRAFT_44344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44344 
Symbol 
ID7198033 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp283724 
End bp286832 
Gene Length3109 bp 
Protein Length980 aa 
Translation table 
GC content48% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178200 
Protein GI219114809 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.195415 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCTGAGGCCG ACCATCCGGA GAAAACGCTG ACCGACACAA GCCACAATAA TTGTAAAGGG 
CATCAATAAC ATTTATCGAT GATCACGGTT CAGAAGGCAT CGCTGGTATT TTGGTGCGCC
ATCGCTTCTA CTTTTTCCCG GGTAGTTCAT GGTCGAACTA ACTGTGAGCA GCTGTTCGGT
TCAGTAATTT CGGCAATTCC TCGTGGAGGT GAGATCGCGG TGCCGTCAGT CGCGCCGTCC
TTCTTACCGA CAGGTGGTCG CTCAGACGAC ACGCACAGGC GAAAGAGAGT CGTAAAAAAG
AAGAAAGTGC ACAAAGGGGA ATCGAAATTG CCGATAACTC AAGACGATCG TTATTCCAAC
GGCGAAAGAA AAGCAAAGCA AAAGAAGATA AAGAAAGTGA AAAAGAGAAA GGTTTCGCAG
CCGGAATCTT CTGAAAGTTC TCCGAGTAAC GTTGCGAAGC AGCTGGAAGC GCCGCAATCA
AGAAAGCAAA AAAGGAAAGT AAAAAAGAAA CACATTGATA CGTCCAATGA GCCCGCTGTC
GCTTGGGCGA AGAAGGAATT TCGATCCCGG TCACCATCGG TAAAGGGGGG ACCTGTGAAC
TCAGTGGTAA GCGAAAAGCA GAATGGTTTG CAGAAGTTAC CGCAGAAGAA AAGTAAAAAG
GTCAAAAAGC GCAAAAAGGA GGCCGGCGAA GGGAAGGAGC AACCGTTATC TCACCCGGAC
CAAAGAATGC CCCCCGGGAC TCCGGCTCAA GACTCTTCGC AAAGTAAGGG CTCAGAGAGC
AGTGATGACT TGCCGATTAA TTACTCCGAG AGAGCAGAGA GAGAAGAGAC TTCTCTGGTT
GAGAAGTCTG AAGCACAAAT CGCTCCAATC GGACAGCAAG TGGATGAAGA CTCAGAGCGA
CCTGTCGAGC ACACAGTCGA AAAACGAGAC GGAGATTCGA CTTCAGGATC GCCAACACAG
ATTGACGCGA ATCTCAATTC AGCAAGCAGA TTGGTCGACG TCGAAGTCGC CCAATCTCTT
CAGAATGACA CGAATCTCGA CGATAGGAAA TTACACCAAA CAGAACTGAA GTCTTTTGGG
ACTGAAGATT CATTCCAGGA TACTTTTCCA GAAGAGAAGA GCACCGATCC GGTAGCCGAA
GACCACAATA CACCAGTAGT GGCTCCTAGA TTGAGCGGTG AGCCCTCGAT AGACGCTTCT
ATTGAAGAAG TCTCTGACTC AAGTGATACC ACCGATACAG AAGACAGTTC TAGTGACAGC
GACGAGAGTG AAACCACAGA GACAGAGGAT AGTTCTAGCG ACACCGACGA GAGTGAAACC
ACAGAGACAG AGGATAGTTC CAGCGACGCC GATGAGAGTG AAATTTCCAT TGATATGACG
GCATCAAGCG CCTCTAGTGA TAGAACCGAA AGCTACGAGG AATCGTTAAC GGCTGGCAAT
GAGACGGTCG AAGCTGATCT AGAACAAGGC TATTCATTGA ATGAGTCGAA AGACGCCGAA
ACCGAATCAA ATTCAGAGGT GAAAAACTTC AAAGACAACC CAACTTTATC GGGGTTAGCA
GGCGAAAGCG AGAACTCGAG TCCAAATTTT GACAAACGCA TCAAAAGTTG CAATACTGAA
TACACCTCAA ATCGAAACAG CTCGTTGATT ACGGATAAAC AAAGCTTGGC AAGCACAGCA
GGCGGATCCG AAGGAATTGG GGTGGATGAC GTGCAAACCA ACGCATCTAT TGCAGATCGA
AACTTTTCCT CTTTGGAAGA GGGAGAGGGA CACTATGACG CAACTAGTGA CGACGAAGAC
AAGAAATTTA AGCAACTCGA TGCTCCATCT TTGGAAGATT CGGACTATGC GGCTGGGGAA
GGCAGCATGA ACAGAAACTG GAAAGTCGAC CTCTCTGAGC TCCGATCGCT GCAGGACCAT
GAAGATGATA TAAATGTGTC GATTGTAACT TGGAATTTAG CTGAAGAGTC ACCTTCAGAG
GAAGACGCCT CATTTATTCG ACGTTTTCGA CGCCGAAATG ATGTACAGAA GTCCAGCGAT
TTCGTACTGA TATCAGGACA GGAGTGCGAA AACATTAAGC CGAGAAGGAC AGAAGGACAT
CGATCTCGAG AGTTCCGGCG GTTGATGATC AAGATGTTGG GGAAACAATA TGTGCCCATA
GCGCTGCATT CTCTAGGTGG AATTCAATTC GGATTGTTTT GCAAGCGATC GATTCTAAGT
GAGGTTGAAA CTATCTCTGT CGCGGACGTT ACCTGCGGAA TTGGCAACGT ATTCCACAAC
AAAGGCGCTA TCGCAGCATT CGTCCAGATC AAGGCGAAAC AATGTAGCGA GGGGGAAGCC
ATCGGACCAA ATCGTGACAA GTCCGTACGG ATGATGTTTG CGACCGCCCA CATGGCGGCT
CACGTGAAGA ACACTGAGGC TCGAGACTCT GATTTCTGGA GAATTGTGTC TGAGCTGGAA
GCGCAAGCGC CGCCGAGATT TCTCTCATCA AATATTGTCG AGTCTAGCAA GGAAAGGGAA
TGCTCAGGAT CAAAGCTTCT AGAATCAATG GATCGCATTT TCTTTTGTGG GGATCTTAAC
TACCGAGTTG ACCTTCCTCG CGAAATTTCT GAGCACACTC TGCTTCAGAT GAAGCGCCTC
CAGGAGATCG GAGACGAAAA GTCTTTACAA AAGGCCGAAC TCTTGCGATT AGAGCTCTTG
AGACACGATC AACTCATCTG TAGCATGTCT GAGAAACGAG CCTTCCCAGG CTTTGCGGAA
GGAAAAATAT CCTTTGCGCC GACTTTTAAA TTTGACAAAG GCACACCAGA GTACGATAGC
TCGTATAAAC AACGCATACC TGCATGGACA GATCGCGTTC TATTCAAACC CATCGGGACG
CGGGTACTGG AGTATGATAG CATCTCGGAT GCTCAGCATT CCGATCATCG TCCAGTCTAC
GCCACGTTTC GCGTCAGTCG TCAAGGGCGG CAAGTTCCCA AATCGAAGCC GAGAACAAAG
AAGCGAAGCC GTCGGAAGTG AACGCACATA TACCTACAAG TTAGCTCCAA GTTACTGGAT
TTTAAGATAG AAGCCATTTA GTGTACGATA ATCGCCGGAT ACACCTGAG
 
Protein sequence
MITVQKASLV FWCAIASTFS RVVHGRTNCE QLFGSVISAI PRGGEIAVPS VAPSFLPTGG 
RSDDTHRRKR VVKKKKVHKG ESKLPITQDD RYSNGERKAK QKKIKKVKKR KVSQPESSES
SPSNVAKQLE APQSRKQKRK VKKKHIDTSN EPAVAWAKKE FRSRSPSVKG GPVNSVVSEK
QNGLQKLPQK KSKKVKKRKK EAGEGKEQPL SHPDQRMPPG TPAQDSSQSK GSESSDDLPI
NYSERAEREE TSLVEKSEAQ IAPIGQQVDE DSERPVEHTV EKRDGDSTSG SPTQIDANLN
SASRLVDVEV AQSLQNDTNL DDRKLHQTEL KSFGTEDSFQ DTFPEEKSTD PVAEDHNTPV
VAPRLSGEPS IDASIEEVSD SSDTTDTEDS SSDSDESETT ETEDSSSDTD ESETTETEDS
SSDADESEIS IDMTASSASS DRTESYEESL TAGNETVEAD LEQGYSLNES KDAETESNSE
VKNFKDNPTL SGLAGESENS SPNFDKRIKS CNTEYTSNRN SSLITDKQSL ASTAGGSEGI
GVDDVQTNAS IADRNFSSLE EGEGHYDATS DDEDKKFKQL DAPSLEDSDY AAGEGSMNRN
WKVDLSELRS LQDHEDDINV SIVTWNLAEE SPSEEDASFI RRFRRRNDVQ KSSDFVLISG
QECENIKPRR TEGHRSREFR RLMIKMLGKQ YVPIALHSLG GIQFGLFCKR SILSEVETIS
VADVTCGIGN VFHNKGAIAA FVQIKAKQCS EGEAIGPNRD KSVRMMFATA HMAAHVKNTE
ARDSDFWRIV SELEAQAPPR FLSSNIVESS KERECSGSKL LESMDRIFFC GDLNYRVDLP
REISEHTLLQ MKRLQEIGDE KSLQKAELLR LELLRHDQLI CSMSEKRAFP GFAEGKISFA
PTFKFDKGTP EYDSSYKQRI PAWTDRVLFK PIGTRVLEYD SISDAQHSDH RPVYATFRVS
RQGRQVPKSK PRTKKRSRRK