Gene PHATRDRAFT_44952 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44952 
Symbol 
ID7199626 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011673 
Strand
Start bp740238 
End bp741940 
Gene Length1703 bp 
Protein Length518 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002179057 
Protein GI219116524 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.865104 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAACCG TTGACCGTTC TTCGCCTGCT ACAGCAGAGC ATTCCGGCGC TGTTGAGGTA 
CCGCTCTTTC TGTCTCTAGA AACCTGCCGA GCTGCCGAAC TCGCAGCCTG GTTGCAAACT
CGCCGATTTC GGAACCAGGA CACCGACGAG GTCAGCCAGT GCCAGACGAA AGATTCCTTA
ACAGCCACAC AGGAATTCCT AGCTCAAGCG CTCGACTCAC TCCGTGGAAC GGATGTTGAT
ACCGGCAGCG ATCCGGTCGG CTACTCGGTA TCTCCGGAGG GGTCTCTGGA ACGCAGCGTT
GGTAGATCGC ACCACAAACA GCAGGTATCT CTTTTAAAGG ACGACAACAT CACCGTTTCG
TTTACGACAC CACGCGCCAA GTTTGTCGTG ACCGTTTGCG AAAACGGTTT GCTTGCTCGC
CACGACAAGC AAGGTACTCT ACACATTCCC TCCGGTGCCG TTACCCACAT AGTTTTCTTT
CCAAAACCCG AAGACTGTGC ACGCTCGAGT ACGTCCAAGT CACAAACTAC CACCAAGAAT
ACGTTTGACA TGTGTCTACT TTTGCTGGAA CCAAACGCCG TCCAGTTCCA CAAGAAGAAT
CCCAGCAAAA ATAACAACAA GACAATGAAC CAGGTCTGTT TTCAACTTCC CGAAACGCTC
CCTGTATACC GGAAAAACAC TAGCAACACC GAGAACCCCC TACAAGCGCC AAGGCCCCTA
GATCCAACAT CACAGTGGAG TAATGTTCTG TGTACAGCCC TGGGCGTAGA TTCTTCTCGA
AGCGTGGCCC GCGTGTGGCA TCCCTCGTCG CCACCACCAC CATACTGTTC CACCCCACGC
AACCCCTTCA TCTTTGCCTC GCACCAAGAC GCCACCACCA GCAGCACAGT CGACGGAATG
CCCTTTGTCA AATGCTATCA CGGCGTCCAC GACGGAGTTT TGTACCCATT GCGGGAAGGA
TTGCTCTTTT TCAAACCACC ATGTTTTGTG CCGCGCCAGA AGCTGGCGTC GATTGCCTGT
GGTCGTGGCG GAGACACTTC GTCCCGTTAC GTCGACTTGA CGTTGACCAC AACGCAAGAC
AATGAAACCT TCGAATTTAC CAACATTCAT CGGGATGAAC TTGCTACCAT AAATTCGTAC
ATTCACGAAA CTCTCATTCC AGCCATGCAA CGCGATGTTC TAAAGGACTC CGACATCGAA
GACGCATCCT TTAAAGGCAC TGTCTTGACA CGTACAGAGC TTAAACAGGA CAACGAAGAG
TACACTGTAC TTGAAAGAAG TTACTGCCGT CCCAAACGCA AAGCCAGCGT GGAGGCACGG
GCCATAAACC GCAAGGTGCA AAAAAGTCAA CTCGAAAATG ACGATGAGGA CGACGACGAC
GATGATGACG AATTTGTGAA TCGAGACCAA ACTATGGTTC AGGATGACGA CGAAGAGTTC
TCCTCGGACG AGCGTGAAGG AAAGTCCTCC GACGACGAAG AAACCAGCAA TGACGAGGAA
GCCGTGGTGG AGACAGAAGA TCACGAAGAC GAGACGGAAA GCGATGAGGA TTGCTGATGC
TGTTGTGAAT TGATTTTCTC AAATACTTGC GCTGCTCAGC CGCTTTAGTG GCACTGGGTA
TGGTCAACGT CGACCCATTA GTCTATCTCT CTAACGACGG ATAGACTTTA TATTACAAAT
ATAAATCAAC CCTTCATGAT ACT
 
Protein sequence
MRTVDRSSPA TAEHSGAVEV PLFLSLETCR AAELAAWLQT RRFRNQDTDE VSQCQTKDSL 
TATQEFLAQA LDSLRGTDVD TGSDPVGYSV SPEGSLERSV GRSHHKQQVS LLKDDNITVS
FTTPRAKFVV TVCENGLLAR HDKQGTLHIP SGAVTHIVFF PKPEDCARSS TSKSQTTTKN
TFDMCLLLLE PNAVQFHKKN PSKNNNKTMN QVCFQLPETL PVYRKNTSNT ENPLQAPRPL
DPTSQWSNVL CTALGVDSSR SVARVWHPSS PPPPYCSTPR NPFIFASHQD ATTSSTVDGM
PFVKCYHGVH DGVLYPLREG LLFFKPPCFV PRQKLASIAC GRGGDTSSRY VDLTLTTTQD
NETFEFTNIH RDELATINSY IHETLIPAMQ RDVLKDSDIE DASFKGTVLT RTELKQDNEE
YTVLERSYCR PKRKASVEAR AINRKVQKSQ LENDDEDDDD DDDEFVNRDQ TMVQDDDEEF
SSDEREGKSS DDEETSNDEE AVVETEDHED ETESDEDC