Gene PHATRDRAFT_41501 details

Gene Information

Locus tag: PHATRDRAFT_41501
Symbol:
ID: 7199372
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011699
Strand:
Start bp: 9823
End bp: 11264
Gene Length: 1442 bp
Protein Length: 449 aa
Translation table:
GC content: 50%
IMG OID:
Product: predicted protein
Protein accession: XP_002185470
Protein GI: 219130644
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value: 0.364066
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCTTGC TGGAATCAAT CGGCGCTGCC GAGACTTCAG TACTGGCGGA AAGAAGCACA 
GAAAAGAAAG ACCTTTCCAG CAGAAGCACC AAAGAAACGA ATATTGACCT AATCGTCGAG
CTGTCGATCA CGATCGGATA CGCCATTGTG ACGGGTCTGC TTGCCCGTTG GCTGATAGAT
CGGTACTTGA CACCGCAACA ACTGACAGAT ACCGACCAAC CGTCTTCAAA GGAAGTATAC
AAAGGGTTGC AACGGATCCT CCAAAAACGG AATCGCGGCA ACACTCAATT GCCGCAGCTC
AATTCGTACG AGCTGCAAAT AGCGAACGAG ATTCTCGATC CAGACGATAT AGAAACCAAT
TTTGCCGAAA TTGGAGGTTT GGATTCCACC AAGACAGAAA TCTACGAATT GGCGGTGCTG
CCGTTGGTCC ATCCGGAACT ATTTACCGGG AAACTCGTAC AGCCTTGCAA AGGCATTCTC
CTCTACGGAC GACCGGGTAC GTCAATGGTA ACGCACACAG CACAAACACG CCCCTGTTTT
AATCGAGAAA CCTCTCACTT ACTCTCTCAT CCAACTTGGT CATATTAGGA ACTGGTAAGA
CTATGCTCGC CAAGGCGTTG GCCAAAGAAT CCGAAGCCGT ATTCATTCCT CTGCAGCTGT
CAAAACTCTT GAACAAATGG GTAGGGGAAT CGAACAAACT CATTGCCGGT GCCTTTTCAC
TGGCCCACAA ATTACAGCCT GCCATCTTGT TCATCGACGA GATCGATACG TTTCTGAAAG
CCAATGCTGG TGAAGGTGCA CAGTATCTCG ATACAATTAA ATCCGAGTTT CTGATACTAT
GGGACGGTGT TGCTACCTCC ACCAATTCGA GAGTCATGGT GCTGGGGGCG ACAAACAAGC
CGCAGACGAT TGATCCAGCC ATTCAACGGC GCATGCCGCG TACTTTCCAC GTCCCACTAC
CGAATGTCGC AGGGCGTCAG GCTATTTTAA ATATATTTCT ACAGGAAGAG AAATTGTCAA
TGGACGCACG AGCATGTCTT CCGGAATTGG CTAAGGCAAC GGTGAACTAT TCGGGAAGCG
ACTTAAAAGA GTTGTGCAAG GCTGCAGCCA TGGTTGGGAT ACAGGAGCGC ACCGCCGAAT
ATGCTCGGAA GCGTGTCATG GGCGAAAGTG TAGCTCTGGA TCAGACAATA GGAAATGCTC
CCATGCGACC TATATCGAAA GATGACTTGT TGTCTGCTTT TTCCAAAGTC CAACGAACGG
GGGCAGCAGC ACAAGCATAC GGCCGTCAAA CGGCACGGGA GGATGCTGCC GAGCAAGAGT
CAGAAAGTCC AGCAGTTGAT GCGGAGGCGT TGCGCAACTT AACTCGATTT TTGCATTCAA
TGTCGAATCT TTCCGTCGGC CAGAGCCGTG GGGACGGTAC AGATATCCCC GACCTAAATT
AG
 
Protein sequence
MTLLESIGAA ETSVLAERST EKKDLSSRST KETNIDLIVE LSITIGYAIV TGLLARWLID 
RYLTPQQLTD TDQPSSKEVY KGLQRILQKR NRGNTQLPQL NSYELQIANE ILDPDDIETN
FAEIGGLDST KTEIYELAVL PLVHPELFTG KLVQPCKGIL LYGRPGTGKT MLAKALAKES
EAVFIPLQLS KLLNKWVGES NKLIAGAFSL AHKLQPAILF IDEIDTFLKA NAGEGAQYLD
TIKSEFLILW DGVATSTNSR VMVLGATNKP QTIDPAIQRR MPRTFHVPLP NVAGRQAILN
IFLQEEKLSM DARACLPELA KATVNYSGSD LKELCKAAAM VGIQERTAEY ARKRVMGESV
ALDQTIGNAP MRPISKDDLL SAFSKVQRTG AAAQAYGRQT AREDAAEQES ESPAVDAEAL
RNLTRFLHSM SNLSVGQSRG DGTDIPDLN
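
The listed gene length and GC content can be re-derived from the coordinates and the blocked sequence text. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and the helper names (clean_sequence, span_length, gc_fraction) are hypothetical.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases among all bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Coordinates taken from the Gene Information table.
print(span_length(9823, 11264))  # 1442, matching the listed gene length

# Only the first line of the gene sequence is used here to keep the sketch short;
# paste the full Gene sequence block to check the listed GC content.
first_line = "ATGACCTTGC TGGAATCAAT CGGCGCTGCC GAGACTTCAG TACTGGCGGA AAGAAGCACA"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

The same clean_sequence helper also works on the Protein sequence block, since it only strips whitespace between the ten-character groups.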