Gene PHATRDRAFT_44623 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_44623 
Symbol 
ID7198113 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011672 
Strand
Start bp1076152 
End bp1078372 
Gene Length2221 bp 
Protein Length606 aa 
Translation table 
GC content51% 
IMG OID 
Productpredicted protein 
Protein accessionXP_002178637 
Protein GI219115683 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCTAGAGATC TCGCGAAATT CGAATTCAGA ATTCAGCTTC TCTCTCTTTG CTTTACCAGT 
GTCACCGGTA TTCCTTTTCT TCTTTCGTAA AGGCTCTTTT ACGGTACCGA TCCGACGTTG
ACGTAGAGAC AAAAAATGAA CAGTTGGATG CGTTGCTGCC ATTACCAGGA TTACTACTAG
ATCCCTCTAC TTTTCCAGTG TAACGAGCCA TCCTCAATTT CTAGACGACC AATCTCCCTT
GATAATTGCC GTGCACCGGC GCATCAGCGT AACGCAGACT CTGATCACTT GTTATCCTCC
TCCTGTACAG ATCAAGCCGT ACTACGTTTC GACGGTTTAC ATTCGCATCT TTTATACTGC
CTGCACTCTT TTACTCATAT CCACACCCAC ACGCACAGTC ATGTCGGCGG TTTCGTCTAC
TCGCGAGCGC CGCGCGACTG CTGGCAAGCG CATGAGCGCA CTCGTGGGCC AGGCCTTGGA
CGATGACGCT GCGTTTTGGA ATCACGACAC CTGGGCCGAC GAGCACGACG CATCTGGCAA
CGAGAGTTTC CGAGAATCGG ACGAAGATTC GACGGCTCGC GTCGATGCTT TTGATTCCGA
TTTCGACGAT AGCGAAAGTG ATCACGAGCA GCAGGAACAG GCGGCGGGTG CGGAAGAAGA
ACGGGAATTG CAAAAAACGG AGCGCGGCAA CAAAAAACGA AAACTGTTTG GTGCGGGGCA
GTACGCGGAT ATCGCCAAGG CGGGTCGCGA CCGTATGCCC AAAAAGAAGA AAGTCGGCAA
TCGCGTAGTG GGAGACGGTG TCAATGCTGG TATTGTTCTC AATGTACCGA ACGCTCTCCC
TCGCGGTCCA GCACAGTATT TCGTTGCTGG GGTTTCGAAA CCAATACCCC CGACCACTAT
TAGTTCCCTC CAAGCGAAAC AGTCCGCGGT CTTGGGCATT CGACCGAGGC GGGCTGGGCG
GCCGTCTTCT CGTTTCCGAA CTTCCCGGGG AGATTCAGCA CCATCCATGA CGATCAATGG
TAAACCAGAT GGGTCGAAAA AGCGGCAACC TAAACATAAC TTTACGCAGG AAGAATTGCT
TCTAGAAGCG GCGCTCGAAA CGGAACCGGA AAACCACCGT TGGCTACTTT CACGCCAGCG
AATCCAAGAT CAGCAAGACC GTGATGCCGG TCCGAACGAC CGAAGCCACA GACGAGGAGG
ACGGGTAATT GAGAGTTATG TCTCACGAAG AGGCGCCCCC AATACAATCA CATTCCCCGA
AATGGATTAT GTACCAGAAA TTTTGACGCA ATCCTCTAGC GCGATGCTCG AACGGCTCCG
GATACCGACC TTTTGTGTCG TTACGGGAGA GCGGGCCAAA TATAAAGATC CACTGACTGG
TTTAGAATAT TCCAACGCTG CTGCATTTAA AGAGCTGCGA CGGCGGCATG CTTCTGGTGA
GCTGAAGCCA CAGTCGTCCA AGCCCAAAAC AAAGAAAACG TTGTCAAAAG CCATTTCCAC
GAAAGCAAAT GTTGGTAAAG TTGCCAAACC GATCCCTTGT GGCGATGGAG TATCCACGGG
AGCGAGCAAC GCAAGCTCGG AGCATGGGCA AAATGGAGAG GGGTCTGTCC CTGCGAAACC
AGCCGACTCA AGGAATACTT TAACAGAAAG AGCGAGCAAA TTGAAGCGGG CAGCAACTGT
GACATTGTAC GGGACCTCTC CAATTTCGTC AGAATTGGAC ACAGCGGAAA GAGAGCAACC
GGAAGAACCA CATTTATCAA CTACGCCACC GACAGACAAG TCACTTCCAT TTTCCGACGA
TTCGAAAACA CCGTCACTAC CAGCTTCCAT CGCGCCGAGA ACAAATATTA AACTACTGAC
CAAAGGTGGC GATGCTGATG CTTCCACCGA AATACGAGAA CCTTTGAAAA GTCCAAAAAT
AGTATCGTTC CATGTCCTTT CCGCTTCCAA ATCGTCCGCC TTCGCGTCAC AGAATCATAC
AATCTCTACG GCAATCAACT CACCAACGAG ATCGGCCACT TCACCGCCTT CGGCTCCTCA
CACAACATCT TCCGCTCGCG GTTCTCCCGA TATTTCAGGT CACCGTAGGG CTTCGCCCCG
TCGACGTAAA CCCAGCAGCA AGCTTTTGGA AACTGAATAT ACGCCTTTGG CCATTCCGAA
CTCATCCTCA GGGGAAGAAG TCGACAAGTC GTCGATACCT TTCGAAACGA ACGCTTTGTA
G
 
Protein sequence
MSAVSSTRER RATAGKRMSA LVGQALDDDA AFWNHDTWAD EHDASGNESF RESDEDSTAR 
VDAFDSDFDD SESDHEQQEQ AAGAEEEREL QKTERGNKKR KLFGAGQYAD IAKAGRDRMP
KKKKVGNRVV GDGVNAGIVL NVPNALPRGP AQYFVAGVSK PIPPTTISSL QAKQSAVLGI
RPRRAGRPSS RFRTSRGDSA PSMTINGKPD GSKKRQPKHN FTQEELLLEA ALETEPENHR
WLLSRQRIQD QQDRDAGPND RSHRRGGRVI ESYVSRRGAP NTITFPEMDY VPEILTQSSS
AMLERLRIPT FCVVTGERAK YKDPLTGLEY SNAAAFKELR RRHASGELKP QSSKPKTKKT
LSKAISTKAN VGKVAKPIPC GDGVSTGASN ASSEHGQNGE GSVPAKPADS RNTLTERASK
LKRAATVTLY GTSPISSELD TAEREQPEEP HLSTTPPTDK SLPFSDDSKT PSLPASIAPR
TNIKLLTKGG DADASTEIRE PLKSPKIVSF HVLSASKSSA FASQNHTIST AINSPTRSAT
SPPSAPHTTS SARGSPDISG HRRASPRRRK PSSKLLETEY TPLAIPNSSS GEEVDKSSIP
FETNAL