Gene PHATRDRAFT_37723 details

Gene Information

Locus tag: PHATRDRAFT_37723
Symbol:
ID: 7202599
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Phaeodactylum tricornutum CCAP 1055/1
Kingdom: Eukaryota
Replicon accession: NC_011681
Strand:
Start bp: 788425
End bp: 789837
Gene Length: 1413 bp
Protein Length: 470 aa
Translation table:
GC content: 60%
IMG OID:
Product: predicted protein
Protein accession: XP_002181804
Protein GI: 219122961
COG category:
COG ID:
TIGRFAM ID:
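The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its terminal stop codon (the listed gene sequence ends in TAG). The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS that includes its stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(788425, 789837)
print(length)                     # 1413
print(protein_length_aa(length))  # 470
```

Under these assumptions the results agree with the Gene Length and Protein Length fields of this record.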


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGTGG CGGCCATTCC CGTCCAGCGC GCTTCCAGTA CGCATCTACC GCAGTGGCAG 
CGGTTCCAGC CGGAGCGTAC GGCATCGGAG CATCCCACCG GTACGCATCC CTACCAGCGA
CAGTATGCAC ACGTCTACCA CCAGCGCTTG GCTGTCCTCG GACCCCGCGT CTGGCAAGCG
GTCCTTCGAG ACAACCAGTG CGAGAACGAC AACCACAACC ACAGTAACAA TAACGACGAT
GCCCACACGG TTCGGCACGT CCCACGGATT CTCGAACTAG AAGAAGGCGT CTTATCGATT
GCCGTTGGAA CCATTGTTAA GGAATACGAA AACAGTGTCA CCGGTAACGA CATCAACAGC
GACGCTGTCG TTCCGGGCGC CGTCAACGGT GCCAAAGACG CTCTGGTCCT CGAAGACGAA
AGTGGACGCG TGGTGCTCGC TACCGCACTA GTACACCAGT ATCCGACCGG AGTCGTCCTG
GGGGTCCAAG GAACTGTCGG CACGGACGGA GTTTTGCAAG TCGAACGCTT CTACCATCCC
TGGACATGTG CACCACCCGC CTTGCCTTTG TATATCGACC ACACCAACAA TATCAACAAC
AACAACAACG ACCCTACCTA TATCATGCTC GTTTCGGGAT TGCACTGCGG CAGCCCCAAA
GTCTCGTCGC TACCCCGCGA CATGCTCCTC TCCTACCTAC AGGGTCGTTT CGGACACAAG
GCCCGTCACG TCGCCCGCGT CATTTTCGCC GGTGGTCTCA CCTCCACCGA CGCCACCGCC
GTACAGGAAC TCGACGGATT CCTACTCGCC CTCGCCGCAT CGGGTGTCCC TATTGACGTC
CTACCCGGTG AACACGACCC GACCACCGCC AATTGGCCCC AACGGAGTCT GCACCGGGCC
TTGCTCCCAC ACACCACCAC CCGCTACGGA ACACTCGTCG CGCGAACCCC CAACCCCTAC
GCCGCTCGAC ACGATCACGT CGTTTGCCTC GGAACGGACG GACGCAACGT GCGGGACTTG
TGTACCCGCG TGGGAGTTCC CGTCGATGAC CACGACTCGC CCGGAGCGTG GCGGCCCGTG
ACGGAACTCC AAGCGCTCGA ACGTACCCTC GCCTGGGGAC ACGTCTGTCC CACCGGTCCG
GACTCGGTTC CCACCGTCCC CCACGCTCTG CAAGATCCCA TGGTCATTGA ACCCCACTTA
CCGCATCTCT ACTTTGCCGG TAACGCCAAA AAGTTCGCCA CCCAACGTGT TGTTGCCGCG
CACGCGGATA CTGCTACTGC TGTGGACACC GACGATCGTG TCGCGTTCAC CCGACTCGTC
TGTGTTCCCC AGTTTAGTGA AACCGGACAG GCGGTACTCG TGAATCTGCA AACCCTGGAC
GTGGAAGTCG TGCGTTTTCA GGATGAAGAA TAG
 
Protein sequence
MTVAAIPVQR ASSTHLPQWQ RFQPERTASE HPTGTHPYQR QYAHVYHQRL AVLGPRVWQA 
VLRDNQCEND NHNHSNNNDD AHTVRHVPRI LELEEGVLSI AVGTIVKEYE NSVTGNDINS
DAVVPGAVNG AKDALVLEDE SGRVVLATAL VHQYPTGVVL GVQGTVGTDG VLQVERFYHP
WTCAPPALPL YIDHTNNINN NNNDPTYIML VSGLHCGSPK VSSLPRDMLL SYLQGRFGHK
ARHVARVIFA GGLTSTDATA VQELDGFLLA LAASGVPIDV LPGEHDPTTA NWPQRSLHRA
LLPHTTTRYG TLVARTPNPY AARHDHVVCL GTDGRNVRDL CTRVGVPVDD HDSPGAWRPV
TELQALERTL AWGHVCPTGP DSVPTVPHAL QDPMVIEPHL PHLYFAGNAK KFATQRVVAA
HADTATAVDT DDRVAFTRLV CVPQFSETGQ AVLVNLQTLD VEVVRFQDEE
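The sequences above are laid out in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch with hypothetical helper names. It uses only the final line of the gene sequence as input, so it is not expected to reproduce the record's whole-gene GC content figure.

```python
def clean_sequence(block_text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Final line of the gene sequence as displayed in this record.
last_line = "GTGGAAGTCG TGCGTTTTCA GGATGAAGAA TAG"
fragment = clean_sequence(last_line)

print(len(fragment))             # 33
print(fragment.endswith("TAG"))  # True: the CDS ends with a stop codon
print(round(gc_fraction(fragment), 3))
```

Passing the full gene sequence through `clean_sequence` in the same way would allow its length to be compared with the Gene Length field.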