Gene PHATRDRAFT_40880 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPHATRDRAFT_40880 
Symbol 
ID7198791 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism namePhaeodactylum tricornutum CCAP 1055/1 
KingdomEukaryota 
Replicon accessionNC_011694 
Strand
Start bp175436 
End bp176999 
Gene Length1564 bp 
Protein Length416 aa 
Translation table 
GC content54% 
IMG OID 
Productagmatinase 
Protein accessionXP_002184908 
Protein GI219129463 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCAGTG CCACGGTTGC CAACGCTCGT CTCGCCTTTC AATCGGTCGC TCGGAAAGCC 
GCCCGCTTTA CTCCGACCGT GGGGGCCTCG CAAACCCAGA TCCGTTTCCA CCATCCTGAT
CCCTTCAACC CCAAGGTCAC CAAAGGATGG AAGGCCGCTG TCAAGGTGCG TGTATGTATG
CAAGCATGAA TGCCTGCATA CTATCCTTGT CAACGATACG TGTTGCTCGT AGCCGAGCTT
GTTTTTTTGG TCGACGTAGC CGCATACTGC TTCGCTACTA GTTACGATAC TCTGCAGTCT
CCCGGTGCCC TCTGTTCACA GTCAATCTCC GTCCTGTCCT CTCACGTCTA CCACTTTGAC
TATTCTTCAA CAGGAAGCTG AGTTGCCGAC AACGCGGGCG GATCAAGAAA TTGCCAACGC
CCTCCACTTG GGTTTACAGG GTGCCAGCAG TATCGAGGAC AAGTCGATAC CGACGTTTTC
CCGTGGTGAA CTACCACACT TTGCCGGTAT CAATACCTTC CTCAAGGCTC CGTACGTCGA
AGATGTCCGG GACGTTGGCA AGTACGACGC CACCGTCTTT GGCGTGCCCT TTGACGGGGG
TTGTACCTAC CGTTCCGGTA CCCGTTTTGG CCCTCAAGGA ATCCGACGCA TCTCGGCATT
GTACACTCCG TACAATTACG AACGCGGCAT TGATCTTCGG GAACAAATGA CCTTGTGCGA
TGCGGGAGAT GTGTAAGTGA CGCTGCTTGA TCGCAGAGCG TAAGACAGAC TGACAACTTG
ACTCGGTGCA CGGTCGGTCT GACTGTTGAT CAACTCGTCA ATTTTTCTTC TTTTCAGATT
TACCATTCCC GCCAACTTGG AAAAGTCATT TGACCAGATT AGCAACGCTG TGGCTCACAT
TGCGAGTACC GGTACTATGC CCATTATTCT TGGTGGGGAT CATTCTATCG GTTTCCCCAC
CGTCCGTGGG TTGGCCTCGG TGACGACCAA AAACATTGGT ATCATTCACG TGGATCGTCA
CGCGGACATT CAGGAAAAGG ATTTGGACGA ACGCATGCAC ACGACACCGT ACTTCCACGC
GACCAATTTG CCCAACGTCA ACGCCAAGAA CCTCGTCCAA ATCGGTATCG GTGGATGGCA
GGTTCCCCGC CCGGCTGTGG CCAACATGGT CGAACGCGAA ACTAACATTT TTACCATGGA
CGACATTGAA GAATACGGTA TCGAAAAGAT TGCCGAAATG GCTTTGGAAC GTGCCTGGGA
CGGCTGCGAT GCGGTCTACA TGAGTTACGA CATTGACAGC ATCGAAGCCG CATTTGTGCC
CGGCACGGGT TGGCCCGAAC CGGGCGGTCT CTTGCCTCGT GAAGCCCTCA AACTAGTGGG
ACTCGTGGCC GCCGAAGGTC TCTGCGGCAT GGAAGTCGTC GAAGTCAGCC CGCCCTACGA
TCACGCCGAC ATTACGTCCC TCATGGCCTT GCGCATCGTC GTAGACGCCC TCGGCTCCAT
GGTTTCACAC GGAACCATGG GCAAACATAA GCACATTATC GACAAGGAAT TCGTTCCCTT
TTGA
 
Protein sequence
MSSATVANAR LAFQSVARKA ARFTPTVGAS QTQIRFHHPD PFNPKVTKGW KAAVKEAELP 
TTRADQEIAN ALHLGLQGAS SIEDKSIPTF SRGELPHFAG INTFLKAPYV EDVRDVGKYD
ATVFGVPFDG GCTYRSGTRF GPQGIRRISA LYTPYNYERG IDLREQMTLC DAGDVFTIPA
NLEKSFDQIS NAVAHIASTG TMPIILGGDH SIGFPTVRGL ASVTTKNIGI IHVDRHADIQ
EKDLDERMHT TPYFHATNLP NVNAKNLVQI GIGGWQVPRP AVANMVERET NIFTMDDIEE
YGIEKIAEMA LERAWDGCDA VYMSYDIDSI EAAFVPGTGW PEPGGLLPRE ALKLVGLVAA
EGLCGMEVVE VSPPYDHADI TSLMALRIVV DALGSMVSHG TMGKHKHIID KEFVPF