Gene Tery_3161 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3161 
SymbolpdxA 
ID4243832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4827852 
End bp4828892 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content38% 
IMG OID638108170 
Product4-hydroxythreonine-4-phosphate dehydrogenase 
Protein accessionYP_722762 
Protein GI113476701 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1995] Pyridoxal phosphate biosynthesis protein 
TIGRFAM ID[TIGR00557] 4-hydroxythreonine-4-phosphate dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.551037 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00845056 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGACAG CCGTGAGTGA AATAAAAAGT CCAATTCCTC GTTTAGCAAT AACTATTGGA 
GATCCTACAG GTATAGGACC AGAAATAGTT CTGAAAGCTT TAGCAGATCC GAAGGTGAGA
GAAAATTGTG ATTTGACAAT AGTTGGGAGT CGTCAAATTT TGCAAGAAAC TTACCAACAA
CTATCTCTGG AAAAACTTGT CTATTGGGAA AATTTGAAAA TTTTAGATGT AGAGTTTGAT
GGAAATTTAG AAGATATTTC TCTGGGGAAA GGTAATGCTG TGAGTGGGGA AGCTAGTTTT
TGTTACATAG AAACCGCGAT CGCCCAAACT TTAGCTGGAG AGTTTCAAGG TATTGTCACT
GCTCCTATTT CCAAAACCTG CTGGCAGATG GCTGGTCATA AATATCCTGG TCAAACAGAA
CTTTTAGCCG AAATTGCAGG AGTAAAAAAT TTTGGAATGT TATTTGTTGC CCACTCACCC
CACAGTAATT TTGTACTTCG TTCTCTTTTA GCTACCACAC ATATACCATT AAGTCAAGTA
CCAGCAGCTC TAACACCAGA GTTAATGAGT TGGAAATTAG AATTATTGGT GGAAAGTTTA
CAAAAGGATT TTGGTATTTC TAAACCAAAA ATTGCTGTTG CCGGTTTAAA TCCTCACAGT
GGAGAAAATG GACAATTAGG AACAGAAGAA GAAGATTGGT TAATTCCTTG GTTAGAGAAA
GAAAGCGATC GCCAACCAGA TATACAATTG TATGGACCTG TGCCACCGGA TACCATTTGG
GTTAAACCGG GTCTTGCTTG GCGAGGTTTA GAAGCATTGA CTTGCGATGC TTATTTGGCA
CTTTATCATG ATCAAGGTTT AATTCCAATA AAACTAATGG CATTTGATTT AGCTGTTAAT
ACTACTATTG GTTTACCATT TGTTAGGACT TCTCCTGATC ATGGTACAGC ATTTGATATA
GCGGGTCGAG GTATTGCTGA TGCTACGAGT ATGAAAGCAG CGATAAAGTT AGGAAAAGAG
TTAATTTTAC AAAGAATTTA G
 
Protein sequence
MKTAVSEIKS PIPRLAITIG DPTGIGPEIV LKALADPKVR ENCDLTIVGS RQILQETYQQ 
LSLEKLVYWE NLKILDVEFD GNLEDISLGK GNAVSGEASF CYIETAIAQT LAGEFQGIVT
APISKTCWQM AGHKYPGQTE LLAEIAGVKN FGMLFVAHSP HSNFVLRSLL ATTHIPLSQV
PAALTPELMS WKLELLVESL QKDFGISKPK IAVAGLNPHS GENGQLGTEE EDWLIPWLEK
ESDRQPDIQL YGPVPPDTIW VKPGLAWRGL EALTCDAYLA LYHDQGLIPI KLMAFDLAVN
TTIGLPFVRT SPDHGTAFDI AGRGIADATS MKAAIKLGKE LILQRI