Gene Lferr_0344 details


Gene Information

Locus tag: Lferr_0344
Symbol:
ID: 6876295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidithiobacillus ferrooxidans ATCC 53993
Kingdom: Bacteria
Replicon accession: NC_011206
Strand:
Start bp: 322838
End bp: 323839
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 65%
IMG OID: 642788217
Product: 4-hydroxythreonine-4-phosphate dehydrogenase
Protein accession: YP_002218805
Protein GI: 198282484
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1995] Pyridoxal phosphate biosynthesis protein
TIGRFAM ID: [TIGR00557] 4-hydroxythreonine-4-phosphate dehydrogenase
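
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive (the record does not say so, but the listed length agrees with that reading) and that the CDS ends in a single stop codon. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(322838, 323839)
print(length_bp)                  # 1002
print(protein_length(length_bp))  # 333
```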


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.822014
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.618609
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCACGG AGCCACGTTT GCTGCTGACC GTGGGGGAAC CCGCGGGTAT TGGTCCGGAT 
ATCTGTTTGC AACTGGCCTT CCATGCCCTG CCCTCGGGGG TGCTGCTCAT CGGGGATCTG
CACTGTCTGC GCAGCCGCGC CCTGACTTTG GGCTTGTCGC TGCGACTGGA GCCCTGGCTG
GAAGGCAATC CCTGGCCGGC GCTGGAAAGG GGCGTGTTGC ATGTGCTGGA TGTGCCCCTG
GCTCAGCCTT GTCGGCCCGG TCGTCTGGAT ATGGCCAATG CGCCGGCGGT ACTGGCTACT
CTGGATAAGG CGATGCATCT GCTACGGGCA GGTGCCGCGG ATGCGCTGGT GACGGCCCCG
GTGCACAAGG GCATCATCAA CGACGCGGGG ATTCCCTTTA CCGGACATAC GGAATATCTC
GCGGCAGCGT GTGGCAGCCC GAAGGTGGTC ATGCTGCTCG CCGGTCGCGG CTTGCGGGTG
GCACTGGCAA CGACGCATCT GCCGTTGGCG CAAGTGGCGG CGGCCGTTAC CCAGGAGGGG
CTGGAGGGCA CGCTGCGCAT TCTGCACCGG GCGTTGCGCG AAGACTTCGC GCTCTCGGAA
CCCCGTATTC TGGTCGCCGG CCTGAATCCC CATGCCGGGG AGGGCGGGCA TCTGGGGCAC
GAGGAGCAGG ATGTCATTGC GCCGGTAATA GCGGCCCTGC AGGGAGAAAG TCTGCGGATC
AGCGGTCCGT GGCCGGCCGA TACGCTGTTT ACCCCGCGCC TGCTCGAAGA TGCCGATGCG
GTACTGGCCA TGTATCATGA TCAGGGCCTG CCGGTACTGA AGTACCATGC CTTCGGTGAG
GCGGTGAATA TTACTCTGGG CTTGCCCATC GTGCGCACCA GTGTGGACCA CGGCACGGCG
CTGGACATCG CGGGCACAGG CCGGGCAGAA GGCGGCAGCC TGCTCCGGGC CCTGGACAAT
GCTGCGGAAA TCGTTCGCAA CCGGCGCGCC GCCGGTTGCT GA
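
The gene sequence is printed in space-separated blocks of 10 bases, so it has to be joined before it can be measured. The sketch below strips the whitespace and computes length and GC fraction; only the first printed line is used as sample input, and the helper names are illustrative, not part of any database tool. Running the same functions on the full sequence should give values comparable to the Gene Length and GC content fields.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First printed line of the gene sequence, copied as shown in the record.
first_line = "ATGATCACGG AGCCACGTTT GCTGCTGACC GTGGGGGAAC CCGCGGGTAT TGGTCCGGAT"
seq = clean_sequence(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this 60-base sample only
```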
 
Protein sequence
MITEPRLLLT VGEPAGIGPD ICLQLAFHAL PSGVLLIGDL HCLRSRALTL GLSLRLEPWL 
EGNPWPALER GVLHVLDVPL AQPCRPGRLD MANAPAVLAT LDKAMHLLRA GAADALVTAP
VHKGIINDAG IPFTGHTEYL AAACGSPKVV MLLAGRGLRV ALATTHLPLA QVAAAVTQEG
LEGTLRILHR ALREDFALSE PRILVAGLNP HAGEGGHLGH EEQDVIAPVI AALQGESLRI
SGPWPADTLF TPRLLEDADA VLAMYHDQGL PVLKYHAFGE AVNITLGLPI VRTSVDHGTA
LDIAGTGRAE GGSLLRALDN AAEIVRNRRA AGC
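
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own method: it builds the codon table from the standard amino-acid ordering over the bases T, C, A, G, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because this gene starts with ATG). Only the first printed line of the gene sequence is used as sample input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGATCACGG AGCCACGTTT GCTGCTGACC GTGGGGGAAC CCGCGGGTAT TGGTCCGGAT"
print(translate(first_line))  # MITEPRLLLTVGEPAGIGPD
```

The output matches the first 20 residues of the protein sequence above. The gene's final codons, GCC GGT TGC TGA, likewise give the closing residues AGC followed by a stop.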