Gene Lcho_2042 details


Gene Information

Locus tag: Lcho_2042
Symbol:
ID: 6161950
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Leptothrix cholodnii SP-6
Kingdom: Bacteria
Replicon accession: NC_010524
Strand:
Start bp: 2219206
End bp: 2220282
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 67%
IMG OID: 641664811
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_001791074
Protein GI: 171058725
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase
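
The start, end, gene length, and protein length fields above can be checked against each other with a little arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(2219206, 2220282)
print(length_bp)                  # 1077
print(protein_length(length_bp))  # 358
```

Both values agree with the Gene Length and Protein Length fields of the record.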


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.0443757
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGATC TGTTCGAAAA CCCGATGGGC CTGATGGGCT TCGAGTTCGT CGAGTTCGCC 
TCGCCCACGC CGGGCGCGCT GGAGCCGGTG TTCGAGATGC TGGGCTTCAC GAAGGTGGCC
ACGCACCGCT CGAAGAACGT GGTGCTGTAC CGCCAGGGCG GCATCAACTT CATCGTCAAC
AACGAGCCGC GTAGCGCGGC CGCGTACTTC GCCGCCGAAC ACGGCCCTTC GGCCTGCGGC
ATGGCGTTCC GGGTGCGCGA CTCGCACAAG GCCTACGCCC GCGCGCTCGA ACTCGGCGCC
CAGCCGGTGG AGATCCCGAC CGGCCCGATG GAGCTGCGCC TGCCGGCGAT CAAGGGCATC
GGCGGCGCGC CGCTCTACCT GATCGACCGG TTCGAAGAAG GCAAGAGCAT CTACGACATC
GACTTCGATT TCCTGCCCGG CGTCGACCGC CACCCGGCGG GCCACGGCCT CAAGCTGATC
GACCACCTGA CGCACAACGT CTACCGCGGC CGCATGGCGT ACTGGGCGGC GTTCTACGAA
CGGCTGTTCA ACTTCAGGGA GCTGCGCTAC TTCGACATCA AGGGCGAATA CACCGGCCTG
ACGAGCAAGG CCATGAGCGC GCCCGACGGC AAGATCCGCA TCCCGCTGAA CGAGGAATCG
TCGCGCGGCT CGGGCCAGAT CGAGGAGTTC CTGATGCAGT TCAACGGCGA AGGCATCCAG
CACATCGCCC TCTACACCGA CGACCTGCTC GGCACCTGGG ATTCGCTCAA GAAGGCCGGC
CTGCCCTTCA TGACCGCCCC GCCGGCCACC TACTACGAGA TGCTCGAAGG CCGCCTGCCC
GGCCACGGCG AGCCGGTGGG CGAACTGCAG GCGCGCGGCA TCCTGCTCGA CGGCAGCAGC
ACGCCCGGCG ACCAGCGCCT GCTGCTGCAG ATCTTCTCGC AGACGCTGCT CGGCCCGGTC
TTCTTCGAGT TCATCCAGCG CAAGGGCGAC GACGGTTTCG GCGAGGGCAA CTTCAAGGCA
CTGTTCGAGT CCATCGAGCG CGACCAGGTG CGCCGCGGCG TGCTGGAGGC GGCATGA
 
Protein sequence
MSDLFENPMG LMGFEFVEFA SPTPGALEPV FEMLGFTKVA THRSKNVVLY RQGGINFIVN 
NEPRSAAAYF AAEHGPSACG MAFRVRDSHK AYARALELGA QPVEIPTGPM ELRLPAIKGI
GGAPLYLIDR FEEGKSIYDI DFDFLPGVDR HPAGHGLKLI DHLTHNVYRG RMAYWAAFYE
RLFNFRELRY FDIKGEYTGL TSKAMSAPDG KIRIPLNEES SRGSGQIEEF LMQFNGEGIQ
HIALYTDDLL GTWDSLKKAG LPFMTAPPAT YYEMLEGRLP GHGEPVGELQ ARGILLDGSS
TPGDQRLLLQ IFSQTLLGPV FFEFIQRKGD DGFGEGNFKA LFESIERDQV RRGVLEAA
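
As a cross-check, the gene sequence can be translated and compared with the protein sequence. This is a minimal sketch, assuming the codon assignments of NCBI translation table 11 (for elongation codons these match the standard code) and that the spaces and line breaks in the listings are formatting only. `clean`, `translate`, and `gc_content` are hypothetical helper names, not part of the source record.

```python
# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the blanks and line breaks used to group bases in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base and stop at the first stop codon.

    Raises KeyError on ambiguous bases such as N.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence listing.
first_line = "ATGTCCGATC TGTTCGAAAA CCCGATGGGC CTGATGGGCT TCGAGTTCGT CGAGTTCGCC"
print(translate(first_line))  # MSDLFENPMGLMGFEFVEFA
```

The printed residues match the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` extends the same check to the whole record.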