Gene Lcho_2044 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLcho_2044 
Symbol 
ID6161952 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLeptothrix cholodnii SP-6 
KingdomBacteria 
Replicon accessionNC_010524 
Strand
Start bp2220827 
End bp2222122 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content71% 
IMG OID641664813 
Productfumarylacetoacetase 
Protein accessionYP_001791076 
Protein GI171058727 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) 
TIGRFAM ID[TIGR01266] fumarylacetoacetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0798972 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTCG ACGACACGCT GGACCCGGCG CTGCGCAGCT GGCTCGAATC GGCCAATGCG 
GCGGACACCG ACTTCCCGAT CCAGAACCTG CCGTTCGGTC GCTTCCGGCT GGCGGGTGAA
ACCGCGAGCG CCTGGCGCAT CGGCGTGGCG ATCGGCGACC AGGTGCTCGA CCTCCGGCGC
GCCGGCCTGA TCGAGCATGG CGACATGGCC CGGCTGATGT CCGCCTCCGC CTCCGTTGCC
GATCGCCGCG CGCTGCGCCG GGCGATCTCG GCGGGCTTGC GTGAAGGCAG TGCGCAGCGG
GCGACGTTCA GCGAGGCGCT GCTGCCGCAG GCCGCGGTGC AGATGGGCCT GCCGTGCGAG
ATCGGCGACT ACACCGACTT CTACACCGGC ATCCACCACG CCACCACCGT CGGCAAGCTG
TTCCGGCCCG AGGCCCCGCT GCTGCCCAAC TACAAGTGGG TGCCGATCGG CTATCACGGC
CGGGCCTCGT CGATCGTGGC GAGCGGCCAG GACTTCCACC GCCCGCTCGG CCAGGTCAAG
GCCGCCGATG CCGAGGCGCC GGTGCTGCGC CCGAGCGGCC GGCTCGACTA CGAGCTGGAG
CTGGGCATCG TGATGGCGCG GCCCAATGCG CTGGGCGAGC CGGTGCCGAT GGCTGCGGCC
GAGGACCACG TGTTCGGTCT CACGCTGCTC AACGATTGGA GCGCACGCGA CCTGCAGGCC
TGGGAATACC AGCCGCTCGG GCCCTTCCTG TCGAAGAACT TCGCCAGCAC CGTGTCGCCG
TGGATCGTCA CGCTGGAGGC GCTGCAGCCG TTTCGCGCGC CGCCCGAGCG CCCGGCGGGC
GATCCGCTGC CGCTGCCGTA TCTGGATTCG CCGTACAACC GCGAAGCCGG CGCGATCGAC
ATCACGCTCG AAGTCTGGCT GCAGACCGCC GCGATGCGTC GTGCGGGCCT GGCCGCGCAG
CGGCTCAGCA CGTCCAACTA CCGCGACGCC TACTGGACGC TGGCGCAGCT GGTGGCGCAC
CACACCGTCA ACGGCTGCAA CCTGCGCAGC GGCGATCTGC TGGGCACCGG CACCTTGTCC
GGCCCGCAGC CTGATCAAGC CGGCTCGCTG CTCGAACTGA GCCTGGGCGG CCAGCAGCCG
GTCACGCTGG CCAACGGCGA ACAGCGGCGC TTTCTGGAGG ACGGCGACAG CGTGATCCTG
CGCGCCTACT GCGAGCGGGA TGGCCACCGG CGCATCGGTT TCGGCGAATG CATCGGCACC
GTGCTGCCGG CGCGCCAACT TGAAGGAGCG ACGTGA
 
Protein sequence
MSFDDTLDPA LRSWLESANA ADTDFPIQNL PFGRFRLAGE TASAWRIGVA IGDQVLDLRR 
AGLIEHGDMA RLMSASASVA DRRALRRAIS AGLREGSAQR ATFSEALLPQ AAVQMGLPCE
IGDYTDFYTG IHHATTVGKL FRPEAPLLPN YKWVPIGYHG RASSIVASGQ DFHRPLGQVK
AADAEAPVLR PSGRLDYELE LGIVMARPNA LGEPVPMAAA EDHVFGLTLL NDWSARDLQA
WEYQPLGPFL SKNFASTVSP WIVTLEALQP FRAPPERPAG DPLPLPYLDS PYNREAGAID
ITLEVWLQTA AMRRAGLAAQ RLSTSNYRDA YWTLAQLVAH HTVNGCNLRS GDLLGTGTLS
GPQPDQAGSL LELSLGGQQP VTLANGEQRR FLEDGDSVIL RAYCERDGHR RIGFGECIGT
VLPARQLEGA T