Gene Acid345_2556 details

Gene Information

Locus tag: Acid345_2556
Symbol:
ID: 4072200
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 3019411
End bp: 3020862
Gene Length: 1452 bp
Protein Length: 483 aa
Translation table: 11
GC content: 58%
IMG OID: 637984573
Product: amidohydrolase
Protein accession: YP_591631
Protein GI: 94969583
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
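
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS length that includes one stop codon; the function names are illustrative and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose length includes one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(3019411, 3020862)
print(length, protein_length(length))  # 1452 483
```

Under these assumptions the result agrees with the Gene Length (1452 bp) and Protein Length (483 aa) fields.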


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCTGT TGCTTTCAAT ATCCGCACTC GTTTGCTGCA GCGCGATGCT CTCCGCGCAG 
GGGAAACTCA GCGAAGATGT GCAGCGTTAT GTGAAGGTGA ATTCCGCGCG CGTGGTGCTG
GAACATGTTC GCGTGATCGA CGGTACCGGC AAAGCGCCTG TCGAGGACCA GAACGTCGTG
ATCGAGAACG GCAAGATCAC CGCGATTCAA CTTGGTGCAG ACGTGAAGGC CGGCGCAAAC
GAAACCGTGC TCGATCTTCG CGGTTCCACG GTTTTTCCGG GGATCGTCGG GATGCACGAC
CATATGTACT ACATCGCGCG ACCGAACTTG GCTGCCGACG GCAGCTCCGA GCCGCCGTTG
ATCGTGCCGC AGATGACGTT CACTTCGCCC AGGCTTTATC TGGCGGCGGG GGTGACCACG
CTGCGTACGA CCGGCAGCGT TGAGCCGTAC ACCGATCTCA ACCTCCGCGA CCTGATCAAC
AAAGGCGAAC TGGTTGGTCC GCACATGGAC GTTACCGGCC CCTACCTCGA AGGGTCGGGC
AGTCCGTTCA TGCAGATGCA TCCGCTGAAG GACGCGGAGG ATGCGCGGAA GACGGTTGCG
TTTTGGGCGG ACCAGGGCGC GACGTCATTC AAGGCTTATA TGAACATCAC TCGCGATGAG
CTGAAAGCGG CTATTGATGA GGCGCATCGC CGCGGGTTGA AGATTACCGG TCATCTTTGC
TCAGTCACCT ATCCGGAAGC CGCCGACTTG GGCATAGACG ACCTTGAACA TGGCTTCTGG
GTGAACACTC AACTGGACCC TGACAAAGCG CCGGATGTGT GCTCCAAGGC GGCAGGCGGA
CCGACGCTCG AGAAGATGGA TCCAAACGGT GCTGAGGCCA AGGCACTCAT TGAGAAGCTC
GTCAGCAAGC ACGTGGCAAT TACCTCTACG CTGCCGGTGT TTGAAAATAT CGTGCCGGGG
CGTCCGGCGC TTTCGAAGCG CAACATGGAC ATCCTCTCGC CGCCCTCCAA AGAAGCCTAT
CTGTTTGCGC GCAACCGTCG CTACGCCACG TCCAAAGGGA ATGAAGCGCA ACTGTTTCGT
CGCGACATGG ATTTAGAAGT GGCTTTTGTC CGCGCTGGCG GGTTGCTGCT CGCCGGGCCC
GATCCCACCG GTAACGGGGG AACGTACCCA GGCTTCAGCG ATCAGCGTGA AATCGAGTTA
CTCGTGGAAG CTGGCTTTGC GCCAGTAGAA GCGATCAAAA TCGCGACCTT TAACGGTGCT
CTCTATATGG GCAAGCAGGA GAGCATCGGT TCACTTGGCG CAGGCAAGAA CGCCGATCTC
GTGGTGGTGA AGGGAAATCC GGCACAGAAG ATTGATGACA TCGAAAACGT TGAGATCGTC
TTCAAGGATG GGGTGGGTTA CGACTCTGCG AAGCTGATCG AATCAGTGCG CGGACGTTAC
GGACAATACT GA
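
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction like the one behind the GC content field. The helper names are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAAGCTGT TGCTTTCAAT ATCCGCACTC GTTTGCTGCA GCGCGATGCT CTCCGCGCAG"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Passing the full 1452 bp sequence through the same two functions should give a value comparable to the reported 58%, although the database's exact rounding convention is not stated here.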
 
Protein sequence
MKLLLSISAL VCCSAMLSAQ GKLSEDVQRY VKVNSARVVL EHVRVIDGTG KAPVEDQNVV 
IENGKITAIQ LGADVKAGAN ETVLDLRGST VFPGIVGMHD HMYYIARPNL AADGSSEPPL
IVPQMTFTSP RLYLAAGVTT LRTTGSVEPY TDLNLRDLIN KGELVGPHMD VTGPYLEGSG
SPFMQMHPLK DAEDARKTVA FWADQGATSF KAYMNITRDE LKAAIDEAHR RGLKITGHLC
SVTYPEAADL GIDDLEHGFW VNTQLDPDKA PDVCSKAAGG PTLEKMDPNG AEAKALIEKL
VSKHVAITST LPVFENIVPG RPALSKRNMD ILSPPSKEAY LFARNRRYAT SKGNEAQLFR
RDMDLEVAFV RAGGLLLAGP DPTGNGGTYP GFSDQREIEL LVEAGFAPVE AIKIATFNGA
LYMGKQESIG SLGAGKNADL VVVKGNPAQK IDDIENVEIV FKDGVGYDSA KLIESVRGRY
GQY
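
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below builds the codon table from the standard NCBI base ordering (T, C, A, G). It relies on translation table 11 assigning the same amino acids to codons as the standard code and differing only in which start codons are permitted, which does not matter here because this gene starts with ATG. The function name is illustrative, and translation stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 12 bases of the gene sequence above.
print(translate("ATGAAGCTGT TGCTTTCAAT ATCCGCACTC"))  # MKLLLSISAL
print(translate("GGACAATACT GA"))                     # GQY
```

The two fragments translate to the first ten and last three residues of the protein sequence listed above.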