Gene Acid345_1841 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1841 
Symbol 
ID4072902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2218847 
End bp2219827 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content56% 
IMG OID637983850 
Productamidohydrolase 2 
Protein accessionYP_590916 
Protein GI94968868 
COG category[R] General function prediction only 
COG ID[COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.921197 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCGGAC TGCGCCGCAC ATCAGTTCTC TGCATATTTC TTGCTCTCCT TCCCTTCGCT 
ACTGCGCAAT CGTCTAACGC CCGAGCTGTT CAATCGTGGC GGGCCGATCA CCACATGCAT
TTATCGTCAG CCGATCTGTG CGCGCGCCTT GGTGATTGTC CTGATTGCGA GTGTCTCAAA
TCCGATCAGC CCCCAGCGGT GCTTGCCGCC GACGCGATAA AAGCTCTCGA TGACGCACAT
GTCTCGAAGG GCGTGATCTT GTCGGGCGCC TATTTGTATG CGAGGCCGTC GGTCCATCTT
TCCGCAGGCG AGACAGCCAA GAAGGTTCGT TTGGAAAATG AGTTCACGGC CGCCGAAGTG
GCAAAGTATC CCAAGCGACT GGTTGGGTTT TTCTCCGTGA ATCCGTTGCA GGATTCAGCC
GTCGAGGAAG TCCGCTATTG GGGTGCGAAG TCGCAGTTCG CCGGACTTAA GCTGCACTTC
AACGCGTCCG CGGTGAACGT CAGGAACGCG GAGGACCGAA AGAAAGTAAG CCGCATCCTG
GCAGAAGCAG CGAAAAAAGG CCTACCGATG GTGATTCACG TGGGAGGCGG AAACTTCAAC
GCATCCGACG CAGAGTTGTT CATCACCGAG ATTCTCCCCA GTGCCGGCGA TTCATGGGTA
CAGATCGCGC ACGCCGGTGG AGGTATGCCG AGCCGCAATG GGAATAATCT CGCGGTCCTG
CGCACCTTTG GAGACCACAT CGTGAGGAAC GACCCGCGGA CGCGAAGGAT ACTTTTTGAT
TTGTCATTTG TTCCGGCGCC AGATGACAGC CCACAGGGAT TCGCTCAGGA GATCCGGAGG
ATCGGGTTTA AACATTTTGT GTTCGGATCG GATTTCAGTG TCCAGATGCC GAGCGACGCG
ATCGTGAATT TGAAGCGGCT AGGACTGTCA GCGGAAGAGA TGCAGACTTT GAGTCAGAAT
TGTGCGCCAT GGGCGTGCTG A
 
Protein sequence
MIGLRRTSVL CIFLALLPFA TAQSSNARAV QSWRADHHMH LSSADLCARL GDCPDCECLK 
SDQPPAVLAA DAIKALDDAH VSKGVILSGA YLYARPSVHL SAGETAKKVR LENEFTAAEV
AKYPKRLVGF FSVNPLQDSA VEEVRYWGAK SQFAGLKLHF NASAVNVRNA EDRKKVSRIL
AEAAKKGLPM VIHVGGGNFN ASDAELFITE ILPSAGDSWV QIAHAGGGMP SRNGNNLAVL
RTFGDHIVRN DPRTRRILFD LSFVPAPDDS PQGFAQEIRR IGFKHFVFGS DFSVQMPSDA
IVNLKRLGLS AEEMQTLSQN CAPWAC