Gene Acid345_0988 details


Gene Information

Locus tag: Acid345_0988
Symbol:
ID: 4068655
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1252916
End bp: 1254214
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 60%
IMG OID: 637982995
Product: amidohydrolase 2
Protein accession: YP_590065
Protein GI: 94968017
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
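
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of residues in an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon, which is not translated.
    return gene_len_bp // 3 - 1


length = gene_length(1252916, 1254214)
print(length)                           # 1299
print(expected_protein_length(length))  # 432
```

Both values agree with the Gene Length and Protein Length fields of this record.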


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00393632
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCGCATCC TTGTGACGTT TGCCGTGATG GTGTTGTGCG CTACGGGTTT CGCTCAGGAA 
CCCGACGCCG AAATTGCGCA CCACATCGAA TCCATCAAGG CGATCGACAA CCATTCGCAC
GTGATCGCCG CCGATCCCGC CGACAAAGGT TTCGACCAAT TGCGTTGTGA GATGCTGCCC
GACAGCGGCA TTGGCGCCGC CAGCCAGCGC TATCCCAACC CTGACTGGAT GAACGCCATC
CACGCGCTCT ACGGATTCAC CCCGAAAGAC GGCAGCGACG CCGAAATGAA GCGCGTAGAC
GATGCCCGCG CCGCCGAGAT GCAACACCAC GGCGACCAGC GGTGGGTGCT CGACAAAGCC
GGCATCGGCA CGGTCCTCGC CAATCGTCTC GATCTCACGC CCGAGATGAA AGCCCCGCGC
GTGCTCTGGG TCCCGTATGA GGATGCGCTG CTTTTCCCGC TGAACAATAC GGGCGAGAAG
TCGGTGAACC CCGATCGCAA AGCGTTGTTT GAAATGGCCG AGCACCTGCA AACCCACTAC
CTAGAACTAG CGGGCCTGAA GAAGCTCCCG CCAACGCTCG ATCAATACGT GAAGCAGGTC
CTGGTTCCGA CGCTCGAGCG TCAGAGAAAA GGTGGCGCCG TTGCGTTGAA ATTTGAAGCC
GCGTATCTCC GCGCGCTCGA CTTTGAGCCA GTGCTGCCGT ACCAGGCCCA ACAGGTGTAC
GCGAAGCACG TCAACGGTTC TATCGCGCAA CCCGCGGATT ACAAACTGCT TCAGGACTAC
CTCTTCAAGC AGATTGCGCT CGAAGCTGGG AAACTCGGAA TGGCGGTCCA CATCCACACC
GGTAGCGGTT GCGGCGCCTT CTTCAACGAT CCCGGAGCTG ACGCGGTTCT GCTCTCGCCC
ATGCTCAACG ATCCCGACCT GCGCAAAACA AACTTCGTCC TGCTGCACGG CAATTGGACG
CAGGAACGCA AAGTCATCGG CCTCATCCTC AAGCCGAATG TCTACGTGGA TACGTCGCTG
ATCGAGTACT TCCTCACGCC GCGCGAATAC GCAGAGATCC TGAAGTCGTG GCTCGAACAA
ATGCCCGAGC GCGTCCTCTT CGGTACCGAC GCCTCGCCCG GCGGCCCCGG CCAGAACTGG
CCCGAAACCA CACTATGGGG CGCGGCAAAG TTCCGCCGCT CGCTGGCAAT CGCTCTGACT
GAGATGGTGC GAGAGGGAAG TATCGACAAG CAACGCGCGA AGGAGATTGC GGACCTCGTG
CTGCGCGAAA ACGCTGCCAA GCTCTACGCC GTGAAGTAA
 
Protein sequence
MRILVTFAVM VLCATGFAQE PDAEIAHHIE SIKAIDNHSH VIAADPADKG FDQLRCEMLP 
DSGIGAASQR YPNPDWMNAI HALYGFTPKD GSDAEMKRVD DARAAEMQHH GDQRWVLDKA
GIGTVLANRL DLTPEMKAPR VLWVPYEDAL LFPLNNTGEK SVNPDRKALF EMAEHLQTHY
LELAGLKKLP PTLDQYVKQV LVPTLERQRK GGAVALKFEA AYLRALDFEP VLPYQAQQVY
AKHVNGSIAQ PADYKLLQDY LFKQIALEAG KLGMAVHIHT GSGCGAFFND PGADAVLLSP
MLNDPDLRKT NFVLLHGNWT QERKVIGLIL KPNVYVDTSL IEYFLTPREY AEILKSWLEQ
MPERVLFGTD ASPGGPGQNW PETTLWGAAK FRRSLAIALT EMVREGSIDK QRAKEIADLV
LRENAAKLYA VK
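
The relationship between the two sequence listings can be illustrated with a short Python sketch. It joins the 10-character display blocks, translates the DNA codon by codon, and computes a GC fraction. It assumes the codon-to-amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its permitted start codons); the helper names are hypothetical. Only the first row of the gene sequence is used here to keep the example short.

```python
from itertools import product

# Standard codon assignments, ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence listed above.
first_row = "ATGCGCATCC TTGTGACGTT TGCCGTGATG GTGTTGTGCG CTACGGGTTT CGCTCAGGAA"
dna = join_blocks(first_row)
print(translate(dna))    # MRILVTFAVMVLCATGFAQE
print(gc_fraction(dna))  # GC fraction of this 60-base row only
```

The translated output matches the first 20 residues of the protein sequence. Running the same functions on the full gene sequence would be the way to compare against the Protein Length and GC content fields.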