Gene Acid345_0971 details


Gene Information

Locus tag: Acid345_0971
Symbol:
ID: 4072959
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1230248
End bp: 1232248
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 59%
IMG OID: 637982978
Product: amidohydrolase 3
Protein accession: YP_590048
Protein GI: 94968000
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
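
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
# Hypothetical helpers for sanity-checking the record; names are illustrative.

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1230248, 1232248)
print(length, protein_length(length))  # prints: 2001 666
```

Both values match the Gene Length and Protein Length fields of the record.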


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.59323
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0406107
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTTTG CTCGTTTTCT CCTGGTGTGC TTTTTCCTCT CTGGCGCGAT CGTGTTGGCG 
CAGCAGACTG CCACCTCCGG CAGTTTCGTC CTGCATAAGT TCGCTCGTCC CATCGGCAGT
GAGACCTATT CCATCGCCAC GGACAAAGAC AGCTACACGC TGACTTCACA TTTTCTGTTT
ACGGATCGCG GTACGAAGGT CCCGCTGGAG ACTACGTTTG TCGCCGGCAC GCGCGACATG
GCGCCGCGCT CGTTCAGCGC GAAGGGCAAG CCCTCGCGTC AGGCTGAGAT GGATGACTCG
GTTACAGTGG CGGGCGACAC CGTATCCATC ACGCGTAGCG GCAAATCCGA GACGCAAAAA
GCCGACAAAT CATGGTTCGT TGTGGACGGA TATTCACCGG TGGCGATGCA GGAACAGATG
ATGCGCTGGT GGCTGAAGCA TGGAAAGCCA CAGGAGTTCA CTGCTTATCC ATCGAAGGCG
ACGGTTCGTA TCACCCCTGC GGGAACGCTG GCAATCGACG GTAAGGCGAC GCACGGATAC
ACGGTAAGTG GCTTGATCTG GGGACAGGAA TCGTTGTGGA TGGACGACGC GCAGAACCTG
GTCGCGCTGG TCAGCATCGA TGCGGAATTC GATCACTTCG AAGCCGTCCG CGAGAAGTAT
GCGAAGAGCC TCAATCTGTT TATTGCGGAC GCGGTGAAGG CCGATCTTGC GAATTTGAAG
AAGCTGAGTG CGACCGCGCG CATGGCTCCA TCACGGCGGC TCGCTATCGT CGGCGCGACG
ATCGAAGACT CGATTGCGCC GCCGATCCAA AATGGCGTGA TCTTGATCGA AGATGGAGTT
ATTCGAGCTG TGGGCCCGAA AGACCAGGTC ACGATACCCA GCGATGCGAA AGTGCTAGAC
GCCACCGGCA AGTTCGCAGT CCCGGGACTG TGGGACATGC ATGCTCACTA CGAACAGGTG
GAGTGGGGAC CGATTTACCT TGCCGCTGGC GTGACCACGG TCCGCGACGT CGGGAATGAG
TTTGAGTTTA TCCAGACACT TCATGACGAA CTTGATCGCA AGCAGGATCC CGCGATTGGT
CCGCACCTTG AATTTGCGGG CGTGATTGAC GGATCGGGAC AATTGACGAT CGGCGTGACC
ATTGCCGACA CGCCCGAGCA GGCGCGGGAA TGGGTGGACA AATATGCATC TGCGGGTGCA
AGGCAAATCA AGATCTACAG CTCAGTGAAG CCGGAGATCG TGAAGGCGAT TACCACCGAA
GCACACGCAA AAGGGATGAC CGTAACTGGC CATATCCCTG AAGGAATGAC GGCGATTCAG
GGCATCCACC TTGGAATGGA CCAGATCAAT CACATCAGCT ACGAACTGCA GTACTCGACC
CGTCCCATCT TCGGCGCTGA TGGCAAACCG GACCGTTCCA AGCCGGCGGT GCTCGAATTG
GAAGGGGCGC GGATGAAGGA CCTGGTCTCG ACCTTGCAAG CACACCACAC CGTCCTCGAC
CCGACGGCGG CGTTGTATGA GAGCTTCTCG ATTACGGTGC CGCTCCACGA AGTTGAGCCG
GGCGTCGACC ACCTTCCACC ACAATTGCGC GAGGCTTTGG ATAGTCCGCC GCCAACTGGA
GACCGCGCCG CAATTGCCGA TGCGCGAAGG AAGGCGATTA TCGCCACGCT GCGCGCGCTT
CACGAAGCAA AGGTCCCGAT CGTCGCCGGA ACCGACCAAG CCATTCCTGG ATATTCCCTG
CACCGCGAGC TGGAACTGTA CGTGGAGGCC GGCTTCACTC CGCAGGAAGC GATCCAGGCT
GCAACTATTG AGGCGGCGAG GGCCGTGGGC GTGGAGAAAG AGTCGGGTTC ACTGGAAGCC
GGAAAACGCG GCGACGTTCT GCTGCTGAAC GCCGACCCGC TCGCCGACAT TCACAACACA
CGTAAAGTCT GGCGAACGGT GGCGGCTGGC GCAGTGTACG ATCCGGCGCC GCTGTGGCAG
GTGGTAGGGT TCCTGCCGTA A
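
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. As a rough sketch (the helper names are hypothetical), the blocks can be concatenated and the G+C fraction computed for comparison with the GC content field; only the first two blocks are used here to keep the example short.

```python
# Hypothetical helpers; not taken from the source database.

def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, copied from the record.
first_blocks = "ATGCGTTTTG CTCGTTTTCT"
seq = clean_sequence(first_blocks)
print(len(seq), gc_fraction(seq))  # prints: 20 0.4
```

Passing the full gene sequence through the same two functions would give the value to compare against the reported 59%.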
 
Protein sequence
MRFARFLLVC FFLSGAIVLA QQTATSGSFV LHKFARPIGS ETYSIATDKD SYTLTSHFLF 
TDRGTKVPLE TTFVAGTRDM APRSFSAKGK PSRQAEMDDS VTVAGDTVSI TRSGKSETQK
ADKSWFVVDG YSPVAMQEQM MRWWLKHGKP QEFTAYPSKA TVRITPAGTL AIDGKATHGY
TVSGLIWGQE SLWMDDAQNL VALVSIDAEF DHFEAVREKY AKSLNLFIAD AVKADLANLK
KLSATARMAP SRRLAIVGAT IEDSIAPPIQ NGVILIEDGV IRAVGPKDQV TIPSDAKVLD
ATGKFAVPGL WDMHAHYEQV EWGPIYLAAG VTTVRDVGNE FEFIQTLHDE LDRKQDPAIG
PHLEFAGVID GSGQLTIGVT IADTPEQARE WVDKYASAGA RQIKIYSSVK PEIVKAITTE
AHAKGMTVTG HIPEGMTAIQ GIHLGMDQIN HISYELQYST RPIFGADGKP DRSKPAVLEL
EGARMKDLVS TLQAHHTVLD PTAALYESFS ITVPLHEVEP GVDHLPPQLR EALDSPPPTG
DRAAIADARR KAIIATLRAL HEAKVPIVAG TDQAIPGYSL HRELELYVEA GFTPQEAIQA
ATIEAARAVG VEKESGSLEA GKRGDVLLLN ADPLADIHNT RKVWRTVAAG AVYDPAPLWQ
VVGFLP