Gene Acid345_3982 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3982 
Symbol 
ID4072455 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4709697 
End bp4711034 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content62% 
IMG OID637986009 
Productamidohydrolase 
Protein accessionYP_593056 
Protein GI94971008 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACCA AGTTCGTGCT CGCGCTCTTA CTCACCACGC TTGCTTTCGC GCAACAACCC 
AAGCCGAAGC CCGTGCCCAA GGTCACCTAC ATCCGCGCCG GCCACCTCTT CGACGCCACC
TCCGACAATC GTCGCGACAA TGTGGTCATC GTCGTGGAAG GCGACCGTAT TAAATCCGTG
GAAGCCGGGA GCTTCGCGAT CCCGAACGGA TCGACAGTTA TCGATCTCCG CAACGCTACA
GTGCTGCCCG GCCTGATTGA TTGCCACACC CACCTGACCG CGCGCGCCGA CCGCTACGAT
CCCATCAATT ACTTCAAGGG AACGCCGTTC ACCGAGGGGT TTGCCGCGGT GCGTAACGCA
CATTCCACGC TGCTCGCGGG ATTCACCACG GTGCGCGATG TCGGATCGCC GCCGTTTGCC
GCCGTGGACC TGCGCAATGC CATCAATGAT GGCTTTATCT CCGGCCCGCG CGTGGTTGCC
AGCGGCCCAC CACTCTCCAT CACCGGCGGA CACGGCGACG TGAACGGATT TTCGCCCGAA
ACCCGCGTCA CCCTGTTCCC CGACCAGCGC GACTTCCGCA TTGCTGACGG TGTGGACCAG
GTCCGCCAGA CGGTGCGCGC GCAGGTGAAA TACGGCGTTG ACGTAATTAA AGTCCTCGCT
ACCGGCGGCG TCCTTTCGCA AGGCGATAGC CCCGGCGCGC CGCAATTCAC CTTTGAAGAA
CTTAAAACTG CCGCCGACGA AGCCCACGCT GCGGGCCGCA AGGTTGCCGC GCACGCCCAC
GGCGCCGAAG GCATCAAGCG CGCCATACTC GCGGGCATTG ATTCCATCGA GCACGCCTCG
CTCGCCAACG ACGAAGACAT CGCGCTTGCC AAGGAGCACG GCACGTATTT CGTCATGGAC
ATCTACAACG ACGACTACAT TCTCGGCAAA GCCGTTGAAT TCGGCCTGCC CGCGGCGAAC
GTCGAGAAAG AAAAGATGGT CGGCCGCACC CAGCGCGAGA ACTTTGAGAA GGCGTTCAAA
GCGGGCGTGA AGATGGCGTT CGGCACCGAC GCCGGCGTCT ACCCGCACGG CGACAACGCG
AAGCAGTTTA AATACATGGT GCAATTCGGC ATGACCCCTG CGCAGGCGAT CCGCGCCGCC
ACGTTCAATG CCGCCGACCT CATCGGCCGC AGTAAAGACG TCGGCACCGT CGAAGCCGGG
AAATTCGCCG ACATCATCGC CGTCAGCGAC GATCCGCTCG CGAACGTGCA GGCGCTGGAG
AACGTGCAGT TCGTGATGAA GGGCGGGGTG GTTTACAAGG ACAAGATCGC GAATACGGCG
GTCGCTGCAA TCGAATAA
 
Protein sequence
MKTKFVLALL LTTLAFAQQP KPKPVPKVTY IRAGHLFDAT SDNRRDNVVI VVEGDRIKSV 
EAGSFAIPNG STVIDLRNAT VLPGLIDCHT HLTARADRYD PINYFKGTPF TEGFAAVRNA
HSTLLAGFTT VRDVGSPPFA AVDLRNAIND GFISGPRVVA SGPPLSITGG HGDVNGFSPE
TRVTLFPDQR DFRIADGVDQ VRQTVRAQVK YGVDVIKVLA TGGVLSQGDS PGAPQFTFEE
LKTAADEAHA AGRKVAAHAH GAEGIKRAIL AGIDSIEHAS LANDEDIALA KEHGTYFVMD
IYNDDYILGK AVEFGLPAAN VEKEKMVGRT QRENFEKAFK AGVKMAFGTD AGVYPHGDNA
KQFKYMVQFG MTPAQAIRAA TFNAADLIGR SKDVGTVEAG KFADIIAVSD DPLANVQALE
NVQFVMKGGV VYKDKIANTA VAAIE