Gene Acid345_4501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_4501 
Symbol 
ID4070179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp5343356 
End bp5344681 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content59% 
IMG OID637986540 
Productamidohydrolase 
Protein accessionYP_593575 
Protein GI94971527 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.787584 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTAG CCGCGAGGGC CGCATGGTCC CGCACCCTAC TTCTAGTCTT TGCGCTGGTG 
ATGCTTGCTG CCACGCTCTC TGCACAGAGC GCCGGGCCGG CAAGTCAGTC CAGCGAATTC
GTCATCAAGA ACGCCACCAT CCTCACGGCC TCGCACGGCC GCATTGAGCA CGGCTCCATT
TACGTAAAAA ACGGCAAGAT TGCCGCTGTT GGTACCGATG TTTCTGCGCC CGCTGGCGTA
CAAGCCGTCG ACGTCAACGG CGCTTTCGTT ACCCCCGGCA TCATCGACCC ACACTCGCAC
ATGGCGCTTG ACGACGATGT CAACGAAGCC ACCAGCCCCG TCGTCCCGCA CATGATGATG
AAGGACGCCT TCGTCTACAC CGACAAGGAG ATCTATCGCG CCCTCGCCGG CGGCGTCACC
TCCGCCCTGC TCCTCCACGG GTCGGCCGAC ATGATCGGTG GTCAGGCCGT CGTGATCAAG
ACCAAGTTCG GTCTCTCGCG CGACCAGATG CTCTTCCCTG GTGCGCCGCA ATCCATTAAG
TTCGCCAGCG GCGAAAATCC CAAGCGCGTC TTCGGTAGCA AAGGTCAACT GCCTTCCACG
CGCATGGGCA ACTTCGAAGT CATGCGCGAA GCCTTTATCC AGGCGCAGGA GTACCGCCGC
GAGTGGGACG AATACAACGC AAAAGCGCAA AAGGGCGACA AGGACGCCAA GATGCCGCAT
CGCGACCTGA AGCTCGAAGC CCTCGCCGAC GTTCTCCGCG GCAAGCTCCT GGTTCAGATC
CACATTTACC GCGCTGACGA ATTCCTCACC GAAATCGCGC TGGCAAACGA GTTCGGTTAT
AAGATTCGTG CCTTCCATCA CGCCCTAGAG GCCTACAAGG TTCCTGACGA GATCGCCAAG
TCCGGCGCCG CTATCGCTAC TTTCAGCGAC TGGTGGGGCT ACAAGTACGA AGCCTTCGAT
GCAATTCCCT GGAACGCCAC CATGGCCATG CGTCACGGCG TTCGTGTGGC GATCAAGAGT
GACTCTGACG ATTACATTCG TCGCTTAAAT CAGGAAGCCG CAAAGACCAT GCGTTACGGC
GGCGCAACCG AAGACGAAGC CATCAAGATG ATCACCATCA ATCCGGCGTG GATCATCGGC
GTGGACGACA AGACCGGCTC CATCGACGTC GGCAAAGATG CCGACTTGGT TCTCTGGAAC
AGCTACCCGC TCTCCAGCTA CGCACTCGCC GACAAGGTCT GGATCGACGG TCAGTTGTTC
TTCGACCGCT CGACGCCAGG CTACGGTATG CCGAACTACA AGAGCGATCC TGAGGAGGGC
CAGTAA
 
Protein sequence
MSLAARAAWS RTLLLVFALV MLAATLSAQS AGPASQSSEF VIKNATILTA SHGRIEHGSI 
YVKNGKIAAV GTDVSAPAGV QAVDVNGAFV TPGIIDPHSH MALDDDVNEA TSPVVPHMMM
KDAFVYTDKE IYRALAGGVT SALLLHGSAD MIGGQAVVIK TKFGLSRDQM LFPGAPQSIK
FASGENPKRV FGSKGQLPST RMGNFEVMRE AFIQAQEYRR EWDEYNAKAQ KGDKDAKMPH
RDLKLEALAD VLRGKLLVQI HIYRADEFLT EIALANEFGY KIRAFHHALE AYKVPDEIAK
SGAAIATFSD WWGYKYEAFD AIPWNATMAM RHGVRVAIKS DSDDYIRRLN QEAAKTMRYG
GATEDEAIKM ITINPAWIIG VDDKTGSIDV GKDADLVLWN SYPLSSYALA DKVWIDGQLF
FDRSTPGYGM PNYKSDPEEG Q