Gene Information |
Locus tag | Acid345_4501 |
Symbol | |
ID | 4070179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5343356 |
End bp | 5344681 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637986540 |
Product | amidohydrolase |
Protein accession | YP_593575 |
Protein GI | 94971527 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
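The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration. It assumes that Start bp and End bp are 1-based and inclusive, and that Gene Length counts one terminal stop codon. The record does not state either convention, although both agree with the listed values. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


# Values copied from the record above.
length = gene_length(5343356, 5344681)
print(length)                  # 1326, matches "Gene Length"
print(protein_length(length))  # 441, matches "Protein Length"
```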
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.787584 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCATTAG CCGCGAGGGC CGCATGGTCC CGCACCCTAC TTCTAGTCTT TGCGCTGGTG ATGCTTGCTG CCACGCTCTC TGCACAGAGC GCCGGGCCGG CAAGTCAGTC CAGCGAATTC GTCATCAAGA ACGCCACCAT CCTCACGGCC TCGCACGGCC GCATTGAGCA CGGCTCCATT TACGTAAAAA ACGGCAAGAT TGCCGCTGTT GGTACCGATG TTTCTGCGCC CGCTGGCGTA CAAGCCGTCG ACGTCAACGG CGCTTTCGTT ACCCCCGGCA TCATCGACCC ACACTCGCAC ATGGCGCTTG ACGACGATGT CAACGAAGCC ACCAGCCCCG TCGTCCCGCA CATGATGATG AAGGACGCCT TCGTCTACAC CGACAAGGAG ATCTATCGCG CCCTCGCCGG CGGCGTCACC TCCGCCCTGC TCCTCCACGG GTCGGCCGAC ATGATCGGTG GTCAGGCCGT CGTGATCAAG ACCAAGTTCG GTCTCTCGCG CGACCAGATG CTCTTCCCTG GTGCGCCGCA ATCCATTAAG TTCGCCAGCG GCGAAAATCC CAAGCGCGTC TTCGGTAGCA AAGGTCAACT GCCTTCCACG CGCATGGGCA ACTTCGAAGT CATGCGCGAA GCCTTTATCC AGGCGCAGGA GTACCGCCGC GAGTGGGACG AATACAACGC AAAAGCGCAA AAGGGCGACA AGGACGCCAA GATGCCGCAT CGCGACCTGA AGCTCGAAGC CCTCGCCGAC GTTCTCCGCG GCAAGCTCCT GGTTCAGATC CACATTTACC GCGCTGACGA ATTCCTCACC GAAATCGCGC TGGCAAACGA GTTCGGTTAT AAGATTCGTG CCTTCCATCA CGCCCTAGAG GCCTACAAGG TTCCTGACGA GATCGCCAAG TCCGGCGCCG CTATCGCTAC TTTCAGCGAC TGGTGGGGCT ACAAGTACGA AGCCTTCGAT GCAATTCCCT GGAACGCCAC CATGGCCATG CGTCACGGCG TTCGTGTGGC GATCAAGAGT GACTCTGACG ATTACATTCG TCGCTTAAAT CAGGAAGCCG CAAAGACCAT GCGTTACGGC GGCGCAACCG AAGACGAAGC CATCAAGATG ATCACCATCA ATCCGGCGTG GATCATCGGC GTGGACGACA AGACCGGCTC CATCGACGTC GGCAAAGATG CCGACTTGGT TCTCTGGAAC AGCTACCCGC TCTCCAGCTA CGCACTCGCC GACAAGGTCT GGATCGACGG TCAGTTGTTC TTCGACCGCT CGACGCCAGG CTACGGTATG CCGAACTACA AGAGCGATCC TGAGGAGGGC CAGTAA
Protein sequence | MSLAARAAWS RTLLLVFALV MLAATLSAQS AGPASQSSEF VIKNATILTA SHGRIEHGSI YVKNGKIAAV GTDVSAPAGV QAVDVNGAFV TPGIIDPHSH MALDDDVNEA TSPVVPHMMM KDAFVYTDKE IYRALAGGVT SALLLHGSAD MIGGQAVVIK TKFGLSRDQM LFPGAPQSIK FASGENPKRV FGSKGQLPST RMGNFEVMRE AFIQAQEYRR EWDEYNAKAQ KGDKDAKMPH RDLKLEALAD VLRGKLLVQI HIYRADEFLT EIALANEFGY KIRAFHHALE AYKVPDEIAK SGAAIATFSD WWGYKYEAFD AIPWNATMAM RHGVRVAIKS DSDDYIRRLN QEAAKTMRYG GATEDEAIKM ITINPAWIIG VDDKTGSIDV GKDADLVLWN SYPLSSYALA DKVWIDGQLF FDRSTPGYGM PNYKSDPEEG Q
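Both sequences are printed in space-separated blocks of ten characters, so the blocks need to be joined before any analysis. The sketch below is a minimal, hedged example of that step, with a GC-fraction helper and a simple translation routine. All function names are hypothetical. The codon table is the standard one, whose amino-acid assignments are the same as those of translation table 11. Table 11 differs only in which codons may initiate translation, which does not matter here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Standard amino-acid assignments in TCAG codon order ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence from the record above.
fragment = clean_sequence("ATGTCATTAG CCGCGAGGGC")
print(translate(fragment))  # MSLAAR, the first six residues of the protein sequence
```

Applying clean_sequence to the full Gene sequence and Protein sequence fields, then comparing translate(gene) with the cleaned protein string, gives one way to check that the two fields agree.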