Gene Acid345_1139 details


Gene Information

Locus tag: Acid345_1139
Symbol:
ID: 4069910
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 1417932
End bp: 1418942
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 50%
IMG OID: 637983149
Product: metallo-beta-lactamase superfamily hydrolase
Protein accession: YP_590216
Protein GI: 94968168
COG category: [R] General function prediction only
COG ID: [COG2333] Predicted hydrolase (metallo-beta-lactamase superfamily)
TIGRFAM ID:
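
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the listed gene length includes the terminal stop codon. Both assumptions agree with the values in this record.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the stop codon is counted in the gene length unless told otherwise.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    return codons - 1 if has_stop_codon else codons


gene_len = span_length(1417932, 1418942)
print(gene_len)                  # 1011
print(protein_length(gene_len))  # 336
```

Both results match the Gene Length and Protein Length fields listed above.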


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.187703
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0577692
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAACCG TTCACTTTCT GAATGTGAAA AATGGTGACT GTTCGATCAT CCAACACAAC 
TCTGGCCGAG TTACGATGAT CGACGTTTGT AATGCCAGCC CGACTGAGGA ACTGAAGGAG
CAGATAATCA AGGCGGCTGC GGATTTGCAA AAGGGCGTGA ATGGTGATTT CAGACAGAAG
CTCTTCCCGG TGAATCCTGT CTCCTATTTG AAGCAGCGAG AGATCGCGTC TATTTTTCGG
TTCATCATCA CCCATCCGGA CATGGATCAC ATGGACGGGA TCAAGTCAGT TTTTGACGCG
TTCTCTCCCC TCAATTTTTG GGATCACGAC AATAACAAAG AGATGAACTT CGAGGACGGT
AGTCCATACG ACGAGGACGA TTGGAACTTC TATACCCTAT TGAGAGACGG GAAACCCGAG
CACGACCCCA AAAGACTCAC CTTGTTCTCT GGAACGGCGG GCCAGTACTA CAACCAAGCA
GAGGGTGGGG GCCAAGGTGG CGACGGATTA ACGATACTGG CACCCACGTC CGGTTTGGTT
ACACAGGCAT GTGAGGATAA AGAGTTTAAT GATTGCTCCT ACGTCCTTCT CTACCGAACG
GGAAATAACA AGATCATTTT CGCAGGCGAC TCCGAAGACG CGACGTGGGA ACACATATTG
GCAAATCATG GGGATGCGGT ACGAGACGTT GACGTCCTAA TTGCACCTCA TCATGGGCGG
GATTCAGGAC GAGATTGGGA ATTTCTCGAT GTGCTAAAGC CAGCATTGAC GTTATTCGGA
AATGCAAGCT CCGAACACCT CGCCTATCAA GCATGGTGGA ATCGCGGGCT GCCTATCATC
ACGAACAACC AAGCGAACTG TATTGTGCTG AACCCGAGCG TTTCTCCGAT CGATGTATAC
GTCACATGTG AGCAGTTTGC GCGACTTGCG AACCCATATA CTCAGTATTC CGCCTTTTAT
GACGCCTATT ACTGGGGCAG CGCAGCAAGA AAGAAGGCTG TCGGTAAATA G
 
Protein sequence
MATVHFLNVK NGDCSIIQHN SGRVTMIDVC NASPTEELKE QIIKAAADLQ KGVNGDFRQK 
LFPVNPVSYL KQREIASIFR FIITHPDMDH MDGIKSVFDA FSPLNFWDHD NNKEMNFEDG
SPYDEDDWNF YTLLRDGKPE HDPKRLTLFS GTAGQYYNQA EGGGQGGDGL TILAPTSGLV
TQACEDKEFN DCSYVLLYRT GNNKIIFAGD SEDATWEHIL ANHGDAVRDV DVLIAPHHGR
DSGRDWEFLD VLKPALTLFG NASSEHLAYQ AWWNRGLPII TNNQANCIVL NPSVSPIDVY
VTCEQFARLA NPYTQYSAFY DAYYWGSAAR KKAVGK
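
The sequences above are displayed in space-separated blocks of 10 residues, so the spacing has to be stripped before a length or GC content can be computed. A minimal sketch follows. The helper names are hypothetical, and it assumes GC content is simply the percentage of G and C bases over the whole sequence, which may differ from how the database rounds its reported value.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display row of the gene sequence, copied from the record above.
first_row = "ATGGCAACCG TTCACTTTCT GAATGTGAAA AATGGTGACT GTTCGATCAT CCAACACAAC"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Pasting the full gene sequence in place of `first_row` would let the result be compared with the Gene Length and GC content fields.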